12.07.2015 Views

Fundamental Limitations in Filtering and Control

Fundamental Limitations in Filtering and Control

Fundamental Limitations in Filtering and Control

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

María M. SeronJulio H. BraslavskyGraham C. Goodw<strong>in</strong><strong>Fundamental</strong><strong>Limitations</strong> <strong>in</strong>Filter<strong>in</strong>g <strong>and</strong> <strong>Control</strong>With 114 Figures


This book was orig<strong>in</strong>ally Published by Spr<strong>in</strong>ger-Verlag London Limited<strong>in</strong> 1997. The present PDF file fixes typos found until December 6, 2004.Spr<strong>in</strong>ger Copyright NoticeMaría M. Seron, PhDJulio H. Braslavsky, PhDGraham C. Goodw<strong>in</strong>, ProfessorSchool of Electrical Eng<strong>in</strong>eer<strong>in</strong>g <strong>and</strong> Computer Science,The University of Newcastle,Callaghan, New South Wales 2308, AustraliaSeries EditorsB.W. Dick<strong>in</strong>son • A. Fettweis • J.L. Massey • J.W. Modest<strong>in</strong>oE.D. Sontag • M. ThomaISBN 3-540-76126-8 Spr<strong>in</strong>ger-Verlag Berl<strong>in</strong> Heidelberg New YorkBritish Library Catalogu<strong>in</strong>g <strong>in</strong> Publication DataA catalogue record for this book is available from the British LibraryApart from any fair deal<strong>in</strong>g for the purposes of research or private study, or criticismor review, as permitted under the Copyright, Designs <strong>and</strong> Patents Act 1988,this publication may only be reproduced, stored or transmitted, <strong>in</strong> any form or byany means, with the prior permission <strong>in</strong> writ<strong>in</strong>g of the publishers, or <strong>in</strong> the caseof reprographic reproduction <strong>in</strong> accordance with the terms of licenses issued bythe Copyright Licens<strong>in</strong>g Agency. Enquires concern<strong>in</strong>g reproduction outside thoseterms should be sent to the publishers.c○ Spr<strong>in</strong>ger-Verlag London Limited 1997Pr<strong>in</strong>ted <strong>in</strong> Great Brita<strong>in</strong>The use of registered names, trademarks, etc. <strong>in</strong> this publication does not imply,even <strong>in</strong> the absence of a specific statement, that such names are exempt from therelevant laws <strong>and</strong> regulations <strong>and</strong> therefore free for general use.The publisher makes no representation, express or implied, with regard to theaccuracy of the <strong>in</strong>formation conta<strong>in</strong>ed <strong>in</strong> this book <strong>and</strong> cannot accept any legalresponsibility or liability for any errors or omissions that may be made.Typesett<strong>in</strong>g: Camera ready by authorsPr<strong>in</strong>ted <strong>and</strong> bound at the Athenæum Press Ltd, Gateshead69/3830-543210 Pr<strong>in</strong>ted on acid-free paper


PrefaceThis book deals with the issue of fundamental limitations <strong>in</strong> filter<strong>in</strong>g <strong>and</strong>control system design. This issue lies at the very heart of feedback theorys<strong>in</strong>ce it reveals what is achievable, <strong>and</strong> conversely what is not achievable,<strong>in</strong> feedback systems.The subject has a rich history beg<strong>in</strong>n<strong>in</strong>g with the sem<strong>in</strong>al work of Bodedur<strong>in</strong>g the 1940’s <strong>and</strong> as subsequently published <strong>in</strong> his well-known bookFeedback Amplifier Design (Van Nostr<strong>and</strong>, 1945). An <strong>in</strong>terest<strong>in</strong>g fact is that,although Bode’s book is now fifty years old, it is still extensively quoted.This is supported by a science citation count which rema<strong>in</strong>s comparablewith the best contemporary texts on control theory.Interpretations of Bode’s results <strong>in</strong> the context of control system designwere provided by Horowitz <strong>in</strong> the 1960’s. For example, it has been shownthat, for s<strong>in</strong>gle-<strong>in</strong>put s<strong>in</strong>gle-output stable open-loop systems hav<strong>in</strong>g relativedegree greater than one, the <strong>in</strong>tegral of the logarithmic sensitivitywith respect to frequency is zero. This result implies, among other th<strong>in</strong>gs,that a reduction <strong>in</strong> sensitivity <strong>in</strong> one frequency b<strong>and</strong> is necessarily accompaniedby an <strong>in</strong>crease of sensitivity <strong>in</strong> other frequency b<strong>and</strong>s. Althoughthe orig<strong>in</strong>al results were restricted to open-loop stable systems, they havebeen subsequently extended to open-loop unstable systems <strong>and</strong> systemshav<strong>in</strong>g nonm<strong>in</strong>imum phase zeros.The orig<strong>in</strong>al motivation for the study of fundamental limitations <strong>in</strong>feedback was control system design. However, it has been recently realizedthat similar constra<strong>in</strong>ts hold for many related problems <strong>in</strong>clud<strong>in</strong>gfilter<strong>in</strong>g <strong>and</strong> fault detection. To give the flavor of the filter<strong>in</strong>g results, considerthe frequently quoted problem of an <strong>in</strong>verted pendulum. It is well


viPrefaceknown that this system is completely observable from measurements ofthe carriage position. What is less well known is that it is fundamentallydifficult to estimate the pendulum angle from measurements of the carriageposition due to the location of open-loop nonm<strong>in</strong>imum phase zeros<strong>and</strong> unstable poles. M<strong>in</strong>imum sensitivity peaks of 40 dB are readily predictableus<strong>in</strong>g Poisson <strong>in</strong>tegral type formulae without need<strong>in</strong>g to carry outa specific design. This clearly suggests that a change <strong>in</strong> the <strong>in</strong>strumentationis called for, i.e., one should measure the angle directly. We see, <strong>in</strong> this example,that the fundamental limitations po<strong>in</strong>t directly to the <strong>in</strong>escapablenature of the difficulty <strong>and</strong> thereby elim<strong>in</strong>ate the possibility of expend<strong>in</strong>geffort on various filter design strategies that we know, ab <strong>in</strong>itio, aredoomed to failure.Recent developments <strong>in</strong> the field of fundamental design limitations <strong>in</strong>cludeextensions to multivariable l<strong>in</strong>ear systems, sampled-data systems,<strong>and</strong> nonl<strong>in</strong>ear systems.At this po<strong>in</strong>t <strong>in</strong> time, a considerable body of knowledge has been assembledon the topic of fundamental design limitations <strong>in</strong> feedback systems.It is thus timely to summarize the key developments <strong>in</strong> a modern <strong>and</strong>comprehensive text. This has been our pr<strong>in</strong>cipal objective <strong>in</strong> writ<strong>in</strong>g thisbook. We aim to cover all necessary background <strong>and</strong> to give new succ<strong>in</strong>cttreatments of Bode’s orig<strong>in</strong>al work together with all contemporary results.The book is organized <strong>in</strong> four parts. The first part is <strong>in</strong>troductory <strong>and</strong> itconta<strong>in</strong>s a chapter where we cover the significance <strong>and</strong> history of designlimitations, <strong>and</strong> motivate future chapters by analyz<strong>in</strong>g design limitationsaris<strong>in</strong>g <strong>in</strong> the time doma<strong>in</strong>.The second part of the book is devoted to design limitations <strong>in</strong> feedbackcontrol systems <strong>and</strong> is divided <strong>in</strong> five chapters. In Chapter 2, wesummarize the key concepts from the theory of control systems that willbe needed <strong>in</strong> the sequel. Chapter 3 exam<strong>in</strong>es fundamental design limitations<strong>in</strong> l<strong>in</strong>ear s<strong>in</strong>gle-<strong>in</strong>put s<strong>in</strong>gle-output control, while Chapter 4 presentsresults on multi-<strong>in</strong>put multi-output control. Chapters 5 <strong>and</strong> 6 develop correspond<strong>in</strong>gresults for periodic <strong>and</strong> sampled-data systems respectively.Part III deals with design limitations <strong>in</strong> l<strong>in</strong>ear filter<strong>in</strong>g problems. Aftersett<strong>in</strong>g up some notation <strong>and</strong> def<strong>in</strong>itions <strong>in</strong> Chapter 7, Chapter 8 coversthe s<strong>in</strong>gle-<strong>in</strong>put s<strong>in</strong>gle-output filter<strong>in</strong>g case, while Chapter 9 studies themultivariable case. Chapters 10 <strong>and</strong> 11 develop the extensions to the relatedproblems of prediction <strong>and</strong> fixed-lag smooth<strong>in</strong>g.F<strong>in</strong>ally, Part IV presents three chapters with very recent results on sensitivitylimitations for nonl<strong>in</strong>ear filter<strong>in</strong>g <strong>and</strong> control systems. Chapter 12<strong>in</strong>troduces notation <strong>and</strong> some prelim<strong>in</strong>ary results, Chapter 13 covers feedbackcontrol systems, <strong>and</strong> Chapter 14 the filter<strong>in</strong>g case.In addition, we provide an appendix with an almost self-conta<strong>in</strong>ed reviewof complex variable theory, which furnishes the necessary mathematicalbackground required <strong>in</strong> the book.


PrefaceviiBecause of the pivotal role played by design limitations <strong>in</strong> the study offeedback systems, we believe that this book should be of <strong>in</strong>terest to research<strong>and</strong> practitioners from a variety of fields <strong>in</strong>clud<strong>in</strong>g <strong>Control</strong>, Communications,Signal Process<strong>in</strong>g, <strong>and</strong> Fault Detection. The book is selfconta<strong>in</strong>ed<strong>and</strong> <strong>in</strong>cludes all necessary background <strong>and</strong> mathematical prelim<strong>in</strong>aries.It would therefore also be suitable for junior graduate students<strong>in</strong> <strong>Control</strong>, Filter<strong>in</strong>g, Signal Process<strong>in</strong>g or Applied Mathematics.The authors wish to deeply thank several people who, directly or <strong>in</strong>directly,assisted <strong>in</strong> the preparation of the text. Our appreciation goesto Greta Davies for facilitat<strong>in</strong>g the authors the opportunity to completethis project <strong>in</strong> Australia. In the technical ground, <strong>in</strong>put <strong>and</strong> <strong>in</strong>sight wereobta<strong>in</strong>ed from Gjerrit Me<strong>in</strong>sma, Guillermo Gómez, Rick Middleton <strong>and</strong>Thomas Br<strong>in</strong>smead. The <strong>in</strong>fluence of Jim Freudenberg <strong>in</strong> this work is immense.


ContentsPrefaceContentsvixI Introduction 11 A Chronicle of System Design <strong>Limitations</strong> 31.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31.2 Performance <strong>Limitations</strong> <strong>in</strong> Dynamical Systems . . . . . . . 61.3 Time Doma<strong>in</strong> Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . 91.3.1 Integrals on the Step Response . . . . . . . . . . . . . 91.3.2 Design Interpretations . . . . . . . . . . . . . . . . . . 131.3.3 Example: Inverted Pendulum . . . . . . . . . . . . . . 161.4 Frequency Doma<strong>in</strong> Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . 181.5 A Brief History . . . . . . . . . . . . . . . . . . . . . . . . . . 191.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 21II <strong>Limitations</strong> <strong>in</strong> L<strong>in</strong>ear <strong>Control</strong> 232 Review of General Concepts 252.1 L<strong>in</strong>ear Time-Invariant Systems . . . . . . . . . . . . . . . . . 26


xContents2.1.1 Zeros <strong>and</strong> Poles . . . . . . . . . . . . . . . . . . . . . . 272.1.2 S<strong>in</strong>gular Values . . . . . . . . . . . . . . . . . . . . . . 292.1.3 Frequency Response . . . . . . . . . . . . . . . . . . . 292.1.4 Coprime Factorization . . . . . . . . . . . . . . . . . . 302.2 Feedback <strong>Control</strong> Systems . . . . . . . . . . . . . . . . . . . . 312.2.1 Closed-Loop Stability . . . . . . . . . . . . . . . . . . 322.2.2 Sensitivity Functions . . . . . . . . . . . . . . . . . . . 322.2.3 Performance Considerations . . . . . . . . . . . . . . 332.2.4 Robustness Considerations . . . . . . . . . . . . . . . 352.3 Two Applications of Complex Integration . . . . . . . . . . . 362.3.1 Nyquist Stability Criterion . . . . . . . . . . . . . . . 372.3.2 Bode Ga<strong>in</strong>-Phase Relationships . . . . . . . . . . . . . 402.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 453 SISO <strong>Control</strong> 473.1 Bode Integral Formulae . . . . . . . . . . . . . . . . . . . . . 473.1.1 Bode’s Attenuation Integral Theorem . . . . . . . . . 483.1.2 Bode Integrals for S <strong>and</strong> T . . . . . . . . . . . . . . . . 513.1.3 Design Interpretations . . . . . . . . . . . . . . . . . . 593.2 The Water-Bed Effect . . . . . . . . . . . . . . . . . . . . . . . 623.3 Poisson Integral Formulae . . . . . . . . . . . . . . . . . . . . 643.3.1 Poisson Integrals for S <strong>and</strong> T . . . . . . . . . . . . . . 653.3.2 Design Interpretations . . . . . . . . . . . . . . . . . . 673.3.3 Example: Inverted Pendulum . . . . . . . . . . . . . . 733.4 Discrete Systems . . . . . . . . . . . . . . . . . . . . . . . . . 743.4.1 Poisson Integrals for S <strong>and</strong> T . . . . . . . . . . . . . . 753.4.2 Design Interpretations . . . . . . . . . . . . . . . . . . 783.4.3 Bode Integrals for S <strong>and</strong> T . . . . . . . . . . . . . . . . 793.4.4 Design Interpretations . . . . . . . . . . . . . . . . . . 823.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 844 MIMO <strong>Control</strong> 854.1 Interpolation Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . 854.2 Bode Integral Formulae . . . . . . . . . . . . . . . . . . . . . 874.2.1 Prelim<strong>in</strong>aries . . . . . . . . . . . . . . . . . . . . . . . 884.2.2 Bode Integrals for S . . . . . . . . . . . . . . . . . . . . 914.2.3 Design Interpretations . . . . . . . . . . . . . . . . . . 964.3 Poisson Integral Formulae . . . . . . . . . . . . . . . . . . . . 984.3.1 Prelim<strong>in</strong>aries . . . . . . . . . . . . . . . . . . . . . . . 984.3.2 Poisson Integrals for S . . . . . . . . . . . . . . . . . . 994.3.3 Design Interpretations . . . . . . . . . . . . . . . . . . 102


Contentsxi4.3.4 The Cost of Decoupl<strong>in</strong>g . . . . . . . . . . . . . . . . . 1034.3.5 The Impact of Near Pole-Zero Cancelations . . . . . . 1054.3.6 Examples . . . . . . . . . . . . . . . . . . . . . . . . . 1074.4 Discrete Systems . . . . . . . . . . . . . . . . . . . . . . . . . 1134.4.1 Poisson Integral for S . . . . . . . . . . . . . . . . . . . 1144.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 1165 Extensions to Periodic Systems 1195.1 Periodic Discrete-Time Systems . . . . . . . . . . . . . . . . . 1195.1.1 Modulation Representation . . . . . . . . . . . . . . . 1205.2 Sensitivity Functions . . . . . . . . . . . . . . . . . . . . . . . 1225.3 Integral Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . . . . 1245.4 Design Interpretations . . . . . . . . . . . . . . . . . . . . . . 1265.4.1 Time-Invariant Map as a Design Objective . . . . . . 1275.4.2 Periodic <strong>Control</strong> of Time-<strong>in</strong>variant Plant . . . . . . . 1305.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 1326 Extensions to Sampled-Data Systems 1356.1 Prelim<strong>in</strong>aries. . . . . . . . . . . . . . . . . . . . . . . . . . . . 1366.1.1 Signals <strong>and</strong> System . . . . . . . . . . . . . . . . . . . . 1366.1.2 Sampler, Hold <strong>and</strong> Discretized System . . . . . . . . 1376.1.3 Closed-loop Stability . . . . . . . . . . . . . . . . . . . 1406.2 Sensitivity Functions . . . . . . . . . . . . . . . . . . . . . . . 1416.2.1 Frequency Response . . . . . . . . . . . . . . . . . . . 1416.2.2 Sensitivity <strong>and</strong> Robustness . . . . . . . . . . . . . . . 1436.3 Interpolation Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . 1456.4 Poisson Integral formulae . . . . . . . . . . . . . . . . . . . . 1506.4.1 Poisson Integral for S 0 . . . . . . . . . . . . . . . . . . 1506.4.2 Poisson Integral for T 0 . . . . . . . . . . . . . . . . . . 1536.5 Example: Robustness of Discrete Zero Shift<strong>in</strong>g . . . . . . . . 1566.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 158III <strong>Limitations</strong> <strong>in</strong> L<strong>in</strong>ear Filter<strong>in</strong>g 1617 General Concepts 1637.1 General Filter<strong>in</strong>g Problem . . . . . . . . . . . . . . . . . . . . 1637.2 Sensitivity Functions . . . . . . . . . . . . . . . . . . . . . . . 1657.2.1 Interpretation of the Sensitivities . . . . . . . . . . . . 1677.2.2 Filter<strong>in</strong>g <strong>and</strong> <strong>Control</strong> Complementarity . . . . . . . . 169


xiiContents7.3 Bounded Error Estimators . . . . . . . . . . . . . . . . . . . . 1727.3.1 Unbiased Estimators . . . . . . . . . . . . . . . . . . . 1757.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 1778 SISO Filter<strong>in</strong>g 1798.1 Interpolation Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . 1798.2 Integral Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . . . . 1818.3 Design Interpretations . . . . . . . . . . . . . . . . . . . . . . 1848.4 Examples: Kalman Filter . . . . . . . . . . . . . . . . . . . . . 1898.5 Example: Inverted Pendulum . . . . . . . . . . . . . . . . . . 1938.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 1969 MIMO Filter<strong>in</strong>g 1979.1 Interpolation Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . 1989.2 Poisson Integral Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . 1999.3 The Cost of Diagonalization . . . . . . . . . . . . . . . . . . . 2029.4 Application to Fault Detection . . . . . . . . . . . . . . . . . . 2059.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 20810 Extensions to SISO Prediction 20910.1 General Prediction Problem . . . . . . . . . . . . . . . . . . . 20910.2 Sensitivity Functions . . . . . . . . . . . . . . . . . . . . . . . 21210.3 BEE Derived Predictors . . . . . . . . . . . . . . . . . . . . . . 21310.4 Interpolation Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . 21410.5 Integral Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . . . . 21710.6 Effect of the Prediction Horizon . . . . . . . . . . . . . . . . . 21910.6.1 Large Values of τ . . . . . . . . . . . . . . . . . . . . . 21910.6.2 Intermediate Values of τ . . . . . . . . . . . . . . . . . 22010.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 22611 Extensions to SISO Smooth<strong>in</strong>g 22711.1 General Smooth<strong>in</strong>g Problem . . . . . . . . . . . . . . . . . . . 22711.2 Sensitivity Functions . . . . . . . . . . . . . . . . . . . . . . . 23011.3 BEE Derived Smoothers . . . . . . . . . . . . . . . . . . . . . 23111.4 Interpolation Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . 23211.5 Integral Constra<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . . . . 23411.5.1 Effect of the Smooth<strong>in</strong>g Lag . . . . . . . . . . . . . . . 23611.6 Sensitivity Improvement of the Optimal Smoother . . . . . . 23711.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240


ContentsxiiiNotes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 241IV <strong>Limitations</strong> <strong>in</strong> Nonl<strong>in</strong>ear <strong>Control</strong> <strong>and</strong> Filter<strong>in</strong>g 24312 Nonl<strong>in</strong>ear Operators 24512.1 Nonl<strong>in</strong>ear Operators . . . . . . . . . . . . . . . . . . . . . . . 24512.1.1 Nonl<strong>in</strong>ear Operators on a L<strong>in</strong>ear Space . . . . . . . . 24612.1.2 Nonl<strong>in</strong>ear Operators on a Banach Space . . . . . . . . 24712.1.3 Nonl<strong>in</strong>ear Operators on a Hilbert Space . . . . . . . . 24812.2 Nonl<strong>in</strong>ear Cancelations . . . . . . . . . . . . . . . . . . . . . . 24912.2.1 Nonl<strong>in</strong>ear Operators on Extended Banach Spaces . . 25012.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 25213 Nonl<strong>in</strong>ear <strong>Control</strong> 25313.1 Review of L<strong>in</strong>ear Sensitivity Relations . . . . . . . . . . . . . 25313.2 A Complementarity Constra<strong>in</strong>t . . . . . . . . . . . . . . . . . 25413.3 Sensitivity <strong>Limitations</strong> . . . . . . . . . . . . . . . . . . . . . . 25613.4 The Water-Bed Effect . . . . . . . . . . . . . . . . . . . . . . . 25813.5 Sensitivity <strong>and</strong> Stability Robustness . . . . . . . . . . . . . . 26013.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 26314 Nonl<strong>in</strong>ear Filter<strong>in</strong>g 26514.1 A Complementarity Constra<strong>in</strong>t . . . . . . . . . . . . . . . . . 26514.2 Bounded Error Nonl<strong>in</strong>ear Estimation . . . . . . . . . . . . . . 26814.3 Sensitivity <strong>Limitations</strong> . . . . . . . . . . . . . . . . . . . . . . 26914.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 271V Appendices 273A Review of Complex Variable Theory 275A.1 Functions, Doma<strong>in</strong>s <strong>and</strong> Regions . . . . . . . . . . . . . . . . 275A.2 Complex Differentiation . . . . . . . . . . . . . . . . . . . . . 276A.3 Analytic functions . . . . . . . . . . . . . . . . . . . . . . . . . 278A.3.1 Harmonic Functions . . . . . . . . . . . . . . . . . . . 280A.4 Complex Integration . . . . . . . . . . . . . . . . . . . . . . . 281A.4.1 Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 281A.4.2 Integrals . . . . . . . . . . . . . . . . . . . . . . . . . . 283A.5 Ma<strong>in</strong> Integral Theorems . . . . . . . . . . . . . . . . . . . . . 289


xivContentsA.5.1 Green’s Theorem . . . . . . . . . . . . . . . . . . . . . 289A.5.2 The Cauchy Integral Theorem . . . . . . . . . . . . . . 291A.5.3 Extensions of Cauchy’s Integral Theorem . . . . . . . 293A.5.4 The Cauchy Integral Formula . . . . . . . . . . . . . . 296A.6 The Poisson Integral Formula . . . . . . . . . . . . . . . . . . 298A.6.1 Formula for the Half Plane . . . . . . . . . . . . . . . 298A.6.2 Formula for the Disk . . . . . . . . . . . . . . . . . . . 302A.7 Power Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304A.7.1 Derivatives of Analytic Functions . . . . . . . . . . . 304A.7.2 Taylor Series . . . . . . . . . . . . . . . . . . . . . . . . 306A.7.3 Laurent Series . . . . . . . . . . . . . . . . . . . . . . . 308A.8 S<strong>in</strong>gularities . . . . . . . . . . . . . . . . . . . . . . . . . . . . 310A.8.1 Isolated S<strong>in</strong>gularities . . . . . . . . . . . . . . . . . . . 310A.8.2 Branch Po<strong>in</strong>ts . . . . . . . . . . . . . . . . . . . . . . . 313A.9 Integration of Functions with S<strong>in</strong>gularities . . . . . . . . . . 315A.9.1 Functions with Isolated S<strong>in</strong>gularities . . . . . . . . . . 315A.9.2 Functions with Branch Po<strong>in</strong>ts . . . . . . . . . . . . . . 319A.10 The Maximum Modulus Pr<strong>in</strong>ciple . . . . . . . . . . . . . . . 321A.11 Entire Functions . . . . . . . . . . . . . . . . . . . . . . . . . . 322Notes <strong>and</strong> References . . . . . . . . . . . . . . . . . . . . . . . 324B Proofs of Some Results <strong>in</strong> the Chapters 325B.1 Proofs for Chapter 4 . . . . . . . . . . . . . . . . . . . . . . . . 325B.2 Proofs for Chapter 6 . . . . . . . . . . . . . . . . . . . . . . . . 332B.2.1 Proof of Lemma 6.2.2 . . . . . . . . . . . . . . . . . . . 332B.2.2 Proof of Lemma 6.2.4 . . . . . . . . . . . . . . . . . . . 337B.2.3 Proof of Lemma 6.2.5 . . . . . . . . . . . . . . . . . . . 339C The Laplace Transform of the Prediction Error 341D Least Squares Smoother Sensitivities for Large τ 345Bibliography 349Index 359


Part IIntroduction


1A Chronicle of System Design<strong>Limitations</strong>1.1 IntroductionThis book is concerned with fundamental limits <strong>in</strong> the design of feedbackcontrol systems <strong>and</strong> filters. These limits tell us what is feasible <strong>and</strong>, conversely,what is <strong>in</strong>feasible, <strong>in</strong> a given set of circumstances. Their significancearises from the fact that they subsume any particular solution to aproblem by def<strong>in</strong><strong>in</strong>g the characteristics of all possible solutions.Our emphasis throughout is on system analysis, although the resultsthat we provide convey strong implications <strong>in</strong> system synthesis. For a varietyof dynamical systems, we will derive relations that represent fundamentallimits on the achievable performance of all possible designs. Theserelations depend on both constitutive <strong>and</strong> structural properties of the systemunder study, <strong>and</strong> are stated <strong>in</strong> terms of functions that quantify systemperformance <strong>in</strong> various senses.<strong>Fundamental</strong> limits are actually at the core of many fields of eng<strong>in</strong>eer<strong>in</strong>g,science <strong>and</strong> mathematics. The follow<strong>in</strong>g examples are probably wellknown to the reader.Example 1.1.1 (The Cramér-Rao Inequality). In Po<strong>in</strong>t Estimation Theory, afunction ^θ(Y) of a r<strong>and</strong>om variable Y — whose distribution depends on anunknown parameter θ — is an unbiased estimator for θ if its expected valuesatisfiesE θ {^θ(Y)} = θ , (1.1)


4 1. A Chronicle of System Design <strong>Limitations</strong>where E θ denotes expectation over the parametrized density functionp(· ; θ) for the data.A natural measure of performance for a parameter estimator is the covarianceof the estimation error, def<strong>in</strong>ed by E θ {(^θ−θ) 2 }. Achiev<strong>in</strong>g a smallcovariance of the error is usually considered to be a good property of anunbiased estimator. There is, however, a limit on the m<strong>in</strong>imum value ofcovariance that can be atta<strong>in</strong>ed. Indeed, a relatively straightforward mathematicalderivation from (1.1) leads to the follow<strong>in</strong>g <strong>in</strong>equality, whichholds for any unbiased estimator,E θ {(^θ − θ) 2 } ≥(E θ{ (∂ log p(y; θ)∂θ) 2}) −1,where p(· ; θ) def<strong>in</strong>es the density function of the data y ∈ Y.The above relation is known as the Cramér-Rao Inequality, <strong>and</strong> the righth<strong>and</strong> side (RHS) the Cramér-Rao Lower Bound (Cramér, 1946). This plays afundamental role <strong>in</strong> Estimation Theory (Ca<strong>in</strong>es, 1988). Indeed, an estimatoris considered to be efficient if its covariance is equal to the Cramér-RaoLower Bound. Thus, this bound provides a benchmark aga<strong>in</strong>st which allpractical estimators can be compared.◦Another illustration of a relation express<strong>in</strong>g fundamental limits is givenby Shannon’s Theorem of Communications.Example 1.1.2 (The Shannon Theorem). A celebrated result <strong>in</strong> CommunicationTheory is the Shannon Theorem (Shannon, 1948). This crucial theoremestablishes that given an <strong>in</strong>formation source <strong>and</strong> a communicationchannel, there exists a cod<strong>in</strong>g technique such that the <strong>in</strong>formation can betransmitted over the channel at any rate R less than the channel capacityC <strong>and</strong> with arbitrarily small frequency of errors despite the presenceof noise (Carlson, 1975). In short, the probability of error <strong>in</strong> the received<strong>in</strong>formation can be made arbitrarily small provided thatR ≤ C . (1.2)Conversely, if R > C, then reliable communication is impossible. Whenspecialized to cont<strong>in</strong>uous channels, 1 a complementary result (known asthe Shannon-Hartley Theorem) gives the channel capacity of a b<strong>and</strong>-limitedchannel corrupted by white gaussian noise asC = B log 2(1 + S/N)bits/sec,where the b<strong>and</strong>width, B, <strong>and</strong> the signal-to-noise ratio, S/N, are the relevantchannel parameters.1 A cont<strong>in</strong>uous channel is one <strong>in</strong> which messages are represented as waveforms, i.e., cont<strong>in</strong>uousfunctions of time, <strong>and</strong> the relevant parameters are the b<strong>and</strong>width <strong>and</strong> the signal-tonoiseratio (Carlson, 1975).


1.1 Introduction 5The Shannon-Hartley law, together with <strong>in</strong>equality (1.2), are fundamentalto communication eng<strong>in</strong>eers s<strong>in</strong>ce they (i) represent the absolute bestthat can be achieved <strong>in</strong> the way of reliable <strong>in</strong>formation transmission, <strong>and</strong>(ii) they show that, for a specified <strong>in</strong>formation rate, one can reduce the signalpower provided one <strong>in</strong>creases the b<strong>and</strong>width, <strong>and</strong> vice versa (Carlson,1975). Hence these results both provide a benchmark aga<strong>in</strong>st which practicalcommunication systems can be evaluated, <strong>and</strong> capture the <strong>in</strong>herenttrade-offs associated with physical communication systems.◦Compar<strong>in</strong>g the fundamental relations <strong>in</strong> the above examples, we seethat they possess common qualities. Firstly, they evolve from basic axiomsabout the nature of the universe. Secondly, they describe <strong>in</strong>escapableperformance bounds that act as benchmarks for practical systems. Andthirdly, they are recognized as be<strong>in</strong>g central to the design of real systems.The reader may wonder why it is important to know the existence offundamental limitations before carry<strong>in</strong>g out a particular design to meetsome desired specifications. Åström (1996) quotes an <strong>in</strong>terest<strong>in</strong>g exampleof the latter issue. This example concerns the design of the flight controllerfor the X-29 aircraft. Considerable design effort was recently devotedto this problem <strong>and</strong> many different optimization methods werecompared <strong>and</strong> contrasted. One of the design criteria was that the phasemarg<strong>in</strong> should be greater than 45 ◦ for all flight conditions. At one flightcondition the model conta<strong>in</strong>ed an unstable pole at 6 <strong>and</strong> a nonm<strong>in</strong>imumphase zero at 26. A relatively simple argument based on the fundamentallaws applicable to feedback loops (see Example 2.3.2 <strong>in</strong> Chapter 2) showsthat a phase marg<strong>in</strong> of 45 ◦ is <strong>in</strong>feasible! It is <strong>in</strong>terest<strong>in</strong>g to note that manydesign methods were used <strong>in</strong> a futile attempt to reach the desired goal.As another illustration of <strong>in</strong>herently difficult problems, we learn fromvirtually every undergraduate text book on control that the states ofan <strong>in</strong>verted pendulum are completely observable from measurements ofthe carriage position. However, the system has an open right half plane(ORHP) zero to the left of a real ORHP pole. A simple calculation basedon <strong>in</strong>tegral sensitivity constra<strong>in</strong>ts (see §8.5 <strong>in</strong> Chapter 8) shows that sensitivitypeaks of the order of 50:1 are unavoidable <strong>in</strong> the estimation of thependulum angle when only the carriage position is measured. This, <strong>in</strong>turn, implies that relative <strong>in</strong>put errors of the order of 1% will appear asangle relative estimation errors of the order of 50%. Note that this claimcan be made before any particular estimator is considered. Thus much wastedeffort can aga<strong>in</strong> be avoided. The <strong>in</strong>escapable conclusion is that we shouldredirect our efforts to build<strong>in</strong>g angle measur<strong>in</strong>g transducers rather thanattempt<strong>in</strong>g to estimate the angle by an <strong>in</strong>herently sensitive procedure.In the rema<strong>in</strong>der of the book we will exp<strong>and</strong> on the themes outl<strong>in</strong>edabove. We will f<strong>in</strong>d that the fundamental laws divide problems <strong>in</strong>to thosethat are essentially easy (<strong>in</strong> which case virtually any sensible designmethod will give a satisfactory solution) <strong>and</strong> those that are essentially


6 1. A Chronicle of System Design <strong>Limitations</strong>hard (<strong>in</strong> which case no design method will give a satisfactory solution).We believe that underst<strong>and</strong><strong>in</strong>g these <strong>in</strong>herent design difficulties readilyjustifies the effort needed to appreciate the results.1.2 Performance <strong>Limitations</strong> <strong>in</strong> DynamicalSystemsIn this book we will deal with very general classes of dynamic systems.The dynamic systems that we consider are characterized by three key attributes,namely:(i) they consist of particular <strong>in</strong>terconnections of a “known part” — theplant — <strong>and</strong> a “design part” — the controller or filter — whose structureis such that certa<strong>in</strong> signals <strong>in</strong>terconnect<strong>in</strong>g the parts are <strong>in</strong>dicatorsof the performance of the overall system;(ii) the parts of the <strong>in</strong>terconnection are modeled as <strong>in</strong>put-output operators2 with causal dynamics, i.e., an <strong>in</strong>put applied at time t 0 producesan output response for t > t 0 ; <strong>and</strong>(iii) the <strong>in</strong>terconnection regarded as a whole system is stable, i.e., abounded <strong>in</strong>put produces a bounded response (the precise def<strong>in</strong>itionwill be given later).We will show that, when these attributes are comb<strong>in</strong>ed with<strong>in</strong> an appropriatemathematical formalism, we can derive fundamental relations thatmay be considered as be<strong>in</strong>g systemic versions of the Cramér-Rao LowerBound of Probability <strong>and</strong> the Channel Capacity Limit of Communications.These relations are fundamental <strong>in</strong> the sense that they describe achievable— or non achievable — properties of the overall system only <strong>in</strong> terms ofthe known part of the system, i.e., they hold for any particular choice ofthe design part.As a simple illustrative example, consider the unity feedback controlsystem shown <strong>in</strong> Figure 1.1.To add a mathematical formalism to the problem, let us assume thatthe plant <strong>and</strong> controller are described by f<strong>in</strong>ite dimensional, l<strong>in</strong>ear time<strong>in</strong>variant(LTI), scalar, cont<strong>in</strong>uous-time dynamical systems. We can thususe Laplace transforms to represent signals. The plant <strong>and</strong> controller canbe described <strong>in</strong> transfer function form by G(s) <strong>and</strong> K(s), whereG(s) = N G(s)D G (s) , <strong>and</strong> K(s) = N K(s)D K (s) . (1.3)2 It is sufficient here to consider an <strong>in</strong>put-output operator as a mapp<strong>in</strong>g between <strong>in</strong>put<strong>and</strong> output signals.


1.2 Performance <strong>Limitations</strong> <strong>in</strong> Dynamical Systems 7Disturbance❜ dReference Error+❄ Outputr ❜ ✲ ✐ ✲ <strong>Control</strong>ler ✲ Plant ✲ ✐ ✲ ❜+✻−e+ yFIGURE 1.1. Feedback control system.The reader will undoubtedly know 3 that the transfer functions from reference<strong>in</strong>put to output <strong>and</strong> from disturbance <strong>in</strong>put to output are givenrespectively by T <strong>and</strong> S, whereN G N KT =,N G N K + D G D K(1.4)D G D KS =.N G N K + D G D K(1.5)Note that these are dimensionless quantities s<strong>in</strong>ce they represent thetransfer function (ratio) between like quantities that are measured <strong>in</strong> thesame units. Also T(jω) <strong>and</strong> S(jω) describe the response to <strong>in</strong>puts of aparticular type, namely pure s<strong>in</strong>usoids. S<strong>in</strong>ce T(jω) <strong>and</strong> S(jω) are dimensionless,it is appropriate to compare their respective amplitudes to benchmarkvalues. At each frequency, the usual value chosen as a benchmarkis unity, s<strong>in</strong>ce T(jω 0 ) = 1 implies that the magnitude of the output isequal to the magnitude of the reference <strong>in</strong>put at frequency ω 0 , <strong>and</strong> s<strong>in</strong>ceS(jω 0 ) = 1 implies that the magnitude of the output is equal to the magnitudeof the disturbance <strong>in</strong>put at frequency ω 0 . More generally, the frequencyresponse of T <strong>and</strong> S can be used as measures of stability robustnesswith respect to model<strong>in</strong>g uncerta<strong>in</strong>ties, <strong>and</strong> hence it is sensible to comparethem to “desired shapes” that act as benchmarks.Other doma<strong>in</strong>s also use dimensionless quantities. For example, <strong>in</strong> ElectricalPower Eng<strong>in</strong>eer<strong>in</strong>g it is common to measure currents, voltages, etc.,as a fraction of the “rated” currents, voltages, etc., of the mach<strong>in</strong>e. Thissystem of units is commonly called a “per-unit” system. Similarly, <strong>in</strong> FluidDynamics, it is often desirable to determ<strong>in</strong>e when two different flow situationsare similar. It was shown by Osborne Reynolds (Reynolds, 1883)that two flow scenarios are dynamically similar when the quantityR = ulρµ ,3 See Chapter 2 for more details.


8 1. A Chronicle of System Design <strong>Limitations</strong>(now called the Reynolds number) is the same for both problems. 4 TheReynolds number is the ratio of <strong>in</strong>ertial to viscous forces, <strong>and</strong> high valuesof R <strong>in</strong>variable imply that the flow will be turbulent rather than lam<strong>in</strong>ar.As can be seen from these examples, dimensionless quantities facilitatethe comparison of problems with critical (or benchmark) values.The key question <strong>in</strong> scalar feedback control synthesis is how to f<strong>in</strong>d aparticular value for the design polynomials N K <strong>and</strong> D K <strong>in</strong> (1.3) so that thefeedback loop satisfies certa<strong>in</strong> desired properties. For example, it is usuallydesirable (see Chapter 2) to have T(jω) = 1 at low frequencies <strong>and</strong>S(jω) = 1 at high frequencies. These k<strong>in</strong>ds of design goals are, of course,important questions; but we seek deeper <strong>in</strong>sights. Our aim is to exam<strong>in</strong>ethe fundamental <strong>and</strong> unavoidable constra<strong>in</strong>ts on T <strong>and</strong> S that hold irrespectiveof which controller K is used — provided only that the loop isstable, l<strong>in</strong>ear, <strong>and</strong> time-<strong>in</strong>variant (actually, <strong>in</strong> the text we will relax theselatter restrictions <strong>and</strong> also consider nonl<strong>in</strong>ear <strong>and</strong> time-vary<strong>in</strong>g loops).In the l<strong>in</strong>ear scalar case, equations (1.5) <strong>and</strong> (1.4) encapsulate the keyrelationships that lead to the constra<strong>in</strong>ts. The central observation is thatwe require the loop to be stable <strong>and</strong> hence we require that, whatever valuefor the controller transfer function we choose, the resultant closed loopcharacteristic polynomial N G N K + D G D K must have its zeros <strong>in</strong> the openleft half plane.A further observation is that the two terms N G N K <strong>and</strong> D G D K of thecharacteristic polynomial appear <strong>in</strong> the numerator of T <strong>and</strong> S respectively.These observations, <strong>in</strong> comb<strong>in</strong>ation, have many consequences, for examplewe see that(i) S(s) + T(s) = 1 for all s (called the complementarity constra<strong>in</strong>t);(ii) if the characteristic polynomial has all its zeros to the left of −α,where α is some nonnegative real number, then the functions S <strong>and</strong>T are analytic <strong>in</strong> the half plane to the right of −α (called analyticityconstra<strong>in</strong>t);(iii) if q is a zero of the plant numerator N G (i.e., a plant zero), such thatRe q > −α (here Re s denotes real part of the complex number s),then T(q) = 0 <strong>and</strong> S(q) = 1; similarly, if p is a zero of the plant denom<strong>in</strong>atorD G (i.e., a plant pole), such that Re q > −α, then T(p) = 1<strong>and</strong> S(p) = 0 (called <strong>in</strong>terpolation constra<strong>in</strong>ts).The above seem<strong>in</strong>gly <strong>in</strong>nocuous constra<strong>in</strong>ts actually have profound implicationson the achievable performance as we will see below.4 Here u is a characteristic velocity, l a characteristic length, ρ the fluid density, <strong>and</strong> µ theviscosity.


1.3 Time Doma<strong>in</strong> Constra<strong>in</strong>ts 91.3 Time Doma<strong>in</strong> Constra<strong>in</strong>tsIn the ma<strong>in</strong> body of the book we will carry out an <strong>in</strong>-depth treatmentof constra<strong>in</strong>ts for <strong>in</strong>terconnected dynamic systems. However, to motivateour future developments we will first exam<strong>in</strong>e some prelim<strong>in</strong>ary resultsthat follow very easily from the use of the Laplace transform formalism.In particular we have the follow<strong>in</strong>g result.Lemma 1.3.1. Let H(s) be a strictly proper transfer function that has all itspoles <strong>in</strong> the half plane Re s ≤ −α, where α is some f<strong>in</strong>ite real positive number(i.e., H(s) is analytic <strong>in</strong> Re s > −α). Also, let h(t) be the correspond<strong>in</strong>gtime doma<strong>in</strong> function, i.e.,H(s) = Lh(t) ,where L· denotes the Laplace transform. Then, for any s 0 such that Re s 0 >−α, we have ∫ ∞0e −s 0t h(t) dt = lims→s0H(s) .Proof. From the def<strong>in</strong>ition of the Laplace transform we have that, for all s<strong>in</strong> the region of convergence of the transform, i.e., for Re s > −α,H(s) =∫ ∞0e −st h(t) dt .The result then follows s<strong>in</strong>ce s 0 is <strong>in</strong> the region of convergence of the transform.□In the follow<strong>in</strong>g subsection, we will apply the above result to exam<strong>in</strong>ethe properties of the step responses of the output <strong>and</strong> error <strong>in</strong> Figure 1.1.1.3.1 Integrals on the Step ResponseWe will analyze here the impact on the step response of the closed-loopsystem of open-loop poles at the orig<strong>in</strong>, unstable poles, <strong>and</strong> nonm<strong>in</strong>imumphase zeros. We will then see that the results below quantify limits <strong>in</strong> performanceas constra<strong>in</strong>ts on transient properties of the system such as risetime, settl<strong>in</strong>g time, overshoot <strong>and</strong> undershoot.Throughout this subsection, we refer to Figure 1.1, where the plant <strong>and</strong>controller are as <strong>in</strong> (1.3), <strong>and</strong> where e <strong>and</strong> y are the time responses to aunit step <strong>in</strong>put (i.e., r(t) = 1, d(t) = 0, ∀t).We then have the follow<strong>in</strong>g results relat<strong>in</strong>g open-loop poles <strong>and</strong> zeroswith the step response.Theorem 1.3.2 (Open-loop <strong>in</strong>tegrators). Suppose that the closed loop <strong>in</strong>Figure 1.1 is stable. Then,


10 1. A Chronicle of System Design <strong>Limitations</strong>(i) for lim s→0 sG(s)K(s) = c 1 , 0 < |c 1 | < ∞, we have thatlim e(t) = 0 ,t→∞∫ ∞0e(t) dt = 1 c 1;(ii) for lim s→0 s 2 G(s)K(s) = c 2 , 0 < |c 2 | < ∞, we have thatlim e(t) = 0 ,t→∞∫ ∞0e(t) dt = 0 .Proof. Let E, Y, R <strong>and</strong> D denote the Laplace transforms of e, y, r <strong>and</strong> d,respectively. Then,E(s) = S(s)[R(s) − D(s)] , (1.6)where S is the sensitivity function def<strong>in</strong>ed <strong>in</strong> (1.5), <strong>and</strong> R(s) − D(s) = 1/sfor a unit step <strong>in</strong>put. Next, note that <strong>in</strong> case (i) the open-loop system GKhas a simple pole at s = 0, i.e., G(s)K(s) = ˜L(s)/s, where lim s→0 ˜L(s) = c 1 .Accord<strong>in</strong>gly, the sensitivity function has the form<strong>and</strong> thus, from (1.6),S(s) =ss + ˜L(s) ,lim E(s) = 1 . (1.7)s→0 c 1From (1.7) <strong>and</strong> the F<strong>in</strong>al Value Theorem (e.g., Middleton <strong>and</strong> Goodw<strong>in</strong>,1990), we have thatlim e(t) = lim sE(s)t→∞ s→0= 0 .Similarly, from (1.7) <strong>and</strong> Lemma 1.3.1,∫ ∞0e(t) dt = lims→0E(s)= 1 c 1.This completes the proof of case (i).Case (ii) follows <strong>in</strong> the same fashion, on not<strong>in</strong>g that here the open-loopsystem GK has a double pole at s = 0.□


1.3 Time Doma<strong>in</strong> Constra<strong>in</strong>ts 11Theorem 1.3.2 states conditions that the error step response has to satisfyprovided the open-loop system has poles at the orig<strong>in</strong>, i.e., it has pure<strong>in</strong>tegrators. The follow<strong>in</strong>g result gives similar constra<strong>in</strong>ts for ORHP openlooppoles.Theorem 1.3.3 (ORHP open-loop poles). Consider Figure 1.1, <strong>and</strong> supposethat the open-loop plant has a pole at s = p, such that Re p > 0.Then, if the closed loop is stable,∫ ∞e −pt e(t) dt = 0 , (1.8)0<strong>and</strong> ∫ ∞0e −pt y(t) dt = 1 p . (1.9)Proof. Note that, by assumption, s = p is <strong>in</strong> the region of convergence ofE(s), the Laplace transform of the error. Then, us<strong>in</strong>g (1.6) <strong>and</strong> Lemma 1.3.1,we have that∫ ∞e −pt e(t) dt = E(p)0= S(p)p= 0 ,where the last step follows s<strong>in</strong>ce s = p is a zero of S, by the <strong>in</strong>terpolationconstra<strong>in</strong>ts. This proves (1.8). Relation (1.9) follows easily from (1.8) <strong>and</strong>the fact that r = 1, i.e.,∫ ∞ ∫ ∞e −pt y(t) dt = e −pt (r(t) − e(t)) dt00=∫ ∞0e −pt dt= 1 p . □A result symmetric to that of Theorem 1.3.3 holds for plants with nonm<strong>in</strong>imumphase zeros, as we see <strong>in</strong> the follow<strong>in</strong>g theorem.Theorem 1.3.4 (ORHP open-loop zeros). Consider Figure 1.1, <strong>and</strong> supposethat the open-loop plant has a zero at s = q, such that Re q > 0.Then, if the closed loop is stable,∫ ∞e −qt e(t) dt = 10q , (1.10)


12 1. A Chronicle of System Design <strong>Limitations</strong><strong>and</strong> ∫ ∞0e −qt y(t) dt = 0 . (1.11)Proof. Similar to that of Theorem 1.3.3, except that here T(q) = 0.□The above theorems assert that if the plant has an ORHP open-loop poleor zero, then the error <strong>and</strong> output time responses to a step must satisfy <strong>in</strong>tegralconstra<strong>in</strong>ts that hold for all possible controller giv<strong>in</strong>g a stable closedloop. Moreover, if the plant has real zeros or poles <strong>in</strong> the ORHP, then theseconstra<strong>in</strong>ts display a balance of exponentially weighted areas of positive<strong>and</strong> negative error (or output). It is evident that the same conclusionshold for ORHP zeros <strong>and</strong>/or poles of the controller. Actually, equations(1.8) <strong>and</strong> (1.10) hold for open-loop poles <strong>and</strong> zeros that lie to the rightof all closed-loop poles, provided the open-loop system has an <strong>in</strong>tegrator.Hence, stable poles <strong>and</strong> m<strong>in</strong>imum phase zeros also lead to limitations <strong>in</strong>certa<strong>in</strong> circumstances.The time doma<strong>in</strong> <strong>in</strong>tegral constra<strong>in</strong>ts of the previous theorems tell usfundamental properties of the result<strong>in</strong>g performance. For example, Theorem1.3.2 shows that a plant-controller comb<strong>in</strong>ation conta<strong>in</strong><strong>in</strong>g a double<strong>in</strong>tegrator will have an error step response that necessarily overshoots(changes sign) s<strong>in</strong>ce the <strong>in</strong>tegral of the error is zero. Similarly, Theorem1.3.4 implies that if the open-loop plant (or controller) has real ORHPzeros then the closed-loop transient response can be arbitrarily poor (depend<strong>in</strong>gonly on the location of the closed-loop poles relative to q), as weshow next. Assume that the closed-loop poles are located to the left of −α,α > 0. Observe that the time evolution of e is governed by the closed-looppoles. Then as q becomes much smaller than α, the weight <strong>in</strong>side the <strong>in</strong>tegral,e −qt , can be approximated to 1 over the transient response of theerror. Hence, s<strong>in</strong>ce the RHS of (1.10) grows as q decreases, we can immediatelyconclude that real ORHP zeros much smaller than the magnitudeof the closed-loop poles will produce large transients <strong>in</strong> the step responseof a feedback loop. Moreover this effect gets worse as the zeros approachthe imag<strong>in</strong>ary axis.The follow<strong>in</strong>g example illustrates the <strong>in</strong>terpretation of the above constra<strong>in</strong>ts.Example 1.3.1. Consider the plantG(s) =q − ss(s + 1) ,where q is a positive real number. For this plant we use the <strong>in</strong>ternal modelcontrol paradigm (Morari <strong>and</strong> Zafiriou, 1989) to design a controller <strong>in</strong> Figure1.1 that achieves the follow<strong>in</strong>g complementarity sensitivity functionT(s) =q − sq(0.2s + 1) 2 .


14 1. A Chronicle of System Design <strong>Limitations</strong>F<strong>in</strong>ally, the overshoot is the maximum value by which the output exceedsits f<strong>in</strong>al set po<strong>in</strong>t value, i.e.,y os sup {−e(t)} ;t<strong>and</strong> the undershoot is the maximum negative peak of the system’s output,i.e.,y us sup {−y(t)} .tFigure 1.3 shows a typical step response <strong>and</strong> illustrates these quantities.y(t)1y os2ɛ0ty ust rt sFIGURE 1.3. Time doma<strong>in</strong> specifications.Corollary 1.3.5 (Overshoot <strong>and</strong> real ORHP poles). A stable unity feedbacksystem with a real ORHP open-loop pole, say at s = p, must haveovershoot <strong>in</strong> its step response. Moreover, if t r is the rise time def<strong>in</strong>ed by(1.12), theny os ≥ (pt r − 1)e ptr + 1pt r≥ pt (1.14)r2 .Proof. The existence of overshoot follows immediately from Theorem1.3.3, s<strong>in</strong>ce e(t) cannot have a s<strong>in</strong>gle sign unless it is zero for all t.From the def<strong>in</strong>ition of rise time <strong>in</strong> (1.12) we have that y(t) ≤ t/t r fort ≤ t r , i.e., e(t) ≥ 1 − t/t r . Us<strong>in</strong>g this, we can write from the <strong>in</strong>tegralequality (1.8)∫ ∞ ∫ tr(− e −pt e(t) dt ≥ e −pt 1 − t )dt . (1.15)t rt r0


1.3 Time Doma<strong>in</strong> Constra<strong>in</strong>ts 15From (1.15) <strong>and</strong> the def<strong>in</strong>ition of overshoot, it follows thaty ose −ptrp= y os∫ ∞t re −pt dt (1.16)≥∫ tr0e −pt (1 − tt r)dt= (pt r − 1) + e −ptrp 2 t r. (1.17)Equation (1.14) is then obta<strong>in</strong>ed from (1.16) - (1.17).□Corollary 1.3.5 shows that if the closed-loop system is “slow”, i.e., it hasa large rise time, the step response will present a large overshoot if thereare open-loop unstable real poles 5 . Intuitively, we can deduce from thisresult that unstable poles will dem<strong>and</strong> a “fast” closed-loop system — orequivalently, a larger closed-loop b<strong>and</strong>width — to keep an acceptable performance.The farther from the jω-axis the poles are, the more str<strong>in</strong>gentthis b<strong>and</strong>width dem<strong>and</strong> will be.An analogous situation is found <strong>in</strong> relation with real nonm<strong>in</strong>imumphase zeros <strong>and</strong> undershoot <strong>in</strong> the system’s response, as we see <strong>in</strong> thenext corollary.Corollary 1.3.6 (Undershoot <strong>and</strong> real ORHP zeros). A stable unity feedbacksystem with a real ORHP open-loop zero, say at s = q, must haveundershoot <strong>in</strong> its step response. Moreover, if t s <strong>and</strong> ɛ are the settl<strong>in</strong>g time<strong>and</strong> level def<strong>in</strong>ed by (1.13),y us ≥1 − ɛe qts − 1 . (1.18)Proof. Similar to Corollary 1.3.5, this time us<strong>in</strong>g (1.11) <strong>and</strong> the def<strong>in</strong>itionof settl<strong>in</strong>g time <strong>and</strong> undershoot.□The <strong>in</strong>terpretation for Corollary 1.3.6 is that if the system has real nonm<strong>in</strong>imumphase zeros, then its step response will display large undershootsas the settl<strong>in</strong>g time is reduced, i.e., the closed-loop system is made“faster”. Notice that this situation is quite the opposite to that for real unstablepoles, for now real nonm<strong>in</strong>imum phase zeros will dem<strong>and</strong> a shortclosed-loop b<strong>and</strong>width for good performance. Moreover, here the closerto the imag<strong>in</strong>ary axis the zeros are, the stronger the dem<strong>and</strong> for a shortb<strong>and</strong>width will be.Evidently from the previous remarks, a clear trade-off <strong>in</strong> design ariseswhen the open-loop system is both unstable <strong>and</strong> nonm<strong>in</strong>imum phase,5 This is <strong>in</strong> contrast with the case of open-loop stable systems, where large overshootsnormally arise from short rise times.


16 1. A Chronicle of System Design <strong>Limitations</strong>s<strong>in</strong>ce, depend<strong>in</strong>g on the relative position of these poles <strong>and</strong> zeros, a completelysatisfactory performance may not be possible. The follow<strong>in</strong>g resultconsiders such a case.Corollary 1.3.7. Suppose a stable unity feedback system has a real ORHPopen-loop zero at s = q <strong>and</strong> a real ORHP open-loop pole at s = p, p ≠ q.Then,(i) if p < q, the overshoot satisfiesy os ≥(ii) if p > q, the undershoot satisfiesy us ≥pq − p ,qp − q .Proof. For case (i) comb<strong>in</strong>e (1.8) <strong>and</strong> (1.10) to obta<strong>in</strong>∫ ∞(e −pt − e −qt) [−e(t)] dt = 10q .Us<strong>in</strong>g the def<strong>in</strong>ition of overshoot yields1q ≤ y os∫ ∞0(e −pt − e −qt) dt= y osq − ppq . (1.19)The result then follows from (1.19) by us<strong>in</strong>g the fact that q > p.Case (ii) can be shown similarly by comb<strong>in</strong><strong>in</strong>g (1.9) <strong>and</strong> (1.11) <strong>and</strong> us<strong>in</strong>gthe fact that q < p.□In the follow<strong>in</strong>g subsection, we illustrate the previous results by analyz<strong>in</strong>gtime doma<strong>in</strong> limitations aris<strong>in</strong>g <strong>in</strong> the control of an <strong>in</strong>verted pendulum.This example will be revisited <strong>in</strong> Chapter 3, where we studyfrequency doma<strong>in</strong> limitations <strong>in</strong> the context of feedback control, <strong>and</strong> <strong>in</strong>Chapter 8, where we analyze frequency doma<strong>in</strong> limitations from a filter<strong>in</strong>gpo<strong>in</strong>t of view.1.3.3 Example: Inverted PendulumConsider the <strong>in</strong>verted pendulum shown <strong>in</strong> Figure 1.4. The l<strong>in</strong>earizedmodel for this system about the orig<strong>in</strong> (i.e., θ = ˙θ = y = ẏ = 0) hasthe follow<strong>in</strong>g transfer function from force, u, to carriage position, yY(s) (s − q)(s + q)=U(s) M s 2 (s − p)(s + p) , (1.20)


1.3 Time Doma<strong>in</strong> Constra<strong>in</strong>ts 17θ muyMlFIGURE 1.4. Inverted pendulum.whereq = √ g/l ,√(M + m)gp =.MlIn the above def<strong>in</strong>itions, g is the gravitational constant, m is the mass atthe end of the pendulum, M is the carriage’s mass, <strong>and</strong> l is the pendulum’slength.We readily see that this system satisfies the conditions discussed <strong>in</strong>Corollary 1.3.7, part (ii). Say that we normalize so that q = 1 <strong>and</strong> takem/M = 0.1, so that p = 1.05. Corollary 1.3.7 then predicts an undershootgreater than 20!500m / M = 1y(t)0.4−500.20.1−1000 2 4 6 8 10FIGURE 1.5. Position time response of the <strong>in</strong>verted pendulum.t


18 1. A Chronicle of System Design <strong>Limitations</strong>To test the results, we designed an LQG-LQR controller 6 that fixes theclosed-loop poles at s = −1, −2, −3, −4. Figure 1.5 shows the step responseof the carriage position for fixed q = 1, <strong>and</strong> for different valuesof the mass ratio m/M (which imply, <strong>in</strong> turn, different locations of theopen-loop poles of the plant). We can see from this figure that (i) the lowerbound on the undershoot predicted by Corollary 1.3.7 is conservative (thisis due to the approximation −y ≈ y us implicitly used to derived thisbound), <strong>and</strong> (ii) the bound correctly predicts an <strong>in</strong>crease of the undershootas the difference p − q decreases.1.4 Frequency Doma<strong>in</strong> Constra<strong>in</strong>tsThe results presented <strong>in</strong> §1.3 were expressed <strong>in</strong> the time doma<strong>in</strong> us<strong>in</strong>gLaplace transforms. However, one might expect that correspond<strong>in</strong>g resultshold <strong>in</strong> the frequency doma<strong>in</strong>. This will be a major theme <strong>in</strong> therema<strong>in</strong>der of the book. To give the flavor of the results, we will brieflydiscuss constra<strong>in</strong>ts <strong>in</strong>duced by zeros on the imag<strong>in</strong>ary axis, or ORHP zerosarbitrarily close to the imag<strong>in</strong>ary axis. Analogous conclusions hold forpoles on the imag<strong>in</strong>ary axis.Note that, assum<strong>in</strong>g closed-loop stability, then an open-loop zero on theimag<strong>in</strong>ary axis at jω q implies thatT(jω q ) = 0, <strong>and</strong> S(jω q ) = 1. (1.21)We have remarked earlier that a common design objective is to haveS(jω) ≪ 1 at low frequencies, i.e., for ω ∈ [0, ω 1 ] for some ω 1 . Clearly,if ω q < ω 1 , then this goal is <strong>in</strong>consistent with (1.21). Now say that theopen-loop plant has a zero at q = ɛ + jω q , where ɛ is small <strong>and</strong> positive.Then we might expect (by cont<strong>in</strong>uity) that |S(jω)| would have a tendencyto be near 1 <strong>in</strong> the vic<strong>in</strong>ity of ω = ω q . Actually, it turns out to be possibleto force |S(jω)| to be small for frequencies ω ∈ [0, ω 1 ] where ω 1 > ω q .However, one has to pay a heavy price for try<strong>in</strong>g to defeat “the laws of nature”by not allow<strong>in</strong>g |S(jω)| to approach 1 near ω q . Indeed, it turns outthat the “price” is an even larger peak <strong>in</strong> |S(jω)| for some other value of ω.We will show this us<strong>in</strong>g the cont<strong>in</strong>uity (analyticity) properties of functionsof a complex variable. Actually, we will see that many <strong>in</strong>terest<strong>in</strong>g propertiesof l<strong>in</strong>ear feedback systems are a direct consequence of the propertiesof analytic functions.6 See e.g., Kwakernaak <strong>and</strong> Sivan (1972).


1.5 A Brief History 191.5 A Brief HistoryBode (1945) used analytic function theory to exam<strong>in</strong>e the properties offeedback loops <strong>in</strong> the frequency doma<strong>in</strong>. At the time, Bode was work<strong>in</strong>gat Bell Laboratories. He used complex variable theory to show that therewere restrictions on the type of frequency doma<strong>in</strong> response that could beobta<strong>in</strong>ed from a stable feedback amplifier circuit. In particular, he showed— mutatis mut<strong>and</strong>is — that the sensitivity function, S, (def<strong>in</strong>ed <strong>in</strong> (1.5) fora particular case) must satisfy the follow<strong>in</strong>g <strong>in</strong>tegral relation for a stableopen-loop plant∫ ∞log |S(jω)| dω = 0 . (1.22)0This result shows that it is not possible to achieve arbitrary sensitivityreduction (i.e., |S| < 1) at all po<strong>in</strong>ts of the imag<strong>in</strong>ary axis. Thus, if |S(jω)| issmaller than one over a particular frequency range then it must necessarilybe greater than one over some other frequency range.Bode also showed that, for stable m<strong>in</strong>imum phase systems, it was notnecessary to specify both the magnitude <strong>and</strong> phase response <strong>in</strong> the frequencydoma<strong>in</strong> s<strong>in</strong>ce each was determ<strong>in</strong>ed uniquely by the other.Horowitz (1963) applied Bode’s theorems to the feedback control problem,<strong>and</strong> also obta<strong>in</strong>ed some prelim<strong>in</strong>ary results for open-loop unstablesystems. These latter extensions turned out to be <strong>in</strong> error due to a miss<strong>in</strong>gterm, but the pr<strong>in</strong>ciple is sound.Francis <strong>and</strong> Zames (1984) studied the feedback constra<strong>in</strong>ts imposed byORHP zeros of the plant <strong>in</strong> the context of H ∞ optimization. They showedthat if the plant has zeros <strong>in</strong> the ORHP, then the peak magnitude of thefrequency response of S(jω) necessarily becomes very large if |S(jω)| ismade small over frequencies which exceed the magnitude of the zeros.This phenomenon has become known as the “water-bed” or “push-pop”effect.Freudenberg <strong>and</strong> Looze (1985) brought many of the results together.They also produced def<strong>in</strong>itive results for the open-loop unstable case. Forexample, <strong>in</strong> the case of an unstable open-loop plant, (1.22) generalizes to(see Theorem 3.1.4 <strong>in</strong> Chapter 3)∫1 ∞π 0n p∑log |S(jω)| dω ≥ p i , (1.23)where {p i : i = 1, . . . , n p } is the set of ORHP poles of the open-loop plant.Equality is achieved <strong>in</strong> (1.23) if the set {p i : i = 1, . . . , n p } also <strong>in</strong>cludes allthe ORHP poles of the controller.In addition, Freudenberg <strong>and</strong> Looze expressed the <strong>in</strong>tegral constra<strong>in</strong>ts<strong>in</strong> various formats, both along the l<strong>in</strong>es of Bode <strong>and</strong> <strong>in</strong> a different formus<strong>in</strong>g the related idea of Poisson <strong>in</strong>tegrals. In particular, the Poisson <strong>in</strong>tegralspermit the derivation of an <strong>in</strong>sightful closed expression that displaysi=1


20 1. A Chronicle of System Design <strong>Limitations</strong>the water-bed effect experienced by nonm<strong>in</strong>imum phase systems. This expression,however, is not tight if the plant has more than one ORHP zero.About the same time, O’Young <strong>and</strong> Francis (1985) used Nevanl<strong>in</strong>na-Pick theory to characterize the smallest upper bound on the norm of themultivariable sensitivity function over a frequency range, with the constra<strong>in</strong>tthat the norm rema<strong>in</strong> bounded at all frequencies. This characterizationcan be used to show the water-bed effect <strong>in</strong> multivariable nonm<strong>in</strong>imumphase systems, <strong>and</strong> is tight for any number of ORHP zeros of theplant. Yet, no closed expression is available for this characterization butrather it has to be computed iteratively for each given plant.In 1987, Freudenberg <strong>and</strong> Looze extended the Bode <strong>in</strong>tegrals to scalarplants with time delays. In 1988 the same authors published a book thatsummarized the results for scalar systems, <strong>and</strong> also addressed the multivariablecase us<strong>in</strong>g s<strong>in</strong>gular values.In 1990, Middleton obta<strong>in</strong>ed Bode-type <strong>in</strong>tegrals for the complementarysensitivity function T. For example, the result equivalent to (1.23), foran open-loop system hav<strong>in</strong>g at least two pure <strong>in</strong>tegrators, is (see Theorem3.1.5 <strong>in</strong> Chapter 3)∫1 ∞π 0log |T(jω)| dωω 2 ≥ τ n q2 + ∑ 1, (1.24)q ii=1where {q i : i = 1, . . . , n q } is the set of ORHP zeros of the open-loop plant,<strong>and</strong> τ is the plant pure time delay.Compar<strong>in</strong>g (1.24) with (1.10) we see that (perhaps not unexpectedly)there is a strong connection between the time <strong>and</strong> frequency doma<strong>in</strong> results.Indeed, this is reasonable s<strong>in</strong>ce the only elements that are be<strong>in</strong>g usedare the complementarity, <strong>in</strong>terpolation <strong>and</strong> analyticity constra<strong>in</strong>ts <strong>in</strong>troduced<strong>in</strong> §1.2.Recent extensions of the results <strong>in</strong>clude multivariable systems, filter<strong>in</strong>gproblems, periodic systems, sampled-data systems <strong>and</strong>, very recently,nonl<strong>in</strong>ear systems. We will cover all of these results <strong>in</strong> the rema<strong>in</strong>der ofthe book.1.6 SummaryThis chapter has <strong>in</strong>troduced the central topic of this book. We are concernedwith fundamental limitations <strong>in</strong> the design of dynamical systems,limitations that are imposed by structural <strong>and</strong> constitutive characteristicsof the system under study. As we have seen, fundamental limitations arecentral to other discipl<strong>in</strong>es; <strong>in</strong>deed, we have provided as examples theCramér-Rao Inequality of Estimation Theory, <strong>and</strong> the Shannon Theoremof Communications.


1.6 Summary 21We have presented, through an example of scalar feedback control, twoof the mapp<strong>in</strong>gs that are central to this book, namely, the sensitivity <strong>and</strong>complementarity sensitivity functions. These mapp<strong>in</strong>gs are <strong>in</strong>dicators ofclosed-loop performance as well as stability robustness <strong>and</strong>, as such, it isnatural to require that they meet certa<strong>in</strong> desired design specifications. Weargue that it is important to establish the limits that one faces when attempt<strong>in</strong>gto achieve these specifications before any design is carried out.For example, it is impossible for a stable closed-loop system to achievesensitivity reduction over a frequency range where the open-loop systemhas a pure imag<strong>in</strong>ary zero. More generally, ORHP zeros <strong>and</strong> polesof the open-loop system impose constra<strong>in</strong>ts on the achievable frequencyresponse of the sensitivity <strong>and</strong> complementarity sensitivity functions.As a further illustration of these constra<strong>in</strong>ts, we have studied the effecton the closed-loop step response of pure <strong>in</strong>tegrators <strong>and</strong> ORHP zeros<strong>and</strong> poles of the open-loop system. We have seen, <strong>in</strong>ter-alia, that a stableunity feedback system with a real ORHP open-loop zero must haveundershoot <strong>in</strong> its step response. A similar conclusion holds with ORHPopen-loop poles <strong>and</strong> overshoot <strong>in</strong> the step response. These limitations areobviously worse for plants hav<strong>in</strong>g both nonm<strong>in</strong>imum phase zeros <strong>and</strong>unstable poles; the <strong>in</strong>verted pendulum example illustrates these complications.F<strong>in</strong>ally, we have provided an overview of the published work that focuseson systems design limitations, the majority of which build on theorig<strong>in</strong>al work of Bode (1945).Notes <strong>and</strong> ReferencesSome of the studies of Bode seem to have been paralleled <strong>in</strong> Europe. For example,some old books refer to the Bode ga<strong>in</strong>-phase relationship as the Bayard-Bode ga<strong>in</strong>phaserelationship (e.g., Gille et al. (1959, pp. 154-155), Nasl<strong>in</strong> (1965)), althoughprecise references are not given.§1.3 is ma<strong>in</strong>ly extracted from Middleton (1991).


Part II<strong>Limitations</strong> <strong>in</strong> L<strong>in</strong>ear<strong>Control</strong>


2Review of General ConceptsThis chapter collects some concepts related to l<strong>in</strong>ear, time-<strong>in</strong>variant systems,as well as properties of feedback control systems. It is ma<strong>in</strong>ly <strong>in</strong>tendedto <strong>in</strong>troduce notation <strong>and</strong> term<strong>in</strong>ology, <strong>and</strong> also to provide motivation<strong>and</strong> a brief review of the background material for Part II. The <strong>in</strong>terestedreader may f<strong>in</strong>d a more extensive treatment of the topics coveredhere <strong>in</strong> the books <strong>and</strong> papers cited <strong>in</strong> the Notes <strong>and</strong> References section atthe end of the chapter.Notation. As usual, ¡ , ¢ <strong>and</strong> denote the natural, real <strong>and</strong> complex numbers,respectively. 0 denotes the set ∪ {0}. The extended complex plane isthe set of all f<strong>in</strong>ite complex numbers (the complex ¢ plane ) <strong>and</strong> the po<strong>in</strong>tat <strong>in</strong>f<strong>in</strong>ity, ∞. We denote the extended complex plane ¢ by e ¢ = ∪ {∞}.The real <strong>and</strong> imag<strong>in</strong>ary parts of a complex number, s, are denoted by Re s<strong>and</strong> Im s respectively.We will denote ¢ by + ¢ <strong>and</strong> − the open right <strong>and</strong> left halves of the complexplane, <strong>and</strong> ¢ by +¢ <strong>and</strong> −their correspond<strong>in</strong>g closed versions. Sometimes,we will use the obvious acronyms ORHP, OLHP, CRHP, <strong>and</strong> CLHP.cdenote the regions <strong>in</strong>side <strong>and</strong> outside theSimilarly, the £ symbols £ <strong>and</strong>unit circle |z| = 1 <strong>in</strong> the complex plane, £ <strong>and</strong> £ <strong>and</strong> c their correspond<strong>in</strong>gclosed versions.The Laplace <strong>and</strong> Z transforms of a function f are denoted by Lf <strong>and</strong>Zf, respectively. In general, the symbol s is used to denote variables whenwork<strong>in</strong>g with Laplace transforms, <strong>and</strong> z when work<strong>in</strong>g with Z transforms.F<strong>in</strong>ally, we use lower case letters for time doma<strong>in</strong> functions, <strong>and</strong> uppercase letters for both constant matrices <strong>and</strong> transfer functions.


26 2. Review of General Concepts2.1 L<strong>in</strong>ear Time-Invariant SystemsA common practice is to assume that the system under study is l<strong>in</strong>eartime-<strong>in</strong>variant (LTI), causal, <strong>and</strong> of f<strong>in</strong>ite-dimension. 1 If the signals are assumedto evolve <strong>in</strong> cont<strong>in</strong>uous time, 2 then an <strong>in</strong>put-output model for sucha system <strong>in</strong> the time doma<strong>in</strong> has the form of a convolution equation,y(t) =∫ ∞h(t − τ) u(τ) dτ , (2.1)−∞where u <strong>and</strong> y are the system’s <strong>in</strong>put <strong>and</strong> output respectively. The functionh <strong>in</strong> (2.1) is called the impulse response of the system, <strong>and</strong> causalitymeans that h(t) = 0 for t < 0.The above system has an equivalent state-space descriptionẋ(t) = Ax(t) + Bu(t) ,y(t) = Cx(t) + Du(t) ,(2.2)where A, B, C, D are real matrices of appropriate dimensions.An alternative <strong>in</strong>put-output description, which is of special <strong>in</strong>teresthere, makes use of the transfer function, 3 correspond<strong>in</strong>g to system (2.1).The transfer function, H say, is given by the Laplace transform of h <strong>in</strong>(2.1), i.e.,H(s) =∫ ∞0e −st h(t) dt .After tak<strong>in</strong>g Laplace transform, (2.1) takes the formY(s) = H(s)U(s) , (2.3)where U <strong>and</strong> Y are the Laplace transforms of the <strong>in</strong>put <strong>and</strong> output signalsrespectively.The transfer function is related with the state-space description as followsH(s) = C(sI − A) −1 B + D ,which is sometimes denoted as[H =s A BC D]. (2.4)1 For an <strong>in</strong>troduction to these concepts see e.g., Kailath (1980), or Sontag (1990) for a moremathematically oriented perspective.2 Chapters 3 <strong>and</strong> 4 assume LTI systems <strong>in</strong> cont<strong>in</strong>uous time, Chapter 5 deals with periodicallytime-vary<strong>in</strong>g systems <strong>in</strong> discrete time, <strong>and</strong> Chapter 6 with sampled-data systems, i.e.,a comb<strong>in</strong>ation of digital control <strong>and</strong> LTI plants <strong>in</strong> cont<strong>in</strong>uous time.3 Sometimes the name transfer matrix is also used <strong>in</strong> the multivariable case.


2.1 L<strong>in</strong>ear Time-Invariant Systems 27We next discuss some properties of transfer functions. The transfer functionH <strong>in</strong> (2.4) is a matrix whose entries are scalar rational functions (dueto the hypothesis of f<strong>in</strong>ite-dimensionality) with real coefficients. A scalarrational function will be said to be proper if its relative degree, def<strong>in</strong>ed as thedifference between the degree of the denom<strong>in</strong>ator polynomial m<strong>in</strong>us thedegree of the numerator polynomial, is nonnegative. We then say that atransfer matrix H is proper if all its entries are proper scalar transfer functions.We say that H is biproper if both H <strong>and</strong> H −1 are proper. A squaretransfer matrix H is nons<strong>in</strong>gular if its determ<strong>in</strong>ant, det H, is not identicallyzero.For a discrete-time system mapp<strong>in</strong>g a discrete <strong>in</strong>put sequence, u k , <strong>in</strong>toan output sequence, y k , an appropriate <strong>in</strong>put-output model is given byY(z) = H(z)U(z) ,where U <strong>and</strong> Y are the Z transforms of the sequences u k <strong>and</strong> y k , <strong>and</strong> aregiven by∞∑∞∑U(z) = u k z −k , <strong>and</strong> Y(z) = y k z −k ,k=0<strong>and</strong> where H is the correspond<strong>in</strong>g transfer matrix <strong>in</strong> the Z-transform doma<strong>in</strong>.All of the above properties of transfer matrices apply also to transferfunctions of discrete-time systems.2.1.1 Zeros <strong>and</strong> PolesThe zeros <strong>and</strong> poles of a scalar, or s<strong>in</strong>gle-<strong>in</strong>put s<strong>in</strong>gle-output (SISO), transferfunction H are the roots of its numerator <strong>and</strong> denom<strong>in</strong>ator polynomialsrespectively. Then H is said to be m<strong>in</strong>imum phase if all its zeros are <strong>in</strong> theOLHP, <strong>and</strong> stable if all its poles are <strong>in</strong> the OLHP. If H has a zero <strong>in</strong> theCRHP, then H is said to be nonm<strong>in</strong>imum phase; similarly, if H has a pole <strong>in</strong>the CRHP, then H is said to be unstable.Zeros <strong>and</strong> poles of multivariable, or multiple-<strong>in</strong>put multiple-output(MIMO), systems are similarly def<strong>in</strong>ed but also <strong>in</strong>volve directionalityproperties. Given a proper transfer matrix H with the m<strong>in</strong>imal realization4 (A, B, C, D) as <strong>in</strong> (2.2), a po<strong>in</strong>t ¢ q ∈ is called a transmission zero 5 ofH if there exist complex vectors x <strong>and</strong> Ψ o such that the relation[x∗Ψ ∗ ] [ ]qI − A −Bo= 0 (2.5)−C −Dk=04 A m<strong>in</strong>imal realization is a state-space description that is both controllable <strong>and</strong> observable.5 S<strong>in</strong>ce transmission zeros are the only type of multivariable zeros that we will deal with,we will often refer to them simply as “zeros”. See MacFarlane <strong>and</strong> Karcanias (1976) for acomplete characterization.


28 2. Review of General Conceptsholds, where Ψ ∗ oΨ o =1 (the superscript ‘∗’ <strong>in</strong>dicates conjugate transpose).The vector Ψ o is called the output zero direction associated with q <strong>and</strong>, from(2.5), it satisfies Ψ ∗ oH(q) = 0. Transmission zeros verify a similar propertywith <strong>in</strong>put zero directions, i.e., there exists a complex vector Ψ i , Ψ ∗ i Ψ i = 1,such that H(q)Ψ i = 0. A zero direction is said to be canonical if it has onlyone nonzero component.For a given zero at s = q of a transfer matrix H, there may exist morethan one <strong>in</strong>put (or output) direction. In fact, there exist as many <strong>in</strong>put (oroutput) directions as the drop <strong>in</strong> rank of the matrix H(q). This deficiency<strong>in</strong> rank of the matrix H(s) at s = q is called the geometric multiplicity of thezero at frequency q.The poles of a transfer matrix H are the eigenvalues of the evolutionmatrix of any m<strong>in</strong>imal realization of H. We will assume that the sets ofORHP zeros <strong>and</strong> poles of H are disjo<strong>in</strong>t. Then, as <strong>in</strong> the scalar case, H issaid to be nonm<strong>in</strong>imum phase if it has a transmission zero at s = q with q <strong>in</strong>the CRHP. Similarly, H is said to be unstable if it has a pole at s = p with p<strong>in</strong> the CRHP. By extension, a pole <strong>in</strong> the CRHP is said to be unstable, <strong>and</strong>a zero <strong>in</strong> the CRHP is called nonm<strong>in</strong>imum phase.It is known (e.g., Kailath, 1980, p. 467) that if H admits a left or right<strong>in</strong>verse, then a pole of H will be a zero of H −1 . In this case we will refer tothe <strong>in</strong>put <strong>and</strong> output directions of the pole as those of the correspond<strong>in</strong>gzero of H −1 .With a slight abuse of term<strong>in</strong>ology, the above notions of zeros <strong>and</strong> poleswill be used also for nonproper transfer functions, without of course thestate-space <strong>in</strong>terpretation.F<strong>in</strong>ally, poles <strong>and</strong> zeros of discrete-time systems are def<strong>in</strong>ed <strong>in</strong> a similarway, the stability region be<strong>in</strong>g then the open unit disk <strong>in</strong>stead of theOLHP. In particular, a transfer function is nonm<strong>in</strong>imum phase if it has zerosoutside the open unit disk, £ , <strong>and</strong> it is unstable if it has poles outside£ .For certa<strong>in</strong> applications, it will be convenient to factorize transfer functionsof discrete systems <strong>in</strong> a way that their zeros at <strong>in</strong>f<strong>in</strong>ity are explicitlydisplayed.Example 2.1.1. A proper transfer function correspond<strong>in</strong>g to a scalar discretetimesystem has the formH(z) =b 0 z m + · · · + b mz n + a 1 z n−1 + · · · + a n, (2.6)where n ≥ m. Let δ = n−m be the relative degree of H given above. ThenH can be equivalently written asH(z) = ˜H(z)z −δ , (2.7)


2.1 L<strong>in</strong>ear Time-Invariant Systems 29where ˜H is a biproper transfer function, i.e., it has relative degree zero.Note that (2.7) explicitly shows the zeros at <strong>in</strong>f<strong>in</strong>ity of H. 6◦2.1.2 S<strong>in</strong>gular ValuesAt a fixed po<strong>in</strong>t s ∈ ¢ , let the s<strong>in</strong>gular value decomposition (Golub <strong>and</strong>Van Loan, 1983) of a transfer matrix H ∈ ¢ n×n be given byH(s) =n∑σ i (H(s))v ∗ i (H(s))u i (H(s)) ,i=1where σ i (H(s)) are the s<strong>in</strong>gular values of H(s), <strong>and</strong> are ordered so thatσ 1 ≥ σ 2 ≥ · · · ≥ σ n . Each set of vectors v i <strong>and</strong> u i form an orthonormalbasis of the space ¢ n <strong>and</strong> are termed the left <strong>and</strong> right s<strong>in</strong>gular vectors, respectively.When the s<strong>in</strong>gular values are evaluated on the imag<strong>in</strong>ary axis,i.e., for s = jω, then they are called pr<strong>in</strong>cipal ga<strong>in</strong>s of the transfer matrix,<strong>and</strong> the correspond<strong>in</strong>g s<strong>in</strong>gular vectors are the pr<strong>in</strong>cipal directions. Pr<strong>in</strong>cipalga<strong>in</strong>s <strong>and</strong> directions are useful <strong>in</strong> characteriz<strong>in</strong>g directionality propertiesof matrix transfer functions (Freudenberg <strong>and</strong> Looze, 1987).It is well-known that the s<strong>in</strong>gular values of H can be alternatively determ<strong>in</strong>edfrom the relationσ 2 i (H(s)) = λ i (H ∗ (s)H(s)) , (2.8)where λ i (H ∗ H) denotes the i-th eigenvalue of the matrix H ∗ H. We willdenote the largest s<strong>in</strong>gular value of H by σ(H), <strong>and</strong> its smallest s<strong>in</strong>gularvalue by σ(H).2.1.3 Frequency ResponseThe frequency response of a stable system, is def<strong>in</strong>ed as the response <strong>in</strong>steady-state (i.e., after the natural response has died out) to complex s<strong>in</strong>usoidal<strong>in</strong>puts of the form u = u 0 e jωt , where u 0 is a constant vector. Itis well known that this response, denoted by y ss , is given byy ss (t) = H(jω) u 0 e jωt .Hence, the steady-state response of a stable transfer function H to a complexs<strong>in</strong>usoid of frequency ω is given by the <strong>in</strong>put scaled by a “complexga<strong>in</strong>” equal to H(jω).For scalar systems, note that H(jω) = |H(jω)|e j arg H(jω) . It is usual tocall H(jω) the frequency response of the system, <strong>and</strong> |H(jω)| <strong>and</strong> arg H(jω)6 Note that the transfer function H(z) has, <strong>in</strong>clud<strong>in</strong>g those at ∞, the same number of zeros<strong>and</strong> poles, i.e., n. In fact, a rational function assumes every value the same number of times(e.g., Markushevich, 1965, p. 163), 0 <strong>and</strong> ∞ be<strong>in</strong>g just two particular values of <strong>in</strong>terest.


30 2. Review of General Conceptsthe magnitude <strong>and</strong> phase frequency responses, respectively. Note that themagnitude frequency response gives the “ga<strong>in</strong>” of H at each frequency,i.e., the ratio of output amplitude to <strong>in</strong>put amplitude. It is a commonpractice to plot the logarithm of the magnitude response, 7 <strong>and</strong> the phaseresponse versus ω on a logarithmic scale. These are called the Bode plots.For multivariable systems, the extension of these concepts is not unique.One possible characterization of the ga<strong>in</strong> of a MIMO system is by means ofits pr<strong>in</strong>cipal ga<strong>in</strong>s, def<strong>in</strong>ed <strong>in</strong> §2.1.2. In particular, the smallest <strong>and</strong> largestpr<strong>in</strong>cipal ga<strong>in</strong>s are of special <strong>in</strong>terest, s<strong>in</strong>ceσ(H(jω)) ≤ |H(jω)u(jω)||u(jω)|≤ σ(H(jω)) , (2.9)where | · | denotes the Euclidean norm. Hence, the ga<strong>in</strong> of H (understoodhere as the ratio of output norm to <strong>in</strong>put norm) is always between itssmallest <strong>and</strong> largest pr<strong>in</strong>cipal ga<strong>in</strong>s.A useful measure of the ga<strong>in</strong> of a system is obta<strong>in</strong>ed by tak<strong>in</strong>g the supremumover all frequencies of its largest pr<strong>in</strong>cipal ga<strong>in</strong>. Let H be propertransfer function with no poles on the imag<strong>in</strong>ary axis; then the <strong>in</strong>f<strong>in</strong>itynorm of H, denoted by ‖H‖ ∞ , is def<strong>in</strong>ed as‖H‖ ∞ = sup σ(H(jω)) . (2.10)ωFor scalar systems σ(H(jω)) = |H(jω)|, <strong>and</strong> hence the <strong>in</strong>f<strong>in</strong>ity norm issimply the peak value of the magnitude frequency response.2.1.4 Coprime FactorizationCoprime factorization of transfer matrices is a useful way of describ<strong>in</strong>gmultivariable systems. It consists of express<strong>in</strong>g the transfer matrix <strong>in</strong> questionas a “ratio” between stable transfer matrices. Due to the noncommutativityof matrices, there exist left <strong>and</strong> right coprime factorizations. Wewill use the notation “lcf” <strong>and</strong> “rcf” to st<strong>and</strong> for left <strong>and</strong> right coprimefactorization, respectively.The follow<strong>in</strong>g def<strong>in</strong>itions are reviewed from Vidyasagar (1985).Def<strong>in</strong>ition 2.1.1 (Coprimeness). Two stable <strong>and</strong> proper transfer matrices˜D, Ñ (N, D) hav<strong>in</strong>g the same number of rows (columns) are left (right)coprime if <strong>and</strong> only if there exist stable <strong>and</strong> proper transfer matrices Ỹ, ˜X(X, Y) such thatÑ ˜X + ˜DỸ = I . (XN + YD = I .)7 Frequently <strong>in</strong> decibels (dB), where |H| dB = 20 log 10 |H|.◦


2.2 Feedback <strong>Control</strong> Systems 31Def<strong>in</strong>ition 2.1.2 (Lcf, Rcf). Suppose H is a proper transfer matrix. An orderedpair, ( ˜D, Ñ), of proper <strong>and</strong> stable transfer matrices is a lcf of H if ˜Dis square <strong>and</strong> det( ˜D) ≠ 0, H = ˜D −1 Ñ, <strong>and</strong> ˜D, Ñ are right coprime.Similarly, an ordered pair (N, D), of proper <strong>and</strong> stable transfer matricesis a rcf of H if D is square <strong>and</strong> det(D) ≠ 0, H = ND −1 , <strong>and</strong> N, D are rightcoprime.◦Ñ <strong>and</strong> N will be called the numerators of the lcf or rcf, respectively. Similarly,˜D <strong>and</strong> D will be called the denom<strong>in</strong>ators of the lcf or rcf, respectively.Every proper transfer matrix admits left <strong>and</strong> right coprime factorizations.Also, if H = ˜D −1 Ñ = ND −1 , then (Kailath, 1980, Chapter 6)• q is a zero of H if <strong>and</strong> only if N(s) (Ñ(s)) loses rank at s = q.• p is a pole of H if <strong>and</strong> only if D(s) ( ˜D(s)) loses rank at s = p.Note that all of the concepts def<strong>in</strong>ed above apply to both cont<strong>in</strong>uous<strong>and</strong> discrete-time systems. In particular, for cont<strong>in</strong>uous-time systems, thefactorizations are performed over the r<strong>in</strong>g of proper transfer matrices withpoles <strong>in</strong> the OLHP; on the other h<strong>and</strong>, for discrete-time systems, the factorizationsare performed over the r<strong>in</strong>g of proper transfer matrices withpoles <strong>in</strong> the open unit disk.2.2 Feedback <strong>Control</strong> SystemsMost of Part II is concerned with the unity feedback configuration of Figure2.1, where the open-loop system, L, is formed of the series connection ofthe plant, G, <strong>and</strong> controller, K, i.e.,L = GK .In broad terms, the general feedback control problem is to design the controllerfor a given plant such that the output, y, follows the reference <strong>in</strong>put,r, <strong>in</strong> some specified way. This task has to be accomplished, <strong>in</strong> general,<strong>in</strong> the presence of disturbances affect<strong>in</strong>g the loop. Two common disturbancesare output disturbances, d, <strong>and</strong> sensor or measurement noise, w. Thedesign generally assumes a nom<strong>in</strong>al model for the plant, <strong>and</strong> then takesadditional precautions to ensure that the system cont<strong>in</strong>ues to perform <strong>in</strong>a reasonable fashion under perturbations of this nom<strong>in</strong>al model.We consider throughout the rest of this chapter that the (nom<strong>in</strong>al) plant<strong>and</strong> controller are cont<strong>in</strong>uous-time, l<strong>in</strong>ear, time-<strong>in</strong>variant systems, describedby transfer functions G <strong>and</strong> K respectively. Some of these assumptions willbe relaxed <strong>in</strong> other chapters.


32 2. Review of General Concepts❜dr + e +❜ ✲ ✐✲ ❄ yL ✲ ✐ ✲ ❜−+✻❄+w✐✛❜+2.2.1 Closed-Loop StabilityFIGURE 2.1. Feedback control system.A basic requirement for a feedback control loop is that of <strong>in</strong>ternal stability,or simply closed-loop stability, accord<strong>in</strong>g to the follow<strong>in</strong>g def<strong>in</strong>ition.Def<strong>in</strong>ition 2.2.1 (Internal Stability). Let the open-loop system <strong>in</strong> Figure2.1 be given by L = GK. Then the closed loop is <strong>in</strong>ternally stable if<strong>and</strong> only if I + GK is nons<strong>in</strong>gular <strong>and</strong> the four transfer functions[ ](I + GK)−1−(I + GK) −1 GK(I + GK) −1 I − K(I + GK) −1 ,Ghave all of their poles <strong>in</strong> the OLHP.We say that L is free of unstable hidden modes if there are no cancelationsof CRHP zeros <strong>and</strong> poles between the plant <strong>and</strong> controller whose cascadeconnection forms L. In the multivariable case, cancelations <strong>in</strong>volve bothlocation <strong>and</strong> directions of zeros <strong>and</strong> poles. For example, if the plant <strong>and</strong>controller are expressed us<strong>in</strong>g coprime factorizations asG = ˜D −1G ÑG = N G D −1G ,K = ˜D −1K ÑK = N K D −1K ,then L = GK has an unstable hidden mode if ˜D G <strong>and</strong> Ñ K share an ORHPzero with the same <strong>in</strong>put direction (Gómez <strong>and</strong> Goodw<strong>in</strong>, 1995). It is easyto see that the closed loop is not <strong>in</strong>ternally stable if L has unstable hiddenmodes.For discrete-time systems, the above concepts apply with the correspond<strong>in</strong>gdef<strong>in</strong>ition of the stability region.◦2.2.2 Sensitivity FunctionsFrom Figure 2.1 we see thatY = D + L[R − W − Y] .Solv<strong>in</strong>g for Y we haveY = (I + L) −1 D + L(I + L) −1 [R − W] . (2.11)


2.2 Feedback <strong>Control</strong> Systems 33The above expression suggests that two functions of central <strong>in</strong>terest <strong>in</strong> thedesign of feedback systems are the sensitivity function, S, <strong>and</strong> complementarysensitivity function, T, given respectively byS(s) = [I + L(s)] −1 , <strong>and</strong> T(s) = L(s) [I + L(s)] −1 . (2.12)We see that S maps output disturbances to the output, <strong>and</strong> T maps bothreference <strong>and</strong> sensor noise to the output. It is also straightforward to checkthat S also maps the reference <strong>in</strong>put to the error <strong>in</strong> Figure 2.1. 8 In fact, S<strong>and</strong> T are <strong>in</strong>timately connected with closed-loop performance <strong>and</strong> robustnessproperties, as we show <strong>in</strong> the follow<strong>in</strong>g subsections.2.2.3 Performance ConsiderationsIf the closed loop is <strong>in</strong>ternally stable, S <strong>and</strong> T are stable transfer functions.Then, the steady-state response of the system output to a disturbance d =d 0 e jωt , for d 0 ∈ ¢ n , is given by (cf. §2.1.3)y d (t) = S(jω)d 0 e jωt .Thus, the response to d can be made small by requir<strong>in</strong>g |S(jω)d 0 | ≪ 1.Clearly, from (2.9), the response of the system to output disturbances ofany direction <strong>and</strong> frequency ω can be made small ifσ(S(jω)) ≪ 1 . (2.13)A similar analysis for T shows that the response of the system to sensornoise of any direction <strong>and</strong> frequency ω can be made small ifσ(T(jω)) ≪ 1 . (2.14)Recall that T also maps the reference <strong>in</strong>put to the system output. It isthus clear that the feedback loop will have poor performance unless thefrequency content of the reference <strong>in</strong>put <strong>and</strong> the measurement noise aredisjo<strong>in</strong>t. This shows that there is an <strong>in</strong>herent trade-off between referencetrack<strong>in</strong>g <strong>and</strong> noise attenuation.Another trade-off arises between attenuation of output disturbances<strong>and</strong> sensor noise. Namely, the relationship S+T = I implies that the specifications(2.13) <strong>and</strong> (2.14) cannot be both satisfied over the same frequencyrange.The above discussion suggests that a sensible design should focus onachievable specifications. Consider, for example, the scalar case. If we assumethat the reference <strong>in</strong>put has low frequency content (which is typicallythe case), then it is reasonable to require|T(jω)| ≈ 1, ∀ω ∈ [0, ω 1 ] ,8 In the follow<strong>in</strong>g chapters, d <strong>and</strong> w <strong>in</strong> this figure are frequently set to zero; the reader isasked to keep <strong>in</strong> m<strong>in</strong>d that S <strong>and</strong> T also lead the output response to these signals.


34 2. Review of General Conceptsfor some ω 1 . This is equivalent to|S(jω)| ≪ 1, ∀ω ∈ [0, ω 1 ] . (2.15)Note that the above specification implies that output disturbances hav<strong>in</strong>gfrequency content <strong>in</strong> the range [0, ω 1 ] will be attenuated at the systemoutput. However, mak<strong>in</strong>g ω 1 too large may result <strong>in</strong> large magnitudesat the plant <strong>in</strong>put. To see this, note that |S| = 1/|1 + L| can only be madesmall by mak<strong>in</strong>g the magnitude of the open-loop system L = GK large.However, mak<strong>in</strong>g the open-loop ga<strong>in</strong> large over a frequency range wherethe ga<strong>in</strong> of the plant |G| is small requires a high controller ga<strong>in</strong>; hence, theresponse of the plant <strong>in</strong>put to disturbances <strong>in</strong> this range will be very large,usually lead<strong>in</strong>g to saturation. We conclude that the range of sensitivityreduction (<strong>and</strong> thus of reference track<strong>in</strong>g) is limited by the plant’s <strong>in</strong>putresponse to disturbances.It is then common to aim at a design that achieves specifications of theform|S(jω)| ≪ 1, ∀ω ∈ [0, ω 1 ] ,|T(jω)| ≪ 1, ∀ω ∈ [ω 2 , ∞) ,for some ω 2 > ω 1 . Typical shapes for S <strong>and</strong> T satisfy<strong>in</strong>g specifications ofthis k<strong>in</strong>d are shown <strong>in</strong> Figure 2.2.11|S(jω)|0.5|T(jω)|0.5010 −2 10 −1 10 0ω10 1 10 2010 −2 10 −1 10 0ω10 1 10 2FIGURE 2.2. Typical shapes for |S(jω)| <strong>and</strong> |T(jω)|.If the above requirements are satisfied, then the peak values of S <strong>and</strong> Twill occur <strong>in</strong> the <strong>in</strong>termediate range (ω 1 , ω 2 ). It is desirable to keep thesepeaks as small as possible <strong>in</strong> order to avoid overly large sensitivity to disturbances<strong>and</strong> excessive <strong>in</strong>fluence of sensor noise. A key po<strong>in</strong>t to emergelater <strong>in</strong> the book, however, is that nonm<strong>in</strong>imum phase zeros <strong>and</strong> unstablepoles of the open-loop system impose lower limits on the achievablepeaks of the sensitivity <strong>and</strong> complementary sensitivity functions.


2.2 Feedback <strong>Control</strong> Systems 352.2.4 Robustness ConsiderationsIf the actual plant, ˜G say, differs from the nom<strong>in</strong>al model G, then additionalrequirements are needed to preserve stability <strong>and</strong> good performance.The usual procedure is to consider a particular model for the allowableperturbations (usually based on practical considerations), <strong>and</strong> thento impose a condition on the design that will guarantee robust stability(<strong>and</strong>/or performance) for all perturbed plants with<strong>in</strong> the set of modelshav<strong>in</strong>g the allowable perturbations.Two common models used to describe plant uncerta<strong>in</strong>ty are divisive<strong>and</strong> multiplicative perturbation models. The (<strong>in</strong>put) divisive perturbationmodel, or perturbation of the plant <strong>in</strong>verse, assumes that˜G = G(I + ∆) −1 , (2.16)where ∆ is a stable transfer function satisfy<strong>in</strong>g a frequency dependentmagnitude boundσ(∆(jω)) ≤ W(ω), ∀ω . (2.17)The (output) multiplicative perturbation model assumes that˜G = (I + ∆)G , (2.18)where ∆ is a stable transfer function satisfy<strong>in</strong>g a frequency dependentmagnitude bound similar to (2.17). If this is the only <strong>in</strong>formation availableabout the uncerta<strong>in</strong>ty, then ∆ is termed unstructured uncerta<strong>in</strong>ty.It turns out that the sensitivity <strong>and</strong> complementary sensitivity functionseach characterize stability robustness of the system aga<strong>in</strong>st divisive <strong>and</strong>multiplicative plant uncerta<strong>in</strong>ty respectively. Indeed, the feedback systemwill be stable for all plants described by (2.16), (2.17), with ∆ stable, if <strong>and</strong>only if the system is stable when ∆ = 0 <strong>and</strong>σ(S(jω)) < 1/W(ω) , ∀ω , (2.19)where W is the bound <strong>in</strong> (2.17).Similarly, if the nom<strong>in</strong>al closed loop is stable, then the perturbed closedloop will rema<strong>in</strong> stable for all plants described by (2.18), (2.17), with ∆stable, if <strong>and</strong> only ifσ(T(jω)) < 1/W(ω) , ∀ω . (2.20)We remark that the same condition (2.19) is also necessary <strong>and</strong> sufficientfor robust stability aga<strong>in</strong>st additive perturbations of the open-loop of theform ˜L = L + ∆, where ∆ is stable <strong>and</strong> satisfies (2.17).Note that specifications such as (2.19) <strong>and</strong> (2.20) give more <strong>in</strong>sights <strong>in</strong>tothe desirable shapes for S <strong>and</strong> T. For example, <strong>in</strong> the scalar case, a typicalbound W(ω) for multiplicative perturbation grows at high frequencies. 99 The multiplicative perturbation model is useful to describe high frequency modell<strong>in</strong>g<strong>in</strong>accuracy, common <strong>in</strong> practice (Doyle <strong>and</strong> Ste<strong>in</strong>, 1981).


36 2. Review of General ConceptsThus, robust stability aga<strong>in</strong>st multiplicative uncerta<strong>in</strong>ty requires that |T(jω)|be small at high frequencies.F<strong>in</strong>ally, we briefly turn to performance robustness considerations. Wehave seen <strong>in</strong> §2.2.3 that S <strong>and</strong> T are <strong>in</strong>dicators of feedback system performance.We will next consider how these functions are affected by plantvariations. We focus on the scalar case but similar conclusions hold, mutatismut<strong>and</strong>is, for multivariable systems.Assume that the loop ga<strong>in</strong> changes from its nom<strong>in</strong>al value L to its actualvalue ˜L. It is not difficult to show that the relative changes <strong>in</strong> the sensitivity<strong>and</strong> complementary sensitivity functions are given by˜S − S˜S˜T − T˜T= −T ˜L − LL= S ˜L − L˜LThese relations show that the sensitivity function will be robust with respectto changes <strong>in</strong> the loop ga<strong>in</strong> <strong>in</strong> those frequency ranges where thenom<strong>in</strong>al complementary sensitivity is small (typically at high frequencies);conversely the complementary sensitivity will be robust with respectto changes <strong>in</strong> the loop ga<strong>in</strong> <strong>in</strong> those frequency ranges where the nom<strong>in</strong>alsensitivity is small (typically at low frequencies)..,2.3 Two Applications of Complex IntegrationIn the previous section we discussed the importance of atta<strong>in</strong><strong>in</strong>g a desiredshape for the frequency response of relevant transfer functions of feedbackcontrol loops. As we will see later, zeros <strong>and</strong> poles of the plant to becontrolled impose restrictions on the behavior of these functions at particularcomplex frequencies <strong>in</strong> the ORHP. A powerful tool exists by meansof which these restrictions <strong>in</strong> the ORHP can be transformed directly <strong>in</strong>toequivalent constra<strong>in</strong>ts on the frequency response. Indeed, the imag<strong>in</strong>aryaxis can be looked upon as the boundary of the ORHP, which is the regionwhere the special restrictions on the transfer functions occur, so that thebroad mathematical problem is that of relat<strong>in</strong>g the behavior of a function<strong>in</strong>side a region to its behavior on the boundary of the region. A mechanismfor this purpose is found <strong>in</strong> Cauchy’s theory of analytic functions <strong>and</strong> its<strong>in</strong>tegrals around closed contours. The rema<strong>in</strong>der of the book will makeextensive use of this theory, <strong>and</strong> a self-conta<strong>in</strong>ed review of this theory isgiven <strong>in</strong> Appendix A for convenience.As a prelim<strong>in</strong>ary use of Cauchy’s theory of complex <strong>in</strong>tegration, we willpresent two well-known applications. First we will establish the Nyquiststability theorem. We treat the cont<strong>in</strong>uous-time case; however, similar argumentsapply to the discrete-time problem. As a second application, we


2.3 Two Applications of Complex Integration 37will derive the Bode ga<strong>in</strong>-phase relationship. The latter relationship willalso serve as an <strong>in</strong>troduction to the constra<strong>in</strong>ts <strong>in</strong>troduced by nonm<strong>in</strong>imumphase zeros on the achievable shape of the frequency response.2.3.1 Nyquist Stability CriterionThe Nyquist criterion depends on a result known as the Pr<strong>in</strong>ciple of theArgument. This result uses the residue theorem (Theorem A.9.1 <strong>in</strong> AppendixA) to obta<strong>in</strong> <strong>in</strong>formation about the number of zeros of an analyticfunction (or about the number of zeros m<strong>in</strong>us the number of poles of ameromorphic function. 10Theorem 2.3.1 (Pr<strong>in</strong>ciple of the Argument). Let C be a closed simple contourconta<strong>in</strong>ed <strong>in</strong> a simply connected doma<strong>in</strong> D. Let f be a meromorphicfunction <strong>in</strong> D <strong>and</strong> suppose that f has no zeros or poles on C. Let n q bethe number of zeros <strong>and</strong> n p the number of poles of f <strong>in</strong> the <strong>in</strong>terior of C,where a multiple zero or pole is counted accord<strong>in</strong>g to its multiplicity. Then∮f ′ (s)f(s) ds = j2π (n q − n p ) , (2.21)Cwhere f ′ denotes the derivative of f, <strong>and</strong> <strong>in</strong>tegration <strong>in</strong> (2.21) is performed<strong>in</strong> the counter-clockwise direction.Proof. The only possible s<strong>in</strong>gularities of the meromorphic function f ′ /f<strong>in</strong>side C are the zeros <strong>and</strong> poles of f. Suppose that s 0 represents a zero orpole of f with multiplicity n. We can writef(s) = (s − s 0 ) n ˜f(s) , (2.22)where ˜f is analytic <strong>in</strong> a neighborhood of s 0 <strong>and</strong> ˜f(s 0 ) ≠ 0. Hencef ′ (s)f(s) = n + ˜f ′ (s)s − s 0˜f(s)(2.23)<strong>in</strong> a neighborhood of s 0 . Thus the residue of f ′ /f at s 0 is n (see §A.9.1 <strong>in</strong>Appendix A). Note that n is positive if s 0 is a zero of f, <strong>and</strong> n is negative ifs 0 is a pole of f. Apply<strong>in</strong>g Theorem A.9.1 of Appendix A yields (2.21) <strong>and</strong>completes the proof.□We will next see why Theorem 2.3.1 is called the pr<strong>in</strong>ciple of the argument.Let the equation for C be s = s(ζ), with ζ <strong>in</strong> [a, b] <strong>and</strong> let ξ = f(s).Consider now the image of C under f, ˜C say. Thus, the equation of ˜C <strong>in</strong>the ξ plane isξ = f(s(ζ)) , ζ ∈ [a, b] .10 A function is meromorphic <strong>in</strong> a doma<strong>in</strong> D if it is def<strong>in</strong>ed <strong>and</strong> analytic <strong>in</strong> D except forpoles.


38 2. Review of General ConceptsS<strong>in</strong>ce f is never zero on C, then the curve ˜C does not pass through theorig<strong>in</strong> of the ξ plane. Hence, it is possible to def<strong>in</strong>e a cont<strong>in</strong>uous logarithmF(ζ) = log f(s(ζ))on ˜C. On any smooth portion of C, we have, by differentiation,F ′ (ζ) = f ′ (s(ζ))f(s(ζ)) s ′ (ζ) ,<strong>and</strong> hence, by the fundamental theorem of calculus (cf. (A.18) <strong>in</strong> Appendix A),C∫ baf ′ (s(ζ))f(s(ζ)) s ′ (ζ) dζ = log f(s(ζ))∣This is equivalent to∮∣ f ′ (s)∣∣∣Cf(s) ds = log f(s) = log |f(s)|∣ + j arg f(s)∣ .C CThe first term on the right is zero s<strong>in</strong>ce C is closed. Then, divid<strong>in</strong>g by 2π<strong>and</strong> us<strong>in</strong>g (2.21), we get2π (n q − n p ) = arg f(s)∣ . (2.24)CThat is, n q − n p is the number of times the image curve ˜C w<strong>in</strong>ds aroundthe orig<strong>in</strong> <strong>in</strong> the ξ plane, when C is traversed <strong>in</strong> the counter-clockwisesense.The pr<strong>in</strong>ciple of the argument f<strong>in</strong>ds immediate application <strong>in</strong> the Nyquiststability criterion. Specifically, let L = GK be the open-loop transfer functionof the feedback control system shown <strong>in</strong> Figure 2.1. Assume furtherthat L is scalar. We have seen <strong>in</strong> §2.2.1 that the closed loop is <strong>in</strong>ternallystable if <strong>and</strong> only if there are no unstable cancelations <strong>in</strong> L, L(∞) ≠ −1,<strong>and</strong> all the zeros of the characteristic equationba1 + L(s) = 0 (2.25)have negative real part.Consider next the contour C shown <strong>in</strong> Figure 2.3, consist<strong>in</strong>g of the imag<strong>in</strong>aryaxis <strong>and</strong> a semicircle of <strong>in</strong>f<strong>in</strong>ite radius <strong>in</strong>to the ORHP. If L has poleson the imag<strong>in</strong>ary axis, then C must have small <strong>in</strong>dentations to avoid them.Such a contour is called Nyquist contour <strong>and</strong> its image through L(s) iscalled the Nyquist plot of L.A necessary <strong>and</strong> sufficient condition for closed-loop stability is furnishedby the follow<strong>in</strong>g theorem..


2.3 Two Applications of Complex Integration 39jωσFIGURE 2.3. Nyquist contour.Theorem 2.3.2 (Nyquist Stability Criterion). The feedback control systemof Figure 2.1 is <strong>in</strong>ternally stable if <strong>and</strong> only if L(∞) ≠ −1 <strong>and</strong> thenumber of counter-clockwise revolutions made by the Nyquist plot of theopen-loop transfer function L = GK around the po<strong>in</strong>t s = −1, is equal tothe number of unstable poles of G plus the number of unstable poles of K.Proof. We apply Theorem 2.3.1 to the function f = 1 + L us<strong>in</strong>g the Nyquistcontour, C <strong>in</strong> Figure 2.3 (note that now <strong>in</strong>tegration is <strong>in</strong> the clockwise direction).Any pole or zero of 1 + L with positive real part will therefore liewith<strong>in</strong> this contour. Thus, closed-loop stability is equivalent to n q = 0 <strong>in</strong>(2.21) or (2.24).It follows from (2.24) that the phase-shift <strong>in</strong>crement of 1 + L(s) as s traversesthe Nyquist contour is 2π (n p − n q ), where n p is the number of unstablepoles of L. But, s<strong>in</strong>ce L <strong>and</strong> 1+L have the same poles, it is equivalentto count revolutions of L(s) around the po<strong>in</strong>t (−1, 0). Thus the number ofcounter-clockwise revolutions of the Nyquist plot of L = GK around thepo<strong>in</strong>t s = −1 is different from the number of unstable poles of G plus thenumber of unstable poles of K if <strong>and</strong> only if there is an unstable cancelation<strong>in</strong> L, or 1 + L is nonm<strong>in</strong>imum phase (i.e., n q ≠ 0). The result thenfollows.□Example 2.3.1. Let L <strong>in</strong> Figure 2.1 be a strictly proper transfer function.S<strong>in</strong>ce L(Re jθ ) vanishes as R becomes <strong>in</strong>f<strong>in</strong>ite, it follows that the phase-shiftof 1 + L(s) as s traverses the Nyquist contour is given by its phase-shiftwhen s moves on the imag<strong>in</strong>ary axis. Moreover, s<strong>in</strong>ce L has real coefficients,then it suffices to consider 2 times the phase-shift encountered as smoves along the positive imag<strong>in</strong>ary axis. Also, note that 1 + L(jω) is thevector from the −1 po<strong>in</strong>t to the po<strong>in</strong>t on the Nyquist plot at frequency ω.Now assume that L is strictly proper <strong>and</strong> has one unstable pole. Accord<strong>in</strong>gto the Nyquist stability criterion <strong>and</strong> the previous discussion, ifthe closed loop is stable, then the vector 1 + L(jω) must rotate an angle


40 2. Review of General Conceptsof π (<strong>in</strong> the counter-clockwise sense) as ω goes from 0 to ∞. But, s<strong>in</strong>ce Lis strictly proper, 1 + L(j∞) is the vector from the −1 po<strong>in</strong>t to the orig<strong>in</strong>of the complex plane. It thus follows that closed-loop stability requiresL(0) < −1.◦2.3.2 Bode Ga<strong>in</strong>-Phase RelationshipsAs a further application of analytic function theory, we will review thega<strong>in</strong>-phase relationships orig<strong>in</strong>ally developed by Bode (1945), which establishthat, for a stable m<strong>in</strong>imum phase transfer function, the phase ofthe frequency response is uniquely determ<strong>in</strong>ed by the magnitude of thefrequency response <strong>and</strong> vice versa.We beg<strong>in</strong> by show<strong>in</strong>g that the real <strong>and</strong> imag<strong>in</strong>ary parts of a proper stablerational function with real coefficients are dependent of each other. Weconsider this dependence at po<strong>in</strong>ts of the imag<strong>in</strong>ary axis.Theorem 2.3.3 (Bode’s Real-imag<strong>in</strong>ary Parts Relationship). 11 Let H be aproper stable transfer function, <strong>and</strong> suppose that, at s = jω, H(s) can bewritten as H(jω) = U(ω) + jV(ω), where U <strong>and</strong> V are real valued. Thenfor any ω 0V(ω 0 ) = 2ω 0π∫ ∞0U(ω) − U(ω 0 )ω 2 − ω 2 0dω . (2.26)Proof. Let C be the clockwise oriented contour shown <strong>in</strong> Figure 2.4, <strong>and</strong>consist<strong>in</strong>g of the imag<strong>in</strong>ary axis, with <strong>in</strong>f<strong>in</strong>itely small <strong>in</strong>dentations at thepo<strong>in</strong>ts +jω 0 <strong>and</strong> −jω 0 (C 2 <strong>and</strong> C 3 <strong>in</strong> Figure 2.4), <strong>and</strong> the semicircle C 1 ,which has <strong>in</strong>f<strong>in</strong>ite radius <strong>in</strong> the ORHP 12 .Then the functionsH(s) − U(ω 0 )s − jω 0<strong>and</strong> H(s) − U(ω 0)s + jω 0are analytic on <strong>and</strong> <strong>in</strong>side C. Hence, apply<strong>in</strong>g Cauchy’s <strong>in</strong>tegral theorem(see §A.5.2 <strong>in</strong> Appendix A) to both functions <strong>and</strong> subtract<strong>in</strong>g yields∮ ( H(s) − U(ω0 )− H(s) − U(ω )0)ds = 0 . (2.27)s − jω 0 s + jω 0CThe <strong>in</strong>tegral above may be decomposed as2ω 0∫ ∞−∞H(jω) − U(ω 0 )ω 2 − ω 2 0dω + I 1 + I 2 + I 3 = 0 , (2.28)11 This is, <strong>in</strong> fact, one of the many real-imag<strong>in</strong>ary relationships derived by Bode .12 This should be understood as follows: the <strong>in</strong>dentations on the imag<strong>in</strong>ary axis have radiiρ > 0, <strong>and</strong> the large semicircle <strong>in</strong> the ORHP has f<strong>in</strong>ite radius R. These are then considered<strong>in</strong> the limit as ρ → 0 <strong>and</strong> R → ∞. In fact, the large semicircle is noth<strong>in</strong>g but an <strong>in</strong>dentationaround ∞.


jω 0C 22.3 Two Applications of Complex Integration 41jω−jω 0C 3C 1σFIGURE 2.4. Contour used <strong>in</strong> Theorem 2.3.3.where I 1 , I 2 , <strong>and</strong> I 3 are the <strong>in</strong>tegrals over C 1 , C 2 , <strong>and</strong> C 3 respectively. The<strong>in</strong>tegral on the imag<strong>in</strong>ary axis can be written as2ω 0∫ ∞−∞H(jω) − U(ω 0 )ω 2 − ω 2 0dω = 4ω 0∫ ∞0U(ω) − U(ω 0 )ω 2 − ω 2 0dω ,s<strong>in</strong>ce the real <strong>and</strong> imag<strong>in</strong>ary parts of a transfer function evaluated at s =jω are even <strong>and</strong> odd functions of ω respectively.Next note that the <strong>in</strong>tegral I 1 vanishes because H is proper.Now consider the <strong>in</strong>tegral I 2 . S<strong>in</strong>ce the radius of C 2 is <strong>in</strong>f<strong>in</strong>itely small,we can approximate H(s) on C 2 by the constant H(jω 0 ). We can also neglectthe contribution of the fraction 1/(s + jω) <strong>in</strong> comparison with that of1/(s − jω). Then∫H(jω 0 ) − U(ω 0 )I 2 =dsC 2s − jω 0∫1= jV(ω 0 )dsC 2s − jω 0= −πV(ω 0 ) ,where the last step follows from (A.32) <strong>in</strong> Appendix A.Similarly, we can show that the <strong>in</strong>tegral I 3 equals −πV(ω 0 ). F<strong>in</strong>ally, substitut<strong>in</strong>g<strong>in</strong>to (2.28) <strong>and</strong> rearrang<strong>in</strong>g gives the desired expression. □The above theorem shows that values on the jω-axis of the imag<strong>in</strong>arypart of a stable <strong>and</strong> proper transfer function can be reconstructed fromknowledge of the real part on the entire jω-axis. Conversely, under theassumption that the transfer function is strictly proper <strong>and</strong> stable, it can beshown that the real part can be obta<strong>in</strong>ed from the imag<strong>in</strong>ary part, i.e.,U(ω 0 ) = − 2 π∫ ∞0ω [V(ω) − V(ω 0 )]ω 2 − ω 2 0dω , (2.29)


42 2. Review of General Conceptswhich follows similarly by add<strong>in</strong>g <strong>in</strong>stead of subtract<strong>in</strong>g <strong>in</strong> (2.27).It is shown <strong>in</strong> §A.6 of Appendix A that the values of a function analytic<strong>in</strong> a given region can be reconstructed from its values on the boundary.Comb<strong>in</strong><strong>in</strong>g this with relations (2.26) <strong>and</strong> (2.29), we deduce that the valuesof a stable <strong>and</strong> strictly proper transfer function on the CRHP are completelydeterm<strong>in</strong>ed by the real or imag<strong>in</strong>ary part of its frequency response.We will next show that the ga<strong>in</strong> <strong>and</strong> the phase of the frequency responseof a stable, m<strong>in</strong>imum phase transfer function are dependent on each other.Theorem 2.3.4 (Bode’s Ga<strong>in</strong>-phase Relationship). Let H be a proper, stable,<strong>and</strong> m<strong>in</strong>imum phase transfer function, such that H(0) > 0. Then, atany frequency ω 0 , the phase φ(ω 0 ) arg H(jω 0 ) satisfiesφ(ω 0 ) = 1 πwhere u = log(ω/ω 0 ).Proof. Consider∫ ∞−∞d log |H(jω 0 e u )|dulog coth ∣ u ∣ du , (2.30)2H(jω 0 ) = U(ω 0 ) + jV(ω 0 ) |H(jω 0 )| e jφ(ω 0) . (2.31)S<strong>in</strong>ce H has no zeros <strong>in</strong> the CRHP, tak<strong>in</strong>g logarithms <strong>in</strong> (2.31) giveslog H(jω 0 ) = log |H(jω 0 )| + jφ(ω 0 ) m(ω 0 ) + jφ(ω 0 ). (2.32)Compar<strong>in</strong>g (2.31) <strong>and</strong> (2.32) shows that the magnitude characteristic m(ω 0 )<strong>and</strong> the phase φ(ω 0 ) are related to log H(jω 0 ) <strong>and</strong> to each other <strong>in</strong> thesame way that U(ω 0 ) <strong>and</strong> V(ω 0 ) are related to H(jω 0 ) <strong>and</strong> each other.Hence (2.29) <strong>and</strong> (2.26) immediately implym(ω 0 ) = − 2 πφ(ω 0 ) = 2 ω 0π∫ ∞0∫ ∞0ω [φ(ω) − φ(ω 0 )]ω 2 − ω 2 dω ,0(2.33)m(ω) − m(ω 0 )ω 2 − ω 2 dω .0(2.34)Note that the assumption that H has no zeros or poles <strong>in</strong> the CRHP guaranteesthe validity of the <strong>in</strong>tegrals above, s<strong>in</strong>ce log H is analytic <strong>in</strong> thef<strong>in</strong>ite CRHP. The s<strong>in</strong>gularity at ∞ aris<strong>in</strong>g from a strictly proper H is ruledout <strong>in</strong> the chosen contour of <strong>in</strong>tegration (see footnote 12 on page 40). Thefact that | log H(s)|/|s| → 0 when |s| → ∞ elim<strong>in</strong>ates the <strong>in</strong>tegral along thelarge semicircle <strong>in</strong> the ORHP.Next, consider (2.34). Chang<strong>in</strong>g variables to u = log(ω/ω 0 ) <strong>and</strong> denot<strong>in</strong>g˜m(u) = m(ω) givesφ(ω 0 ) = 2 π= 1 π∫ ∞−∞∫ ∞−∞˜m(u) − m(ω 0 )e u − e −u du˜m(u) − m(ω 0 )du .s<strong>in</strong>h(u)


2.3 Two Applications of Complex Integration 43Divid<strong>in</strong>g the complete range of <strong>in</strong>tegration <strong>in</strong>to separate ranges above <strong>and</strong>below u = 0 <strong>and</strong> <strong>in</strong>tegrat<strong>in</strong>g by parts yields∫ 0−∞φ(ω 0 ) = 1 ˜m(u) − m(ω 0 )du + 1 ˜m(u) − m(ω 0 )duπ s<strong>in</strong>h(u) π s<strong>in</strong>h(u)= − 1 ( )∣ −u ∣∣∣0π [ ˜m(u) − m(ω 0)] log coth2−∞+ 1 ∫ 0( )d ˜m(u) −uπ −∞ dulog coth du2− 1 ( u) ∣ ∞π [ ˜m(u) − m(ω 0)] log coth ∣∣2+ 1 π∫ ∞0∫ ∞0d ˜m(u)( u)du log coth du .20(2.35)Near u = 0, the quantity ˜m(u) − m(ω 0 ) behaves proportionally to u,whilst log coth(u/2) will vary as − log(u/2). Thus, at the limit u → 0, the<strong>in</strong>tegrated portions of (2.35) behave as u log u, which is known to vanishas u → 0. As for the other limits, we have that lim u→−∞ ˜m(u) =lim ω→0 m(ω) = m(0), which is f<strong>in</strong>ite s<strong>in</strong>ce H is stable <strong>and</strong> m<strong>in</strong>imumphase; also lim u→∞ ˜m(u) log coth(u/2) = lim ω→∞ m(ω)2(ω 0 /ω) = 0.Hence both <strong>in</strong>tegrated portions <strong>in</strong> (2.35) are equal to zero. The result thenfollows on comb<strong>in</strong><strong>in</strong>g the rema<strong>in</strong><strong>in</strong>g two <strong>in</strong>tegrals <strong>in</strong> (2.35).□The implications of (2.30) can be easily appreciated us<strong>in</strong>g properties ofthe weight<strong>in</strong>g function appear<strong>in</strong>g <strong>in</strong> (2.30), namelylog coth ∣ u ∣ ∣ = logω + ω 0 ∣∣∣2 ∣ . (2.36)ω − ω 0This function is plotted <strong>in</strong> Figure 2.5.As we can see from this figure, the weight<strong>in</strong>g function becomes logarithmically<strong>in</strong>f<strong>in</strong>ite at the po<strong>in</strong>t ω = ω 0 . Thus, we conclude from (2.30) thatthe slope of the magnitude curve <strong>in</strong> the vic<strong>in</strong>ity of ω 0 , say c, determ<strong>in</strong>esthe phase φ(ω 0 ):φ(ω 0 ) ≈ c π∫ ∞−∞log coth ∣ u ∣ du = c 2 ππ 22 = cπ 2 .Hence, for stable <strong>and</strong> m<strong>in</strong>imum phase transfer functions, a slope of 20cdB/decade <strong>in</strong> the ga<strong>in</strong> <strong>in</strong> the vic<strong>in</strong>ity of ω 0 implies a phase angle of approximatelyc π/2 rad sec −1 .The above arguments lead classical designers to conclude that, to ensureclosed-loop stability, the slope of the open-loop ga<strong>in</strong> characteristic, |L(jω)|,should be <strong>in</strong> the range -20 to -30 dB/decade at the ga<strong>in</strong> cross-over po<strong>in</strong>t


44 2. Review of General Concepts32.5log (coth |u / 2| )21.510.5010 −1 10 0 e u10 1FIGURE 2.5. Weight<strong>in</strong>g function of the Bode ga<strong>in</strong>-phase relationship.(i.e., at the frequency at which |L| = 1 or 0 dB), s<strong>in</strong>ce this would imply thephase would be less than 180 ◦ i.e. the Nyquist plot would not encircle the‘-1‘ po<strong>in</strong>t.Example 2.3.2. Let us consider the X-29 aircraft design example discussed<strong>in</strong> §1.1 of Chapter 1. This system is (approximately) modelled by a strictlyproper transfer function, G, which is unstable <strong>and</strong> nonm<strong>in</strong>imum phase.For one flight condition, the unstable pole is at 6 <strong>and</strong> the nonm<strong>in</strong>imumphase zero at 26. It is desired to use a stable, m<strong>in</strong>imum phase controller <strong>in</strong>series with G, such that the closed loop is stable <strong>and</strong> has a phase marg<strong>in</strong>of π/4. Consider the open-loop system formed by the cascade of plant <strong>and</strong>controller <strong>and</strong> modeled by a transfer function, L say. This system is neitherstable nor m<strong>in</strong>imum phase <strong>in</strong> open loop. However, we can associate withL another transfer function, ˜L, def<strong>in</strong>ed as followsL(s) = −˜L(s)( s + ps − p) ( s − qs + q), (2.37)where ˜L is stable, m<strong>in</strong>imum phase, <strong>and</strong> such that ˜L(0) > 0, <strong>and</strong> where p<strong>and</strong> q correspond to the (real) ORHP pole <strong>and</strong> zero of G respectively. 13The negative sign <strong>in</strong> (2.37) is necessary to guarantee a stable closed loop(see Example 2.3.1). We note that|˜L(jω)| = |L(jω)| . (2.38)Now from (2.37) we have that φ(ω 0 ) arg L(jω 0 ) is given byφ(ω 0 ) = arg ˜L(jω 0 ) − arg p − jω 0p + jω 0− arg jω 0 + qjω 0 − q .13 Actually, the device used above to associate a stable, m<strong>in</strong>imum phase transfer functionwith an unstable, nonm<strong>in</strong>imum phase one will be used repeatedly <strong>in</strong> subsequent chapters.


2.4 Summary 45Also, s<strong>in</strong>ce ˜L is stable, m<strong>in</strong>imum phase, <strong>and</strong> ˜L(0) > 0, we can use (2.30) toobta<strong>in</strong>φ(ω 0 ) = 1 d log |˜L(jω 0 e u )|log coth ∣ u ∣ du−arg (p−jω 0)(jω 0 +q)π du2 (p+jω 0 )(jω 0 −q) .∫ ∞−∞F<strong>in</strong>ally, us<strong>in</strong>g (2.38) we obta<strong>in</strong>φ(ω 0 ) = 1 π= 1 π∫ ∞−∞∫ ∞−∞d log |L(jω 0 e u )|dud log |L(jω 0 e u )|dulog coth ∣ u ∣ du−arg (p−jω 0)(jω 0 +q)2 (p+jω 0 )(jω 0 −q)log coth ∣ u ∣ du−2 arctan ω 0/q+p/ω 0.21−p/qSay that we want a slope of -10 dB/decade at the ga<strong>in</strong> cross-over frequency;then this means that the first term <strong>in</strong> the above expression is approximately−π/4. Now, the second term has its least negative value forω 0 = √ pq, <strong>and</strong> for p = 6, q = 26, we obta<strong>in</strong>φ(ω 0 ) ≤ (−π/4 − 1.79)rad= −147 ◦ .Hence, the maximum possible phase marg<strong>in</strong> <strong>in</strong> this case is 33 ◦ .2.4 SummaryIn this chapter we have provided an overview of system properties, withemphasis on the frequency response of LTI systems. We have consideredstability, performance <strong>and</strong> robustness of feedback control loops. We havealso shown that the sensitivity <strong>and</strong> complementary sensitivity functionscan be used to quantify important feedback properties.In addition, we have given two applications of Cauchy’s <strong>in</strong>tegral theoremsto systems theory, namely the Nyquist stability criterion <strong>and</strong> theBode ga<strong>in</strong>-phase relationship.Notes <strong>and</strong> ReferencesThis chapter is ma<strong>in</strong>ly oriented towards people with some background <strong>in</strong> control<strong>and</strong> systems theory. For a more extensive <strong>in</strong>troduction see e.g., Kailath (1980),Frankl<strong>in</strong> et al. (1994) or Sontag (1990). The material <strong>in</strong>cluded here was based onthe references below.LTI SystemsSee e.g., Francis (1991), Maciejowski (1991), Middleton <strong>and</strong> Goodw<strong>in</strong> (1990), Kailath(1980) <strong>and</strong> Frankl<strong>in</strong> et al. (1994).


46 2. Review of General ConceptsFeedback <strong>Control</strong> SystemsSee e.g., Freudenberg <strong>and</strong> Looze (1988), Doyle <strong>and</strong> Ste<strong>in</strong> (1981), Maciejowski (1991)<strong>and</strong> Kwakernaak (1995).A different approach to the concept of ga<strong>in</strong>s for multivariable systems is found<strong>in</strong> Postlethwaite <strong>and</strong> MacFarlane (1979), where emphasis is placed on the (frequencydependent) eigenvalues of the transfer matrix.§2.3 is ma<strong>in</strong>ly based on Bode (1945).Example 2.3.2 is taken from Åström (1996), where additional limitations on theachievable phase marg<strong>in</strong> are also discussed.


3SISO <strong>Control</strong>In this chapter we present results on performance limitations <strong>in</strong> l<strong>in</strong>ears<strong>in</strong>gle-<strong>in</strong>put s<strong>in</strong>gle-output control systems. These results are the cornerstonesof classical design. We focus on the sensitivity <strong>and</strong> complementarysensitivity functions, S <strong>and</strong> T. Although this chapter is based on contributionsby many authors, ma<strong>in</strong>ly dur<strong>in</strong>g the 80’s, they may be seen withjustice as direct descendants of the many ideas conta<strong>in</strong>ed <strong>in</strong> Bode (1945).3.1 Bode Integral FormulaeBode’s book (Bode, 1945) ma<strong>in</strong>ly dealt with analysis <strong>and</strong> design of feedbackamplifiers, but it settled the mathematical foundation for what isnow known as classical control theory. Bode’s methods had a deep <strong>and</strong>last<strong>in</strong>g impact <strong>in</strong> systems science, <strong>and</strong> many of his results still have repercussions.From his analysis, for example, it can be concluded (see §3.1.2)that, if the open-loop system is a stable rational function with relative degreeat least two, then, provided the closed-loop system is stable, the sensitivityfunction must satisfy the follow<strong>in</strong>g <strong>in</strong>tegral relation:∫ ∞log |S(jω)|dω = 0 . (3.1)0This <strong>in</strong>tegral establishes that the net area subtended by the plot of |S(jω)|on a logarithmic scale is zero. Hence, the contribution to this area of thosefrequency ranges where there is sensitivity amplification (|S(jω)| > 1)


48 3. SISO <strong>Control</strong>must equal that of those ranges where there is sensitivity attenuation (|S(jω)| 1|S| < 1ωFIGURE 3.1. Area balance of the sensitivity <strong>in</strong>tegral.Due to the aforementioned balance of areas, the relation (3.1) is sometimescalled the area formula. As a matter of fact, (3.1) is a control system<strong>in</strong>terpretation due to Horowitz (1963) of the orig<strong>in</strong>al results of Bode(1945). These results dealt with the application of Cauchy’s <strong>in</strong>tegral theorem(Theorem A.5.3 <strong>in</strong> Appendix A) to the real <strong>and</strong> imag<strong>in</strong>ary parts ofanalytic functions (some of them have already been discussed <strong>in</strong> §2.3.2 ofChapter 2). In the follow<strong>in</strong>g subsection we review the orig<strong>in</strong>al formula ofBode that <strong>in</strong>spired (3.1).3.1.1 Bode’s Attenuation Integral TheoremThe theorem that we present <strong>in</strong> this section further develops the relationshipbetween real <strong>and</strong> imag<strong>in</strong>ary components of analytic functions. Morespecifically, it provides an expression for the behavior of the imag<strong>in</strong>arypart at high frequencies, <strong>in</strong> terms of the <strong>in</strong>tegral of the real part, of a functionH that satisfies the follow<strong>in</strong>g conditions:(i) H(jω) = P(ω) + jQ(ω) = H(−jω), where P <strong>and</strong> Q are real-valuedfunctions of ω;(ii) H(s) is analytic at s = ∞ <strong>and</strong> <strong>in</strong> the CRHP except possibly for s<strong>in</strong>gularitiess 0 = jω 0 on the f<strong>in</strong>ite imag<strong>in</strong>ary axis that satisfy lim s→s0 (s −s 0 )H(s) = 0.


3.1 Bode Integral Formulae 49Bode called the real <strong>and</strong> imag<strong>in</strong>ary parts, P <strong>and</strong> Q, attenuation <strong>and</strong> phase,respectively, s<strong>in</strong>ce he studied functions of the form H = log µ, where µrepresented characteristics of an electrical network. This suggested thename of the theorem.Before present<strong>in</strong>g Bode’s Attenuation Integral Theorem, we discuss adifferent <strong>in</strong>terpretation of Theorem 2.3.3 <strong>in</strong> Chapter 2 that serves as motivation.Recall that Theorem 2.3.3 1 gave an expression of the imag<strong>in</strong>arypart of H at any frequency ω 0 <strong>in</strong> terms of the <strong>in</strong>tegral of the completefrequency response of the real part of H. In the proof of Theorem 2.3.3,the function H was manipulated <strong>in</strong> order to create poles at s = jω 0 <strong>and</strong>s = −jω 0 <strong>in</strong> the <strong>in</strong>tegr<strong>and</strong> of (2.27). This led to the result that the <strong>in</strong>tegr<strong>and</strong>had residues of value jQ(ω 0 ) at each of these poles, <strong>and</strong> thus thereal-imag<strong>in</strong>ary parts relation (2.26) can be alternatively <strong>in</strong>terpreted as anexpression for the residue at a f<strong>in</strong>ite frequency. There is no reason why thesame procedure cannot be applied to evaluate the residue at <strong>in</strong>f<strong>in</strong>ity of H,denoted by Res s=∞ H(s) (see §A.9.1 <strong>in</strong> Chapter A). This is the essentialformat of Bode’s Attenuation Integral Theorem.Theorem 3.1.1 (Bode’s Attenuation Integral Theorem). Let H be a functionsatisfy<strong>in</strong>g conditions (i) <strong>and</strong> (ii). Then, for H(jω) = P(ω) + jQ(ω),∫ ∞0[P(ω) − P(∞)] dω = − π 2 Res H(s) . (3.2)s=∞Proof. S<strong>in</strong>ce H is analytic at <strong>in</strong>f<strong>in</strong>ity, it follows from Example A.8.4 of AppendixA that it has a Laurent series expansion of the formH(s) = · · · + c −ks k+ · · · + c −1s + c 0 , (3.3)which is uniformly convergent <strong>in</strong> R ≤ |s| < ∞, for some R > 0. In viewof the assumption (i) on H, it follows that the coefficients {c k } are real <strong>and</strong>that c 0 = P(∞).Consider next the contour of <strong>in</strong>tegration shown <strong>in</strong> Figure 2.3 of Chapter2, where the small <strong>in</strong>dentations correspond to possible s<strong>in</strong>gularities ofH on the imag<strong>in</strong>ary axis, <strong>and</strong> where the large semicircle has radius largerthan R. Denote this semicircle by C R . Then, accord<strong>in</strong>g to Cauchy’s <strong>in</strong>tegraltheorem∮[H(s) − P(∞)] ds = 0 . (3.4)The contribution of the <strong>in</strong>tegrals on the small <strong>in</strong>dentations on the imag<strong>in</strong>aryaxis can be shown (see the proof of Lemma A.6.2 <strong>in</strong> Appendix A)to tend to zero as the <strong>in</strong>dentations vanish, due to the particular type of1 Theorem 2.3.3 was proven for proper <strong>and</strong> stable transfer functions, but the result alsoholds for any function H satisfy<strong>in</strong>g conditions (i) <strong>and</strong> (ii).


50 3. SISO <strong>Control</strong>s<strong>in</strong>gularities characterized <strong>in</strong> (ii). The <strong>in</strong>tegral on the large semicircle ofthe terms <strong>in</strong> s k , k ≤ 2, of (3.3) tends to zero as the radius becomes <strong>in</strong>f<strong>in</strong>ite(see Example A.4.1 <strong>in</strong> Appendix A). Hence, tak<strong>in</strong>g limits as R → ∞, (3.4)reduces to∫ ∞[P(ω) + jQ(ω) − P(∞)] jdω + lim−∞R→∞∫ ∞= 2j [P(ω) − P(∞)] dω − jπc −1 ,00 =where the second l<strong>in</strong>e is obta<strong>in</strong>ed us<strong>in</strong>g the symmetry properties of P <strong>and</strong>Q, <strong>and</strong> Example A.5.1 of Appendix A. Equation (3.2) then follows us<strong>in</strong>gRes s=∞ H(s) = −c −1 (see (A.80) <strong>in</strong> Appendix A).□Note that, from (3.3), Q(ω) = Im H(jω) = −c −1 /ω + c −3 /ω 3 + · · · .Hence (3.2) can also be written as∫C Rc −1sds∫ ∞0[P(ω) − P(∞)] dω = − π 2lim ω Q(ω) .ω→∞This gives an expression for the behavior at <strong>in</strong>f<strong>in</strong>ity of the imag<strong>in</strong>ary componentof H.A formula parallel to (3.2) but applicable at zero frequency is easily obta<strong>in</strong>edfrom the above result us<strong>in</strong>g the fact that H is analytic at zero. Thisis established <strong>in</strong> the follow<strong>in</strong>g corollary.Corollary 3.1.2. Let H be a function such that H(1/ξ) satisfies conditions(i) <strong>and</strong> (ii). Then∫ ∞0[P(ω) − P(0)] dωω 2 = π 2 lim dH(s). (3.5)s→0 dsProof. S<strong>in</strong>ce H is analytic at zero it has a power series expansion of theformH(s) = a 0 + a 1 s + · · · + a k s k + · · · , (3.6)which is uniformly convergent <strong>in</strong> |s| ≤ r, for some r > 0. Consider thefunction H(1/ξ), which is analytic at <strong>in</strong>f<strong>in</strong>ity <strong>and</strong> admits — from (3.6) — aLaurent series expansion of the formH(1/ξ) = · · · + a kξ k + · · · + a 1ξ + a 0 ,which is uniformly convergent <strong>in</strong> |ξ| ≥ 1/r. Then, us<strong>in</strong>g (3.2), we have∫ ∞[P(1/ν) − P(0)] dν = π02 a 1 .The proof is completed on mak<strong>in</strong>g the change of variable of <strong>in</strong>tegrationω = 1/ν <strong>and</strong> not<strong>in</strong>g that a 1 = lim s→0 [dH(s)/ds].□


3.1 Bode Integral Formulae 51The theorem <strong>and</strong> corollary given above, as well as Theorem 2.3.3 <strong>in</strong>Chapter 2, give examples of the great variety of formulae that can beobta<strong>in</strong>ed via a judicious choice of contour of <strong>in</strong>tegration <strong>and</strong> <strong>in</strong>tegr<strong>and</strong><strong>in</strong> Cauchy’s theorems. Bode collected a number of such formulae <strong>in</strong> hisbook; see, for example, the table at the end of Chapter 13 of Bode (1945).Our choice of the relations presented so far is justified by their immediateapplication <strong>in</strong> systems theory. Otherwise, to quote Bode,“. . . it is extremely difficult to organize all the possible relations<strong>in</strong> any very coherent way. In a purely mathematical sense mostof the formulae are related to one another by such obvioustransformations <strong>and</strong> changes of variable that there is no goodreason for pick<strong>in</strong>g out any particular set as <strong>in</strong>dependent. Basicallythey are merely reflections of Cauchy’s theorem. Thus theexpressions which one chooses to regard as dist<strong>in</strong>ctive must beselected for their physical mean<strong>in</strong>g for the particular problem<strong>in</strong> h<strong>and</strong>.”3.1.2 Bode Integrals for S <strong>and</strong> TThis subsection starts this book’s ma<strong>in</strong> journey through results concern<strong>in</strong>gcomplementary mapp<strong>in</strong>gs, by which we mean mapp<strong>in</strong>gs relat<strong>in</strong>g signals ofthe loop <strong>and</strong> such that their sum is a constant mapp<strong>in</strong>g. We consider herethe sensitivity function, S, <strong>and</strong> the complementary sensitivity function,T, of the classical unity feedback configuration of l<strong>in</strong>ear feedback controltheory, represented <strong>in</strong> Figure 3.2.r + ey❜ ✲ ✐ ✲ L(s) ✲ ❜− ✻FIGURE 3.2. Feedback control system.These two mapp<strong>in</strong>gs satisfy the complementarity constra<strong>in</strong>tS(s) + T(s) = 1 . (3.7)This represents an algebraic trade-off (Freudenberg <strong>and</strong> Looze, 1985), s<strong>in</strong>ceit constra<strong>in</strong>s the properties of the closed-loop system at each frequency.It implies, for example, that the magnitudes of S <strong>and</strong> T cannot both besmaller than 1/2 at the same frequency (Doyle et al., 1992).Assum<strong>in</strong>g m<strong>in</strong>imality, S has zeros at the unstable open-loop plant poles<strong>and</strong> T has zeros at the nonm<strong>in</strong>imum phase open-loop plant zeros. Theseare generally known as <strong>in</strong>terpolation constra<strong>in</strong>ts. More precisely, let the open-


52 3. SISO <strong>Control</strong>loop system be L <strong>and</strong> assume that it is free of unstable hidden modes 2 .Then, the sensitivity <strong>and</strong> complementary sensitivity functions have theforms1L(s)S(s) = , <strong>and</strong> T(s) =1 + L(s) 1 + L(s) . (3.8)The follow<strong>in</strong>g lemma formalizes the <strong>in</strong>terpolation constra<strong>in</strong>ts that S <strong>and</strong>T must satisfy at the CRHP poles <strong>and</strong> zeros of L.Lemma 3.1.3 (Interpolation Constra<strong>in</strong>ts). Assume that the open-loop systemL is free of unstable hidden modes. Then S <strong>and</strong> T must satisfy thefollow<strong>in</strong>g conditions.(i) If p ∈ ¢ +is a pole of L, then(ii) If q ∈ ¢ +is a zero of L, thenS(p) = 0, <strong>and</strong> T(p) = 1. (3.9)S(q) = 1, <strong>and</strong> T(q) = 0. (3.10)Proof. Follows immediately from (3.8).□This result states that the CRHP zeros of S <strong>and</strong> T are determ<strong>in</strong>ed bythose of L −1 <strong>and</strong> L. We <strong>in</strong>troduce for convenience the follow<strong>in</strong>g notationfor the ORHP zeros of S <strong>and</strong> T. 3Z S {s ∈ ¢ + : S(s) = 0} ,Z T {s ∈ ¢ + : T(s) = 0} .(3.11)Lemma 3.1.3 then establishes the fact that Z S is the set of ORHP poles of L<strong>and</strong> Z T is the set of ORHP zeros of L. In other words, we have translatedthe open-loop characteristics of <strong>in</strong>stability <strong>and</strong> “nonm<strong>in</strong>imum phaseness”<strong>in</strong>to properties that the functions S <strong>and</strong> T must satisfy <strong>in</strong> the CRHP. In particular,note that if the open-loop system is formed by the cascade of theplant, G, <strong>and</strong> the controller, K, i.e., L = GK, then the constra<strong>in</strong>ts imposedby CRHP poles <strong>and</strong> zeros of G hold irrespective of the choice of K, providedthat L has no unstable zero-pole cancelation.As a first extension of Bode’s results to <strong>in</strong>tegrals on sensitivity functions,we will derive Horowitz’s formula (3.1) <strong>and</strong> a complementary result for Tus<strong>in</strong>g Theorem 3.1.1 <strong>and</strong> Corollary 3.1.2.2 Recall from Chapter 2 that L is free of unstable hidden modes if there are no cancelationsof CRHP zeros <strong>and</strong> poles between the plant <strong>and</strong> controller whose cascade connectionforms L.3 In the sequel, when def<strong>in</strong><strong>in</strong>g a set of zeros of a transfer function, the zeros are repeatedaccord<strong>in</strong>g to their multiplicities.


3.1 Bode Integral Formulae 53Example 3.1.1 (Horowitz’s formula for S). Suppose that L is a proper rationalfunction without poles <strong>in</strong> the ORHP. Assume that the closed-loop systemis stable (i.e., the numerator of 1 + L is Hurwitz <strong>and</strong> L(∞) ≠ −1) <strong>and</strong> considerthe correspond<strong>in</strong>g sensitivity function S. It follows that Z S <strong>in</strong> (3.11)is empty <strong>and</strong> that the function H = log S satisfies assumptions (i) <strong>and</strong> (ii)of §3.1.1. Thus, we can apply Theorem 3.1.1 to the real part, log |S(jω)|, toobta<strong>in</strong>∫ ∞0[log |S(jω)| − log |S(j∞)|] dω = − π 2 Res log S(s) .s=∞If we further assume that the open-loop plant has relative degree two ormore, then 4 S(j∞) = 1 <strong>and</strong> Res s=∞ log S(s) = 0. This recovers Horowitz’sarea formula given <strong>in</strong> (3.1).◦Example 3.1.2 (Horowitz’s formula for T). Assume that L(s) is a m<strong>in</strong>imumphase rational function such that L(0) ≠ 0. Then, if we consider the complementarysensitivity function T, we have that Z T <strong>in</strong> (3.11) is empty. Next,notice that the function T(1/ξ) can be written asT(1/ξ) =11 + 1/L(1/ξ) . (3.12)Then, under the assumption of closed-loop stability (which implies L(0) ≠−1), the function H(1/ξ) = log T(1/ξ) satisfies conditions (i) <strong>and</strong> (ii) of§3.1.1. We thus obta<strong>in</strong>, from Corollary 3.1.2,∫ ∞0[log |T(jω)| − log |T(0)|] dωω 2 = π 12 T(0) lim dT(s).s→0 dsIf we further assume that L has at least two <strong>in</strong>tegrators then T(0) = 1 <strong>and</strong>lim s→0 dT(s)/ds = 0, which yields∫ ∞0log |T(jω)| dωω 2 = 0 .It is <strong>in</strong>structive at this po<strong>in</strong>t to reflect on the complementarity of theseformulae for S <strong>and</strong> T, as well as the hypotheses required on the open-loopsystem L to derive them. Under appropriate conditions, S <strong>and</strong> T exhibitsymmetry with respect to frequency <strong>in</strong>version (Kwakernaak, 1995). Forexample, the condition that L be proper <strong>and</strong> L(j∞) ≠ −1 (or alternatively,that L be strictly proper) imply the analyticity of S <strong>and</strong> log S at <strong>in</strong>f<strong>in</strong>ity. Onthe other h<strong>and</strong>, the requirement that L(0) ≠ {0, −1} (or that L has poles atzero) gives the analyticity of T <strong>and</strong> log T at zero.◦4 See Example A.9.2 <strong>in</strong> Appendix A.


54 3. SISO <strong>Control</strong>The extensions to unstable open-loop plants <strong>and</strong> to plants with time delaywere derived by Freudenberg <strong>and</strong> Looze 1985; 1987. The complementaryresult for T was obta<strong>in</strong>ed by Middleton <strong>and</strong> Goodw<strong>in</strong> (1990). Thesetwo area formulae will be given next under the name of the Bode Integralsfor S <strong>and</strong> T. We choose these names s<strong>in</strong>ce the results are natural extensionsof Bode’s <strong>in</strong>tegral (3.2) to the sensitivity <strong>and</strong> complementary sensitivityfunctions.Theorem 3.1.4 (Bode Integral for S). Let S be the sensitivity function def<strong>in</strong>edby (3.8). Let {p i : i = 1, . . . , n p } be the set of poles <strong>in</strong> the ORHP of theopen-loop system L. Then, assum<strong>in</strong>g closed-loop stability,(i) if L is a proper rational function,∫ ∞0logS(jω)∣S(j∞)∣ dω = π 2 lims→∞s[S(s) − S(∞)]S(∞)n p∑+ π p i ; (3.13)(ii) if L(s) = L 0 (s)e −sτ , where L 0 (s) is a strictly proper rational function<strong>and</strong> τ > 0,n∑ plog |S(jω)| dω = π p i . (3.14)∫ ∞0Proof. Consider the contour of Figure 3.3, where the <strong>in</strong>dentations C 1 , C 2 ,. . . , C np <strong>in</strong>to the right half plane avoid the branch cuts of log S correspond<strong>in</strong>gto the zeros of S (see Figure A.19 <strong>in</strong> Appendix A). Let C 0 consistof the rema<strong>in</strong><strong>in</strong>g portions of the imag<strong>in</strong>ary axis <strong>and</strong> let C R be the semicircleof radius R.jωi=1i=1C 0C 2RC 3C 1C RσFIGURE 3.3. Contour for Bode Sensitivity Integral.S<strong>in</strong>ce log[S(s)/S(∞)] is analytic on <strong>and</strong> <strong>in</strong>side the total contour, denotedby C, then, by Cauchy’s <strong>in</strong>tegral theorem,∫log S(s)S(∞) ds = 0 .limɛ→0R→∞C


3.1 Bode Integral Formulae 55Note that the computation of the portions of the <strong>in</strong>tegral on the imag<strong>in</strong>aryaxis <strong>and</strong> on the <strong>in</strong>dentations is the same for both cases (i) <strong>and</strong> (ii) (the onlydist<strong>in</strong>ction be<strong>in</strong>g that <strong>in</strong> (ii) we have that S(∞) = 1). However, the <strong>in</strong>tegralon the large semicircle requires a different analysis <strong>in</strong> each case.The portion of the <strong>in</strong>tegral on C 0 gives 5∫lim log S(s) ∫ ∞ɛ→0R→∞C 0S(∞) ds = 2j 0logS(jω)∣S(j∞)∣ dω .For the <strong>in</strong>tegral on the branch cut <strong>in</strong>dentations we have, us<strong>in</strong>g (A.85) ofAppendix A,n∑ p ∫log S(s)n pnC iS(∞) ds = −j2π ∑∑ pRe p i = −j2π p i ,i=1where the last equality follows s<strong>in</strong>ce complex zeros must appear <strong>in</strong> conjugatepairs.It rema<strong>in</strong>s to compute the <strong>in</strong>tegral on C R for each of the cases (i) <strong>and</strong> (ii).(i) If L is a proper rational function, the assumption of closed-loop stabilityguarantees that the function log S(s) is analytic at <strong>in</strong>f<strong>in</strong>ity. Wecan then use Example A.9.2 of Appendix A to compute the <strong>in</strong>tegralon C R <strong>in</strong> the limit when R → ∞ aslimR→∞∫C Rlog S(s)S(∞)i=1S(s)ds = jπ Res logs=∞ S(∞)where we have used (A.84) from Appendix A.i=1= jπ 1S(∞) lims→∞ s[S(∞) − S(s)] , (3.15)(ii) If L = L 0 e −sτ , with τ > 0 <strong>and</strong> L 0 strictly proper, we have that thereis an r > 0 such that, for all s with Re s ≥ 0 <strong>and</strong> |s| > r, the modulusof L(s) satisfies |L(s)| < 1. Then, a similar use of the expansion oflog(1 + s) as <strong>in</strong> Example A.9.2 of Appendix A yieldslog S(s) = −L 0 (s)e −sτ + L2 0 (s)e−2sτ2+ · · · ,<strong>in</strong> {s : |s| > r <strong>and</strong> Re s ≥ 0}. S<strong>in</strong>ce L 0 has relative degree at least one,then the above expansion has the formlog S(s) = c −1s e−sτ + · · · , <strong>in</strong> {s : |s| > r <strong>and</strong> Re s ≥ 0} .5 Under the convention that the phase of a negative real number is equal to π for ω ≥ 0<strong>and</strong> −π for ω < 0.


56 3. SISO <strong>Control</strong>Then, us<strong>in</strong>g the result <strong>in</strong> Example A.4.3 of Appendix A, we f<strong>in</strong>allyhave that∫lim log S(s) ds = 0 .R→∞ C RThe result then follows by comb<strong>in</strong><strong>in</strong>g the <strong>in</strong>tegrals over all portions ofthe total contour for each of the cases (i) <strong>and</strong> (ii).□Theorem 3.1.4 shows that the presence of open-loop unstable poles furtherrestricts the compromise between areas of sensitivity attenuation <strong>and</strong>amplification imposed by the <strong>in</strong>tegral. Specifically, s<strong>in</strong>ce the term ∑ n pi=1 p iis nonnegative, the contribution to the <strong>in</strong>tegral of those frequencies where|S(jω)| < 1 is clearly reduced if L has unstable poles. Moreover, the fartherfrom the jω-axis the poles are, the worse will be their effect.The first term on the RHS of (3.13) appears only if the plant has relativedegree 0 or 1 <strong>and</strong> no time delay, <strong>and</strong> admits an <strong>in</strong>terpretation <strong>in</strong> terms ofthe time response of L. Indeed, if l(t) denotes the response of the plant toa unitary step <strong>in</strong>put, we can alternatively write[ ]s[S(s) − S(∞)] L(∞) − L(s)lim= lim ss→∞ S(∞) s→∞ 1 + L(s)= − ˙l(0 + )1 + L(∞) ,where the last step follows from the Initial Value Theorem of the Laplacetransform (e.g., Frankl<strong>in</strong> et al., 1994). Thus, ˙l(0 + ) is the <strong>in</strong>itial slope of l(t),as illustrated <strong>in</strong> Figure 3.4.l(t)L(0)L(∞)θ = arctan ˙l(0 + )FIGURE 3.4. Step response of a plant of relative degree 0.tIf the plant has relative degree 0 or 1, a large positive value for ˙l(0 + ) willameliorate the <strong>in</strong>tegral constra<strong>in</strong>t for S. This, as we will see <strong>in</strong> more detail<strong>in</strong> §3.1.3, is justified by the fact that a larger value for ˙l(0 + ) is associatedwith larger b<strong>and</strong>width of the system.


3.1 Bode Integral Formulae 57If the plant has relative degree greater than 1, then ˙l(0 + ) vanishes, <strong>and</strong>(3.13) reduces to (3.14). We will see <strong>in</strong> §3.1.3, however, that larger b<strong>and</strong>widthstill alleviates the trade-offs <strong>in</strong>duced by this <strong>in</strong>tegral constra<strong>in</strong>t.Naturally, an equivalent relation holds for the complementary sensitivityfunction, as stated <strong>in</strong> the follow<strong>in</strong>g theorem.Theorem 3.1.5 (Bode Integral for T). Let T be the complementary sensitivityfunction def<strong>in</strong>ed by (3.8). Let {q i : i = 1, . . . , n q } be the set of zeros<strong>in</strong> the ORHP of the open-loop system L, <strong>and</strong> suppose that L(0) ≠ 0. Then,assum<strong>in</strong>g closed-loop stability,(i) if L is a proper rational function∫ ∞0logT(jω)∣ T(0) ∣dωω 2 = π 21T(0) lim dT(s)s→0 dsn q∑+ πi=11q i; (3.16)(ii) if L(s) = L 0 (s)e −sτ , where L 0 (s) is a strictly proper rational function<strong>and</strong> τ > 0,∫ ∞0logT(jω)∣ T(0) ∣dωω 2 = π 21T(0) lim dT(s)s→0 dsn q∑+ πi=11q i+ π 2 τ . (3.17)Proof. The proof follows along the same l<strong>in</strong>es as that of Theorem 3.1.4 with<strong>in</strong>verted symmetry <strong>in</strong> the roles played by the po<strong>in</strong>ts s = 0 <strong>and</strong> s = ∞. The<strong>in</strong>terested reader may f<strong>in</strong>d the details <strong>in</strong> Middleton (1991).□It is <strong>in</strong>terest<strong>in</strong>g to note that the <strong>in</strong>tegral for T is <strong>in</strong> fact the same as thatfor S under frequency <strong>in</strong>version. Indeed, by lett<strong>in</strong>g ν = 1/ω, we can alternativelyexpress∫ ∞0logT(jω)∣ T(0)∣dωω 2 = ∫ ∞0logT(1/jν)∣ T(0)∣ dν .Accord<strong>in</strong>gly, as seen <strong>in</strong> (3.16) <strong>and</strong> (3.17), nonm<strong>in</strong>imum phase zeros of Lplay for T an entirely equivalent role to that of unstable poles for S; i.e.,they worsen the <strong>in</strong>tegral constra<strong>in</strong>t. In this case, zeros <strong>in</strong> the ORHP thatare closer to the jω-axis will pose a greater difficulty <strong>in</strong> shap<strong>in</strong>g T.The first term on the RHSs of (3.16) <strong>and</strong> (3.17) has an <strong>in</strong>terpretation <strong>in</strong>terms of steady-state properties of the plant. Consider the feedback loopshown <strong>in</strong> Figure 3.5. The Laplace transform of the error signal e = r − y isgiven by[E(s) = 1 − T(s) ]R(s) .T(0)Hence, if r is a unitary ramp, i.e., R(s) = 1/s 2 , then the correspond<strong>in</strong>gsteady-state value of e(t) (see Figure 3.6) can be computed us<strong>in</strong>g the F<strong>in</strong>al


58 3. SISO <strong>Control</strong>r❜ ✲1 +✲T(0)−+ e✲ ✐ ✲ ❜− ✻y✐ ✲ L(s) ✲ ❜✻FIGURE 3.5. Type-1 feedback system.Value Theorem (e.g., Frankl<strong>in</strong> et al., 1994) <strong>and</strong> L’Hospital’s rule (p. 260Widder, 1961, e.g., ) ase ss = limt→∞e(t)= lims→0sE(s)1 − T(s)T(0)= lims→0 s= − 1T(0) lim dT(s)s→0 dsThus the constant 1/T(0)dT/ds| s=0 plays a role similar to that played bythe reciprocal of the velocity constant <strong>in</strong> a Type-1 feedback system (e.g.,Truxal, 1955, p. 286). Consequently, the correspond<strong>in</strong>g term on the RHSsof (3.16) <strong>and</strong> (3.17) can ameliorate the severity of the design trade-off onlyif the steady-state error to a ramp <strong>in</strong>put is large <strong>and</strong> positive, so that theoutput lags the reference <strong>in</strong>put significantly..e(t)e ssFIGURE 3.6. Steady-state error to ramp.tF<strong>in</strong>ally, the last term on the RHS of (3.17) shows that the trade-off alsoworsens if the plant has a time delay. In a sense this is not surpris<strong>in</strong>g, s<strong>in</strong>ce


3.1 Bode Integral Formulae 59a time delay may be approximated by nonm<strong>in</strong>imum phase zeros 6 , whichalready appear <strong>in</strong> connection with the Bode <strong>in</strong>tegral for T.In the follow<strong>in</strong>g section we will discuss more detailed design implicationsof the <strong>in</strong>tegral constra<strong>in</strong>ts given by Theorem 3.1.4 <strong>and</strong> Theorem 3.1.5.3.1.3 Design InterpretationsAs discussed <strong>in</strong> §3.1, the <strong>in</strong>tegral constra<strong>in</strong>t (3.1) tells us that, if the feedbackloop is designed to hold |S(jω)| smaller than one over a given frequencyrange, then it will be larger than one <strong>in</strong> another range. In the caseof open-loop unstable plants, the constra<strong>in</strong>t (3.14) shows that the trade-offis worsened s<strong>in</strong>ce the <strong>in</strong>tegral must be positive <strong>in</strong> this case.Heuristically, this effect can be seen from the Nyquist plot of the openlooptransfer function, L, as follows. Consider first that L is a stable rationalfunction of relative degree two or more; then L(jω) will ultimately havephase lag of at least −π. This situation is depicted on the left of Figure 3.7for L(s) = 1.5/(1+s) 2 . Note that S −1 (jω) = 1+L(jω) is the vector from the−1 po<strong>in</strong>t to the po<strong>in</strong>t on the Nyquist plot correspond<strong>in</strong>g to the frequencyω. Thus, we see that there is a portion of the plot where |S(jω)| is lessthan one <strong>and</strong> another portion where |S(jω)| <strong>in</strong>creases above one. A similarsituation occurs for strictly proper plants with time delay, as shown onthe right of Figure 3.7 for L(s) = 1.5e −3s /(s + 2). In this case, however,the Nyquist plot alternates between areas of sensitivity attenuation <strong>and</strong>amplification <strong>in</strong>f<strong>in</strong>itely many times.jωjω|S(jω)| > 1|S(jω)| < 1|S(jω)| > 1|S(jω)| < 1−1σ−1σL(jω)L(jω)FIGURE 3.7. Graphical <strong>in</strong>terpretation of the area formula.The possibility of achiev<strong>in</strong>g a negative value of the <strong>in</strong>tegral <strong>in</strong> (3.13) forstable open-loop systems hav<strong>in</strong>g relative degree one can also be deducedfrom Figure 3.7. Indeed, these systems will exhibit an asymptotic phaselag of −π/2, <strong>and</strong> hence their Nyquist plot can be kept largely outside theregion of sensitivity <strong>in</strong>crease.6 This is evident <strong>in</strong> the Padé approximations of e −sτ (e.g., Middleton <strong>and</strong> Goodw<strong>in</strong>, 1990).


60 3. SISO <strong>Control</strong>Example 3.1.3. Let L = K/(s + 1), K > 0, i.e., a first order plant cascadedwith a proportional controller. ThenS(s) = s + 1s + 1 + K .The value of the <strong>in</strong>tegral <strong>in</strong> (3.13) is −K (which is m<strong>in</strong>us the <strong>in</strong>itial slopeof the step response of L). The Nyquist plot of L lies entirely <strong>in</strong> the forthquadrant, <strong>and</strong> therefore there is no area of sensitivity amplification. ◦Despite the area balance effect, one cannot immediately conclude fromeither (3.1) or (3.14) that a peak that is significantly greater than one willoccur outside the frequency range over which sensitivity reduction hasbeen achieved. Actually, it is possible that area of sensitivity reductionover some f<strong>in</strong>ite range of frequencies may be compensated by an areawhere |S| is allowed to be slightly greater than one over an arbitrarily largerange of frequencies.Any practical design, however, is affected by b<strong>and</strong>width constra<strong>in</strong>ts.Indeed, several factors such as undermodel<strong>in</strong>g, sensor noise, plant b<strong>and</strong>width,etc., lead to the desirability of decreas<strong>in</strong>g the open-loop ga<strong>in</strong> at highfrequencies, thus putt<strong>in</strong>g a limit on the b<strong>and</strong>width of the closed loop. It isreasonable to assume, then, that the open-loop ga<strong>in</strong> satisfies a design specificationof the type( ωc) 1+k|L(jω)| ≤ δ , ∀ω ≥ ωc , (3.18)ωwhere δ < 1/2 <strong>and</strong> k > 0. Note that k, ω c <strong>and</strong> δ can be adjusted to provideupper limits to the slope of magnitude roll-off, the frequency where rolloffstarts <strong>and</strong> the ga<strong>in</strong> at that frequency, respectively.By convention, the b<strong>and</strong>width of the closed-loop system <strong>in</strong> Figure 3.2 isdef<strong>in</strong>ed (e.g., Frankl<strong>in</strong> et al., 1994) as the frequency of a s<strong>in</strong>usoidal <strong>in</strong>put, r,at which the output y is attenuated to a factor of 1/ √ 2. From the def<strong>in</strong>itionof T <strong>in</strong> (3.8), the b<strong>and</strong>width ω b is the frequency at which |T(jω b )| = 1/ √ 2.It is possible to approximate ω b by a factor (generally between 1 <strong>and</strong> 2)of the crossover frequency, ω 0 , def<strong>in</strong>ed as the lowest frequency at which themagnitude of the open-loop system is one, i.e., |L(jω 0 )| = 1 (e.g., Frankl<strong>in</strong>et al., 1994). S<strong>in</strong>ce ω c <strong>in</strong> (3.18) is such that |L(jω c )| < 1/2, it follows thatω 0 < ω c <strong>and</strong> hence the b<strong>and</strong>width ω b can be roughly taken as ω b ≈ ω c .The follow<strong>in</strong>g result shows that a condition such as (3.18) on the openloopga<strong>in</strong> puts an upper limit on the area of sensitivity <strong>in</strong>crease that canbe present at frequencies greater than ω c .Corollary 3.1.6. Suppose that L is a rational function of relative degreetwo or more <strong>and</strong> satisfy<strong>in</strong>g the b<strong>and</strong>width restriction (3.18). Then∞ ∣∣∫log |S(jω)| dω∣ ≤ 3δω cω c2k . (3.19)


3.1 Bode Integral Formulae 61Proof. We use the fact that if |s| < 1/2, then | log(1 + s)| ≤ 3|s|/2 (SeeExample A.8.6 <strong>in</strong> Appendix A). Then∞ ∫ ∞∣∣∫log |S(jω)| dω∣ ≤ |log |S(jω)|| dωω c ω∫c∞≤ |log S(jω)| dωω c∫ ∞= |log[1 + L(jω)]| dωω c≤ 3δω1+k c2∫ ∞1dωω cω1+k = 3δω c2k . □The above result shows that the area of the tail of the Bode sensitivity <strong>in</strong>tegralover the <strong>in</strong>f<strong>in</strong>ite frequency range [ω c , ∞) is limited. Hence, any areaof sensitivity reduction must be compensated by a f<strong>in</strong>ite area of sensitivity<strong>in</strong>crease, which will necessarily lead to a large peak <strong>in</strong> the sensitivityfrequency response before ω c . To see this, suppose that S is required tosatisfy the follow<strong>in</strong>g reduction condition|S(jω)| ≤ α < 1 , ∀ω ≤ ω 1 < ω c . (3.20)Now, us<strong>in</strong>g (3.14) <strong>and</strong> the bounds (3.18) <strong>and</strong> (3.20), it is easy to show that⎡⎤1sup log |S(jω)| ≥ ⎣π ∑p + ω 1 log 1ω∈(ω 1 ,ω c)ω c − ω 1 α − 3δω c⎦ .2kp∈Z S(3.21)The bound (3.21) shows that any attempt to <strong>in</strong>crease the area of sensitivityreduction by requir<strong>in</strong>g α to be small <strong>and</strong>/or ω 1 to be close to ω c willnecessarily result <strong>in</strong> a large sensitivity peak <strong>in</strong> the range (ω 1 , ω c ). Hence,the Bode sensitivity <strong>in</strong>tegral (3.14) imposes a clear design trade-off whennatural b<strong>and</strong>width constra<strong>in</strong>ts are assumed for the closed-loop system.Notice, however, that this trade-off is alleviated if the closed-loop b<strong>and</strong>widthis large, i.e., large ω c <strong>in</strong> (3.18).Example 3.1.4. The <strong>in</strong>equality (3.21) can be used to derive a lower boundon the closed-loop b<strong>and</strong>width <strong>in</strong> terms of the sum of open-loop unstablepoles. Indeed, suppose that the frequency ω 1 <strong>in</strong> (3.20) is taken to be afraction of the b<strong>and</strong>width, sayω 1 = k 1 ω c ,


62 3. SISO <strong>Control</strong><strong>and</strong> assume that we desire that the peak sensitivity on the LHS of (3.21)be less than or equal to a number S m > 1. Then, necessarily, the lowerbound on the RHS of (3.21) must be less than or equal to S m . Impos<strong>in</strong>gthis condition yields the follow<strong>in</strong>g lower bound on the b<strong>and</strong>width, whichwe take as ω b = ω c ,ω b ≥ B(S m ) ∑p∈Z Sp , (3.22)where B(S m ) isB(S m ) π(1 − k 1 )(S m + 1.5 δ/k + k 1 log α) .The factor B(S m ) is plotted <strong>in</strong> Figure 3.8 as a function of the desired peaksensitivity S m , for δ = 0.45, k = 1, k 1 = 0.7 <strong>and</strong> α = 0.5.98B(S m )76541 1.2 1.4 1.6 1.8 2S mFIGURE 3.8. Lower bound on the b<strong>and</strong>width as a function of the peak sensitivity.We may conclude from this figure, for example, that an open-loop unstablesystem hav<strong>in</strong>g relative degree two requires a b<strong>and</strong>width of at least6.5 times the sum of its ORHP poles if it is desired that |S| be smallerthan 1/2 over 70% of the closed-loop b<strong>and</strong>width while keep<strong>in</strong>g the lowerbound on the peak sensitivity smaller than S m = √ 2.◦3.2 The Water-Bed EffectThe Bode sensitivity <strong>and</strong> complementary sensitivity <strong>in</strong>tegrals representa constra<strong>in</strong>t imposed by nonm<strong>in</strong>imum phase zeros <strong>and</strong> unstable polesof the open-loop system. We have just seen <strong>in</strong> §3.1.3 that these <strong>in</strong>tegralconstra<strong>in</strong>ts, <strong>in</strong> conjunction with a further requirement on the closed-loopb<strong>and</strong>width, lead to a quantifiable trade-off <strong>in</strong> design. Indeed, from thecomment at the end of §3.1.3, we see that an arbitrary reduction of sensitivityover a range of frequencies necessarily implies an arbitrarily large


3.2 The Water-Bed Effect 63<strong>in</strong>crease at other frequencies. This k<strong>in</strong>d of phenomenon has been referredto as a push-pop or water-bed effect (e.g., Shamma, 1991; Doyle et al., 1992).The water-bed effect was first recognized by Francis <strong>and</strong> Zames (1984),who showed that a similar trade-off exists with respect to sensitivity m<strong>in</strong>imizationover a frequency <strong>in</strong>terval when the open-loop system has a zero<strong>in</strong> the ORHP. This will be established below. As opposed to the previousresults, which were based on the Cauchy <strong>in</strong>tegral theorem, the result thatfollows relies on an application of the maximum modulus pr<strong>in</strong>ciple (TheoremA.10.2 <strong>in</strong> Appendix A).Theorem 3.2.1 (Water-Bed Effect). Let S be the sensitivity function def<strong>in</strong>edby (3.8). Assume that S is proper <strong>and</strong> stable. Then, if the open-loopplant L has a zero <strong>in</strong> the ORHP, there exists a positive number m such thatsup |S(jω)| ≥ 1/‖S‖ m ∞ ,ω∈[ω 1 ,ω 2 ]whereis the <strong>in</strong>f<strong>in</strong>ity norm of S.‖S‖ ∞ = sup |S(jω)|ωProof. Let q be a zero of L <strong>in</strong> the ORHP. It follows from (3.7) that S(q) = 1.Consider the mapp<strong>in</strong>g from the CRHP onto the unit disk given byz = q − sq + s ,s = q − qz1 + z .Then the <strong>in</strong>terval {s = jω : ω ∈ [ω 1 , ω 2 ]} is mapped onto the arcLet R be the follow<strong>in</strong>g function,supθ∈[θ 1 ,θ 2 ]{z = e jθ : θ ∈ [θ 1 , θ 2 ]} . (3.23)R(z) S( ) q − qz.1 + zIt follows that R is analytic <strong>in</strong> |z| < 1, R(0) = 1, <strong>and</strong> moreover(∣ R ejθ )∣ ∣ =<strong>and</strong>sup |S(jω)| ,ω∈[ω 1 ,ω 2 ]∣sup ∣R ( e jθ)∣ ∣ = ‖S‖ ∞ .θNow let φ be the angle φ = |θ 2 − θ 1 | <strong>and</strong> let n be any <strong>in</strong>teger greaterthan 2π/φ. Def<strong>in</strong>e the auxiliary function P asP(z) n−1∏k=0(R z e j2πk/n) . (3.24)


64 3. SISO <strong>Control</strong>Then P is also analytic <strong>in</strong> |z| < 1, <strong>and</strong> P(0) = 1. S<strong>in</strong>ce the angle 2π/n is lessthan φ, at least one of the po<strong>in</strong>ts z e j2πk/n , k = 0, · · · , n − 1, lies on thearc (3.23) for each z on the unit circle. Thus, from (3.24) <strong>and</strong> the maximummodulus pr<strong>in</strong>ciple1 = |P(0)|(≤ sup ∣ P ejθ )∣ ∣[≤θsupθ= ‖S‖ n−1∞(∣ R ejθ )∣ n−1∣]supθ∈[θ 1 ,θ 2 ]supω∈[ω 1 ,ω 2 ]|S(jω)| .∣ R(ejθ )∣ ∣The result then follows by assign<strong>in</strong>g m = n − 1.□This result shows the water-bed effect for l<strong>in</strong>ear nonm<strong>in</strong>imum phasesystems: the magnitude of S can be made arbitrarily small over a frequency<strong>in</strong>terval only at the expense of hav<strong>in</strong>g the magnitude arbitrarilylarge off this <strong>in</strong>terval (Francis <strong>and</strong> Zames, 1984). We will next obta<strong>in</strong> alternativeexpressions for the sensitivity trade-offs us<strong>in</strong>g Poisson <strong>in</strong>tegralformula. In section §3.3.2, these results will be used to give a more explicitform for the water-bed trade-off.3.3 Poisson Integral FormulaeConsider aga<strong>in</strong> the feedback control configuration of Figure 3.2 <strong>and</strong> thesensitivity <strong>and</strong> complementary sensitivity functions def<strong>in</strong>ed <strong>in</strong> (3.8). In§3.1.2 we discussed the algebraic trade-off imposed by the complementarityconstra<strong>in</strong>t (3.7), which, for example, precludes the possibility of hav<strong>in</strong>gboth small |S(jω)| <strong>and</strong> |T(jω)| at the same frequency. We will see <strong>in</strong> this sectionthat it is particularly reveal<strong>in</strong>g to evaluate this algebraic constra<strong>in</strong>t atthe nonm<strong>in</strong>imum phase zeros <strong>and</strong> unstable poles of the open-loop transferfunction L. Indeed, with the aid of the Poisson <strong>in</strong>tegral formula for thehalf plane (see §A.6.1 <strong>in</strong> Appendix A), we will show that this algebraicconstra<strong>in</strong>t actually leads to <strong>in</strong>tegral constra<strong>in</strong>ts on the overall frequencyresponses of S <strong>and</strong> T.3.3.1 Poisson Integrals for S <strong>and</strong> TBefore deriv<strong>in</strong>g the Poisson <strong>in</strong>tegral theorems for S <strong>and</strong> T, we need someprelim<strong>in</strong>ary notation. Recall that, if L is free of unstable hidden modes,the ORHP poles <strong>and</strong> zeros of L become zeros of S <strong>and</strong> T, as stated <strong>in</strong>Lemma 3.1.3. Let q i , i = 1, . . . , n q , be the zeros of L <strong>in</strong> the ORHP, <strong>and</strong>


3.3 Poisson Integral Formulae 65p i , i = 1, . . . , n p , be the poles of L <strong>in</strong> the ORHP, all counted with multiplicities.Follow<strong>in</strong>g the notation <strong>in</strong>troduced <strong>in</strong> (3.11), we then have thatthe ORHP zeros of S <strong>and</strong> T are given byZ S = {p i : i = 1, . . . , n p } ,Z T = {q i : i = 1, . . . , n q } .(3.25)We next <strong>in</strong>troduce the Blaschke products of the ORHP zeros of S <strong>and</strong> T,given by (see also (2.37) <strong>in</strong> Chapter 2)B S (s) =n p∏i=1np i − sp i + s , <strong>and</strong> B ∏ qT (s) =i=1q i − sq i + s . (3.26)Blaschke products are “all-pass” functions, s<strong>in</strong>ce their magnitude is constant<strong>and</strong> equal to one on the jω-axis. Us<strong>in</strong>g B S <strong>and</strong> B T , we factorize LasL(s) = ˜L(s)B −1S (s)B T (s)e −sτ , (3.27)where τ ≥ 0 is a possible time delay <strong>in</strong> the open-loop system, <strong>and</strong> wherethe factor ˜L(s) is a proper rational function hav<strong>in</strong>g no poles or zeros <strong>in</strong> theORHP. Then S <strong>and</strong> T <strong>in</strong> (3.8) can be factored asS(s) = ˜S(s)B S (s) ,T(s) = ˜T(s)B T (s)e −sτ .(3.28)Note that ˜S <strong>and</strong> ˜T have no zeros <strong>in</strong> the f<strong>in</strong>ite ORHP s<strong>in</strong>ce the factors B S<strong>and</strong> B T remove the f<strong>in</strong>ite zeros of S <strong>and</strong> T. Also, if the closed-loop systemis stable, both ˜S <strong>and</strong> ˜T have no poles <strong>in</strong> the f<strong>in</strong>ite CRHP. The po<strong>in</strong>ts = ∞ requires some more care. If τ = 0 <strong>and</strong> ˜L is proper, then both ˜S <strong>and</strong>˜T are analytic at <strong>in</strong>f<strong>in</strong>ity. On the other h<strong>and</strong>, if τ > 0 <strong>and</strong> ˜L is proper, S<strong>and</strong> T have an essential s<strong>in</strong>gularity at <strong>in</strong>f<strong>in</strong>ity (see Example A.8.3 <strong>in</strong> AppendixA). However, both functions log ˜S <strong>and</strong> log ˜T are of class R, i.e., theirgrowth at <strong>in</strong>f<strong>in</strong>ity <strong>in</strong> the CRHP is restricted. 7The above discussion suggests that we can apply the Poisson <strong>in</strong>tegralformula — more specifically Corollary A.6.3 <strong>in</strong> Appendix A — to the functionslog ˜S <strong>and</strong> log ˜T, as we show next.Theorem 3.3.1 (Poisson Integral for S). Let S be the sensitivity functiondef<strong>in</strong>ed by (3.8). Assume that the open-loop system L can be factored as <strong>in</strong>7 Recall that a function f is of class R if lim R→∞ R −1 sup θ∈[−π/2,π/2] |f(Re jθ )| = 0(§A.6.1 <strong>in</strong> Appendix A). S<strong>in</strong>ce L is proper <strong>and</strong> the closed-loop system is stable, it is not difficultto see that ˜S is bounded on the CRHP by a constant, S m say. Then log | ˜S| ≤ log S m onthe CRHP <strong>and</strong> hence log | ˜S|/R → 0 as R → ∞. A similar argument shows that ˜T is of classR.


66 3. SISO <strong>Control</strong>(3.27) <strong>and</strong> let q = σ q + jω q be an ORHP zero of L. Then, if the closed-loopsystem is stable,∫ ∞−∞σ qlog |S(jω)|σ 2 q + (ω q − ω) 2 dω = π log ∣ ∣B −1S (q)∣ ∣ . (3.29)Proof. We will apply Corollary A.6.3 of Appendix A to the function log ˜S.S<strong>in</strong>ce the closed-loop system is stable <strong>and</strong> <strong>in</strong> view of the factorization(3.28), then ˜S has no zeros or poles <strong>in</strong> the ORHP (see comments after (3.28))<strong>and</strong> hence log ˜S is analytic there. If τ > 0 then log ˜S has an essential s<strong>in</strong>gularityat <strong>in</strong>f<strong>in</strong>ity, but these are allowed by Corollary A.6.3. Then, s<strong>in</strong>celog ˜S is of class R (see footnote on page 65), (A.48) <strong>in</strong> Corollary A.6.3 canbe used with with f = log ˜S <strong>and</strong> s 0 = q. The constra<strong>in</strong>t (3.29) then followson not<strong>in</strong>g that | ˜S(jω)| = |S(jω)| (s<strong>in</strong>ce |B S (jω)| = 1, ∀ω) <strong>and</strong> that˜S(q) = B −1S(q) (s<strong>in</strong>ce S(q) = 1). □The correspond<strong>in</strong>g result for T is as follows.Theorem 3.3.2 (Poisson Integral for T). Let T be the complementary sensitivityfunction def<strong>in</strong>ed by (3.8). Assume that the open-loop system L canbe factored as <strong>in</strong> (3.27) <strong>and</strong> let p = σ p + jω p be an ORHP pole of L. Then,if the closed-loop system is stable,∫ ∞−∞σ plog |T(jω)|σ 2 p + (ω p − ω) 2 dω = π log ∣ ∣B −1T (p)∣ ∣ + πσ p τ . (3.30)Proof. We will apply Corollary A.6.3 of Appendix A to the function log ˜T.S<strong>in</strong>ce the closed-loop system is stable <strong>and</strong> <strong>in</strong> view of the factorization(3.28), ˜T has no zeros or poles <strong>in</strong> the ORHP (see comments after (3.28)).If ˜L is strictly proper, <strong>and</strong>/or if τ > 0, then log ˜T has a s<strong>in</strong>gularity at <strong>in</strong>f<strong>in</strong>ity,but these are allowed by Corollary A.6.3. Then, s<strong>in</strong>ce log ˜T is of class R(see footnote on page 65), (A.48) <strong>in</strong> Corollary A.6.3 can be used with withf = log ˜T <strong>and</strong> s 0 = p. The constra<strong>in</strong>t (3.29) then follows on not<strong>in</strong>g that| ˜T(jω)| = |T(jω)| (s<strong>in</strong>ce ∣ BS (jω)e −jωτ∣ ∣ = 1, ∀ω) <strong>and</strong> that ˜T(p) = B −1T(p)(s<strong>in</strong>ce T(p) = 1).□Note that, as was the case for Corollary A.6.3 <strong>in</strong> Appendix A, Theorems3.3.1 <strong>and</strong> 3.3.2 still hold when L has poles or zeros on the imag<strong>in</strong>ary axis.Also, if the ORHP zero q <strong>in</strong> Theorem 3.3.1 has multiplicity m > 1, additional<strong>in</strong>tegral constra<strong>in</strong>ts on the first m − 1 derivatives of log ˜S can be obta<strong>in</strong>ed.These, however, do not seem to be as <strong>in</strong>sightful as (3.29). A similarcomment is <strong>in</strong> order <strong>in</strong> the case of multiple ORHP poles <strong>in</strong> Theorem 3.3.2.Similar to the Bode <strong>in</strong>tegral formulae of §3.1.2, the Poisson <strong>in</strong>tegral constra<strong>in</strong>ts(3.29) <strong>and</strong> (3.30) represent a balance between areas of sensitivityor complementary sensitivity attenuation <strong>and</strong> amplification. To see thisnotice: first that, for s 0 = σ 0 + jω 0 , σ 0 > 0, the weight<strong>in</strong>g functionσ 0W s0 (ω) σ 2 0 + (ω (3.31)0 − ω) 2


3.3 Poisson Integral Formulae 67is positive for all ω, <strong>and</strong> second that the RHSs of both equations are nonnegatives<strong>in</strong>ce, for a Blaschke product B, we have that ∣ ∣B −1 (s) ∣ ≥ 1 for s<strong>in</strong> the ORHP.The area balance implied by the Poisson <strong>in</strong>tegrals, however, differs fromthat of the Bode <strong>in</strong>tegrals. Indeed, the presence of the weight<strong>in</strong>g function(3.31) <strong>in</strong> the Poisson <strong>in</strong>tegrals precludes the possibility of compensat<strong>in</strong>gan area of sensitivity reduction over a f<strong>in</strong>ite range of frequencies by anarea where |S| (or |T|) is allowed to be slightly greater than one over anarbitrarily large range of frequencies. This is because the weighted area ofthe imag<strong>in</strong>ary axis is f<strong>in</strong>ite <strong>and</strong> equal to π, as seen from <strong>in</strong>tegrat<strong>in</strong>g (3.31).More <strong>in</strong>sights <strong>in</strong>to the properties of the weight<strong>in</strong>g function will be given<strong>in</strong> the follow<strong>in</strong>g section.The case where the open-loop system is both unstable <strong>and</strong> nonm<strong>in</strong>imumphase is even more problematic. Indeed, note that both RHSs arepositive if the open-loop system has at least one ORHP zero <strong>and</strong> oneORHP pole. If this is the case the weighted area of sensitivity <strong>in</strong>creasemust be larger than the area of sensitivity reduction. Moreover, s<strong>in</strong>ce B −1T(p)<strong>in</strong> (3.29) has poles at p = q i , i = 1, . . . , n q (similarly, B −1S(q) <strong>in</strong> (3.30) haspoles at q = p i , i = 1, . . . , n p ), the constra<strong>in</strong>ts are aggravated if there is anapproximate pole-zero cancelation <strong>in</strong> the ORHP.In the follow<strong>in</strong>g section we will discuss design trade-offs implied byTheorems 3.3.1 <strong>and</strong> 3.3.2 with respect to sensitivity <strong>and</strong> complementarysensitivity m<strong>in</strong>imization over a frequency <strong>in</strong>terval.3.3.2 Design InterpretationsIn §3.1.3 we gave a graphical <strong>in</strong>terpretation of the area balance implied by(3.1) us<strong>in</strong>g the Nyquist plot of the open-loop transfer function. It was seen<strong>in</strong> Figure 3.7 that, as long as the phase of the open-loop system surpasses−π, the plot must enter the area of sensitivity <strong>in</strong>crease. It is well knownthat ORHP zeros <strong>in</strong>troduce additional phase lag 8 to the open-loop system,which is evident from the fact that each factor of B T <strong>in</strong> (3.26) satisfiesarg q i − jω→ −π as ω → ∞. (3.32)q i + jωHence the trade-off between areas of sensitivity attenuation <strong>and</strong> amplificationcan also be observed <strong>in</strong> the Nyquist plot of a nonm<strong>in</strong>imum phaseopen-loop system. As an example, Figure 3.9 shows the Nyquist plot ofthe function L(s) = (1 − s)/(1 + s) 2 together with its m<strong>in</strong>imum phasecounterpart ˜L(s) = 1/(1 + s). It can be seen from this figure that the area8 By additional phase lag <strong>in</strong>troduced by an ORHP zero we mean the difference betweenthe phase of the system with the ORHP zero <strong>and</strong> that of a system where the ORHP zero isreplaced by its reflection <strong>in</strong> the OLHP.


68 3. SISO <strong>Control</strong>balance is apparent for the nonm<strong>in</strong>imum phase system whilst the Nyquistplot of the m<strong>in</strong>imum phase system can be kept <strong>in</strong>side the area of sensitivityreduction save, of course, at <strong>in</strong>f<strong>in</strong>ite frequency.jω|S(jω)| > 1|S(jω)| < 1−1˜L(jω)σL(jω)FIGURE 3.9. Area balance for a nonm<strong>in</strong>imum phase system.From the expression T = 1/(1 + 1/L), it is evident that a similar graphicanalysis can be done for the complementary sensitivity function by study<strong>in</strong>gthe Nyquist plot of 1/L. Hence, the additional “phase lag” <strong>in</strong>troducedby ORHP poles (of L) to the function 1/L are now seen to be the cause ofthe need to balance the weighted areas where |T(jω)| < 1 <strong>and</strong> |T(jω)| > 1.The weight<strong>in</strong>g function (3.31) — or Poisson kernel for the right halfplane — <strong>in</strong> the <strong>in</strong>tegral constra<strong>in</strong>ts (3.29) <strong>and</strong> (3.30) takes explicit accountof the effect of the additional phase lag <strong>in</strong>troduced by ORHP zeros orpoles. 9 To see this, given an <strong>in</strong>terval [−ω 1 , ω 1 ], ω 1 > 0, consider the <strong>in</strong>tegralof the weight<strong>in</strong>g function (3.31), i.e.,Θ s0 (ω 1 ) =∫ ω1−ω 1W s0 (ω) dω∫ ω1σ 0−ω 1σ 2 0 + (ω 0 − ω) dω 2= arctan ω 1 − ω 0σ 0+ arctan ω 1 + ω 0σ 0.It is then easy to see that, for a real zero s 0 = q = σ q , we have(3.33)Θ q (ω 1 ) = − arg σ q − jω 1σ q + jω 1,whereas for a pair of complex conjugate zeros s 0 = q <strong>and</strong> q, we haveΘ q (ω 1 ) = − 1 [arg q − jω 1+ arg q − jω ]1. (3.34)2 q + jω 1 q + jω 19 S<strong>in</strong>ce the poles are here seen as zeros of 1/L, we will use the word “zeros” <strong>in</strong> the discussionabout the weight<strong>in</strong>g function.


3.3 Poisson Integral Formulae 69Note that Θ s0 (ω 1 ) represents the weighted length (by the ORHP zero s 0 )of the frequency <strong>in</strong>terval [−ω 1 , ω 1 ]; thus, the weighted length of such an<strong>in</strong>terval is equal to m<strong>in</strong>us the additional phase lag <strong>in</strong>troduced by the realORHP zero (or to m<strong>in</strong>us half the additional phase lag <strong>in</strong>troduced by thepair of complex conjugate zeros) at the upper endpo<strong>in</strong>t of the <strong>in</strong>terval.The above <strong>in</strong>terpretation of the weighted length of the frequency <strong>in</strong>tervalis useful <strong>in</strong> quantify<strong>in</strong>g trade-offs implied by the Poisson <strong>in</strong>tegralsrelative to the location of the ORHP zero. As an example, we will next revisitthe water-bed effect of §3.2. We view it here <strong>in</strong> the light of the <strong>in</strong>tegralconstra<strong>in</strong>t (3.29).Suppose, as <strong>in</strong> §3.1.3, that the feedback loop has been designed to achieve|S(jω)| ≤ α 1 < 1 , ∀ω ∈ Ω 1 [−ω 1 , ω 1 ] . (3.35)Let q be an ORHP zero of L <strong>and</strong> consider the weighted length of the <strong>in</strong>tervalΩ 1 , i.e., Θ q (ω 1 ) as <strong>in</strong> (3.33). Then the <strong>in</strong>f<strong>in</strong>ity norm of the sensitivityfunction has a lower bound as shown <strong>in</strong> the follow<strong>in</strong>g result.Corollary 3.3.3. Let S be the sensitivity function def<strong>in</strong>ed by (3.8) <strong>and</strong> supposethat the open-loop system L can be factored as <strong>in</strong> (3.27). Assume thatthe closed-loop system is stable <strong>and</strong> that the goal (3.35) has been achieved.Then, for each ORHP zero of L, q, we have( ) Θq(ω 1)1 π−Θ q(ω 1 ) ∣ π‖S‖ ∞ ≥∣B −1Sα (q)∣ ∣ π−Θ q(ω 1 ) , (3.36)1where Θ q (ω 1 ) is as <strong>in</strong> (3.33) with s 0 = q.Proof. Divid<strong>in</strong>g the range of <strong>in</strong>tegration <strong>in</strong> (3.29), <strong>and</strong> us<strong>in</strong>g the <strong>in</strong>equalities(3.35) <strong>and</strong> |S(jω)| ≤ ‖S‖ ∞ , ∀ω, we havelog α 1 Θ q (ω 1 ) + log ‖S‖ ∞ [π − Θ q (ω 1 )] ≥ π log ∣ ∣ B−1S (q)∣ ∣ . (3.37)The result then follows by exponentiat<strong>in</strong>g both sides.□It is immediate from (3.36) that the lower bound on the sensitivity peakis strictly greater than one. Indeed, this follows from the fact that ∣ B−1S(q)∣ ∣ ≥1, α 1 < 1 <strong>and</strong> Θ q (ω 1 ) < π. Note also that if the open-loop system is bothnonm<strong>in</strong>imum phase <strong>and</strong> unstable, then the sensitivity peak is greater thanone even if there is no region of sensitivity reduction.Remark 3.3.1. Corollary 3.3.3 provides another <strong>in</strong>terpretation for the waterbedeffect of Theorem 3.2.1: if the open-loop system is nonm<strong>in</strong>imum phase,then requir<strong>in</strong>g |S(jω)| to be small over a range of frequencies will necessarilylead to a large sensitivity peak outside that range. In this new form,however, the lower bound on the sensitivity peak exhibits an explicit dependenceon the location of the ORHP zero of the open-loop system. Indeed,recall from our previous discussion that Θ q (ω 1 ) is equal to m<strong>in</strong>us


70 3. SISO <strong>Control</strong>the additional phase lag <strong>in</strong>troduced by the real ORHP zero q (or to m<strong>in</strong>ushalf the additional phase lag <strong>in</strong>troduced by the pair of complex conjugatezeros q <strong>and</strong> q) at the frequency ω 1 . Thus, if at ω = ω 1 the ORHP zerocontributes with a significant amount of lag (Θ q (ω 1 ) close to π, say) thefirst term on the LHS of (3.37) will be large (<strong>and</strong> negative) <strong>and</strong> hence willrequire a large value of ‖S‖ ∞ to satisfy the lower bound on the RHS. Onthe other h<strong>and</strong>, if the lag <strong>in</strong>troduced by the zero at ω 1 is small then thetrade-off implied by (3.36) will not be severe.Observe that, s<strong>in</strong>ce |S(jω)| ≤ α 1 < 1 implies |L(jω)| ≥ 1/α 1 − 1, a desirablefeedback design objective would be to roll-off the open-loop ga<strong>in</strong>well before the phase lag <strong>in</strong>troduced by the ORHP zero becomes significant.This confirms a well-known rule of thumb used <strong>in</strong> classical feedbackcontrol design.◦A result parallel to Corollary 3.3.3 for the complementary sensitivityfunction can be easily obta<strong>in</strong>ed if we assume that the follow<strong>in</strong>g goal hasbeen achieved (e.g., for robustness purposes)|T(jω)| ≤ α 2 < 1 , ∀ω ∈ Ω 2 [−∞, −ω 2 ] ∪ [ω 2 , ∞] . (3.38)We then have the follow<strong>in</strong>g result.Corollary 3.3.4. Let T be the sensitivity function def<strong>in</strong>ed by (3.8) <strong>and</strong> supposethat the open-loop system L can be factored as <strong>in</strong> (3.27). Assume thatthe closed-loop system is stable <strong>and</strong> that the goal (3.38) has been achieved.Then, for each ORHP pole of L, p, we have‖T‖ ∞ ≥( 1α 2) π−Θp(ω 2)Θ p(ω 2 )∣ ∣B −1Twhere Θ p (ω 1 ) is as <strong>in</strong> (3.33) with s 0 = p.π(p)eσpτ∣ ∣ Θ p(ω 2 ) , (3.39)Proof. Similar to the proof of Corollary 3.3.3, us<strong>in</strong>g the bound (3.38).□Notice that, accord<strong>in</strong>g to the def<strong>in</strong>ition of closed-loop b<strong>and</strong>width <strong>in</strong>§3.1.3, ω 2 <strong>in</strong> (3.38) is equal to the b<strong>and</strong>width ω b if α 2 is taken to be 1/ √ 2.In view of this observation, Corollary 3.3.4 can be used to obta<strong>in</strong> boundson the closed-loop b<strong>and</strong>width, as shown <strong>in</strong> the follow<strong>in</strong>g example.Example 3.3.1. Consider the feedback system of Figure 3.2, <strong>and</strong> assume,for simplicity, that the open-loop system L has a real ORHP pole p = σ p ,is m<strong>in</strong>imum phase <strong>and</strong> has no time delay, i.e., q = 0 <strong>and</strong> τ = 0 <strong>in</strong> (3.27).Assume that the closed-loop system is stable <strong>and</strong> satisfies (3.38) for α 2 =1/ √ 2 (<strong>and</strong> hence ω 2 = ω b ). Then‖T‖ ∞ ≥ √ π−Θ p(ω b )Θ2 p(ω b ).


3.3 Poisson Integral Formulae 71Suppose further that we impose the condition that the RHS of the aboveexpression be less than or equal to T m , which is necessary if we require thatthe peak complementary sensitivity (i.e., the LHS of the above expression)be less than or equal to T m . Us<strong>in</strong>g this additional requirement we obta<strong>in</strong>the follow<strong>in</strong>g lower bound on the closed-loop b<strong>and</strong>width()πω b ≥ p tan2 + 2 log T m / log √ . (3.40)2We conclude from (3.40), for example, that an open-loop system hav<strong>in</strong>g areal ORHP pole, p, requires a b<strong>and</strong>width at least equal to p if it is desiredto keep the lower bound on the peak complementary sensitivity smallerthan T m = √ 2.Notice that the lower bounds on the b<strong>and</strong>width given by (3.22) <strong>and</strong>(3.40) are not directly comparable, s<strong>in</strong>ce (3.22) was obta<strong>in</strong>ed from requirementson S while (3.40) follows from specifications on T.◦A more realistic design will require both (3.35) <strong>and</strong> (3.38) to be satisfied.Note that (3.38) implies |S(jω)| ≤ 1 + α 2 , ∀ω ∈ Ω 2 . Figure 3.10 representsthe comb<strong>in</strong>ed shape specifications on |S(jω)|.|S(jω)|1α 1α 2ω 1 ω 2 ωFIGURE 3.10. Frequency specifications for S.Us<strong>in</strong>g this additional <strong>in</strong>formation as <strong>in</strong> the proof of Corollary 3.3.3 weobta<strong>in</strong>( )Θ q(ω 1 )( ) π−Θq(ω 2)1 Θ q(ω 2 )−Θ q(ω 1 ) 1 Θ q(ω 2 )−Θ q(ω 1 )‖S‖ ∞ ≥α 1 1 + α 2× ∣ ∣ B−1S(q)∣ ∣πΘ q(ω 2 )−Θ q(ω 1 ) .The follow<strong>in</strong>g example exam<strong>in</strong>es the severity of constra<strong>in</strong>t (3.41).(3.41)Example 3.3.2. Suppose that we desire to control a plant with a s<strong>in</strong>glenonm<strong>in</strong>imum phase zero q > 0. We assume for simplicity that the plant


72 3. SISO <strong>Control</strong>is open-loop stable. We desire for the closed-loop system a sensitivity reductionof at least α 1 with<strong>in</strong> the <strong>in</strong>terval of frequencies [0, ω 1 ], where ω 1is chosen to be three quarters of the desired closed-loop b<strong>and</strong>width frequencyω b , i.e., the specification (3.35) holds with ω 1 = 0.75ω b .Follow<strong>in</strong>g the def<strong>in</strong>ition of closed-loop b<strong>and</strong>width on page 60, we canwrite the requirement of b<strong>and</strong>width by sett<strong>in</strong>g ω 2 = ω b <strong>and</strong> α 2 = 1/ √ 2<strong>in</strong> (3.38) (see also Example 3.3.1).Figure 3.11 plots the lower bound (3.41) on ‖S‖ ∞ versus the position ofthe nonm<strong>in</strong>imum phase zero (relative to the desired b<strong>and</strong>width), <strong>and</strong> fordifferent values of sensitivity reduction.5432α 1 =0.8 0.6 0.4 0.2100 0.5 1 1.5 2q/ω bFIGURE 3.11. Lower bound (3.41) on ‖S‖ ∞.As anticipated <strong>in</strong> Remark 3.3.1, the picture shows that the constra<strong>in</strong>tsimposed by a nonm<strong>in</strong>imum phase zero worsen the more the zero is with<strong>in</strong>the desired closed-loop b<strong>and</strong>width, which manifests as higher peaks <strong>in</strong>|S(jω)|. S<strong>in</strong>ce, as seen <strong>in</strong> Chapter 2, large peaks <strong>in</strong> |S(jω)| are associatedwith poor sensitivity <strong>and</strong> robustness properties, then a nonm<strong>in</strong>imum phasezero imposes a trade-off <strong>in</strong> design that limits the achievable closed-loopb<strong>and</strong>width of the system. The same conclusion holds for complex nonm<strong>in</strong>imumphase zeros.◦Notice that the bound (3.41) is satisfied with equality for an “ideal” sensitivityfunction that is sectionally constant <strong>and</strong> equal to α 1 <strong>in</strong> Ω 1 , to ‖S‖ ∞<strong>in</strong> [−ω 2 , −ω 1 ] ∪ [ω 1 , ω 2 ], <strong>and</strong> to 1 + α 2 <strong>in</strong> Ω 2 . Hence, the lower boundgiven by (3.41) is <strong>in</strong> fact conservative. The closer the actual sensitivity is tothe “ideal” function, the tighter the bound (3.41) is. 10 Although, <strong>in</strong> theory,this sectionally constant function can be arbitrarily approximated by a ra-10 Less conservative bounds can be also derived from these <strong>in</strong>tegral constra<strong>in</strong>ts by consider<strong>in</strong>gmore realistic shape specifications; see for example Middleton (1991) or Middleton <strong>and</strong>Goodw<strong>in</strong> (1990, Chapter 13).


3.3 Poisson Integral Formulae 73tional function, there exist a number of practical limitations <strong>in</strong> achiev<strong>in</strong>gthis result. For example, a restriction on the complexity of the controllerwould mean that the ideal case cannot be achieved.F<strong>in</strong>ally, if the plant has more than one ORHP zero, bounds similar to(3.36) or (3.41) hold for each of them, which, <strong>in</strong> general, will provide different<strong>in</strong>formation. Thus, s<strong>in</strong>ce each of these bounds may not be tight ifconsidered alone, it will be necessary to analyze them <strong>in</strong> comb<strong>in</strong>ation toobta<strong>in</strong> the overall trade-offs imposed by the set of ORHP zeros. However,this will not be as <strong>in</strong>sightful <strong>and</strong> simple as <strong>in</strong> the case of a s<strong>in</strong>gle zero,which shows a limitation of the approach. A different technique, basedon Nevanl<strong>in</strong>na-Pick theory, has been described by O’Young <strong>and</strong> Francis(1985). This technique provides an iterative procedure to compute “hard”bounds on the sensitivity function for multivariable plants with severalORHP zeros.3.3.3 Example: Inverted PendulumConsider aga<strong>in</strong> the <strong>in</strong>verted pendulum shown <strong>in</strong> Figure 3.12, studied <strong>in</strong>§1.3.3 of Chapter 1 <strong>in</strong> the time doma<strong>in</strong>.θ muyMlFIGURE 3.12. Inverted pendulum.It was seen <strong>in</strong> §1.3.3 that the l<strong>in</strong>earized model for this system has a transferfunction from u to y of the formY(s) (s − q)(s + q)=U(s) M s 2 (s − p)(s + p) ;i.e., it has one nonm<strong>in</strong>imum phase zero <strong>and</strong> one unstable pole. Assumethat the parameters <strong>in</strong> the model are chosen as <strong>in</strong> §1.3.3, i.e., so that theORHP zero is q = 1 <strong>and</strong> the pole is moved accord<strong>in</strong>g to four differentvalues of the mass ratio m/M. For example, for m/M = 1, the pole isp = √ 2. Us<strong>in</strong>g Corollary 3.3.3 <strong>and</strong> Corollary 3.3.4 with the latter valuesof p <strong>and</strong> q, <strong>and</strong> assum<strong>in</strong>g no particular range of sensitivity reduction (i.e.,tak<strong>in</strong>g ω 1 = 0 <strong>in</strong> (3.35) <strong>and</strong> ω 2 = ∞ <strong>in</strong> (3.38)), then the peak sensitivity<strong>and</strong> complementary sensitivity satisfy‖S‖ ∞ ≥ 5.8284, <strong>and</strong> ‖T‖ ∞ ≥ 5.8284. (3.42)


74 3. SISO <strong>Control</strong>Figure 3.13 shows log |S(jω)| <strong>and</strong> log |T(jω)| achieved by the same LQG-LQR design used <strong>in</strong> §1.3.3, for m/M = 0.1, 0.2, 0.4, 1. These figures corre-10 2m / M = 0.110 2m / M = 0.1log |S(jω)|10 010 −210 4 ωm / M = 1log |T(jω)|10 010 −210 4 ωm / M = 110 −410 −410 −610 −4 10 −2 10 0 10 2 10 410 −610 −4 10 −2 10 0 10 2 10 4FIGURE 3.13. log |S(jω)| <strong>and</strong> log |T(jω)| for the <strong>in</strong>verted pendulum.spond to the output time response plotted <strong>in</strong> Figure 1.5 of Chapter 1. Notethat the actual peaks are much larger than predicted, which is due to thefact that the bounds (3.42) assume |S| <strong>and</strong> |T| flat <strong>and</strong> equal to their <strong>in</strong>f<strong>in</strong>itynorms. Tighter bounds can be obta<strong>in</strong>ed by assum<strong>in</strong>g that |S| has a sectionallyconstant shape as <strong>in</strong> (3.41), or us<strong>in</strong>g more sophisticated shapes as <strong>in</strong>Middleton (1991). Note, however, that the bounds derived <strong>in</strong> this chapterare valid for any design methodology, <strong>and</strong> hence it is expected that theywill be conservative for some particular design choices.3.4 Discrete SystemsThis section describes the extension of the Poisson <strong>and</strong> Bode <strong>in</strong>tegral relationsto discrete-time feedback control systems. We aga<strong>in</strong> focus on sensitivity<strong>and</strong> complementary sensitivity functions of feedback control systems,but the transfer functions are here complex functions of the variablez, i.e., they represent the Z-transform of impulse responses of discrete-timesystems. We will see that the cont<strong>in</strong>uous-time results extend <strong>in</strong> a straightforwardfashion to discrete time by means of the Poisson <strong>in</strong>tegral formulafor the unit disk, treated <strong>in</strong> §A.6.2 of Appendix A. We will also see, however,that <strong>in</strong> general, the discrete constra<strong>in</strong>ts are more dem<strong>and</strong><strong>in</strong>g s<strong>in</strong>cethe sensitivity trade-offs must be achieved on the f<strong>in</strong>ite frequency <strong>in</strong>terval[0, π].The results <strong>in</strong> this section apply to systems (plant <strong>and</strong> controller) whoseorig<strong>in</strong>al description is <strong>in</strong> discrete time; if the orig<strong>in</strong>al description of theplant is <strong>in</strong> cont<strong>in</strong>uous time <strong>and</strong> the controller is digital, we call the overallsystem a sampled-data system, <strong>and</strong> the analysis is more adequately per-


3.4 Discrete Systems 75formed <strong>in</strong> cont<strong>in</strong>uous time. Sampled-data control systems are discussed<strong>in</strong> Chapter 6.3.4.1 Poisson Integrals for S <strong>and</strong> TConsider the feedback control loop of Figure 3.14, where the open-loopsystem, L, is a proper transfer function of the complex variable z.r + e y❜ ✲ ✐ ✲ L(z) ✲ ❜− ✻FIGURE 3.14. Discrete-time feedback control system.As <strong>in</strong> (3.8), the sensitivity <strong>and</strong> complementary sensitivity functions ofthe loop <strong>in</strong> Figure 3.14 are def<strong>in</strong>ed asS(z) =11 + L(z)<strong>and</strong> T(z) = L(z)1 + L(z) , (3.43)respectively. In common with the cont<strong>in</strong>uous-time case, the sensitivity <strong>and</strong>complementary sensitivity functions for the discrete case satisfy <strong>in</strong>terpolationsconstra<strong>in</strong>ts imposed by zeros <strong>and</strong> poles of the open-loop system.Indeed, if L is free of unstable pole-zero cancelations, 11 then S has zeros atthe unstable open-loop plant poles <strong>and</strong> T has zeros at the nonm<strong>in</strong>imumphase open-loop plant zeros; i.e., Lemma 3.1.3 holds with ¢ +replaced bythe region outside the open unit disk, which we denote by £ c .For the purpose of deriv<strong>in</strong>g Poisson <strong>in</strong>tegral constra<strong>in</strong>ts, we need toextract from S <strong>and</strong> T all the zeros outside the closed 12 unit disk — denoted£cby . Let qi , i = 1, . . . , n q , be the zeros of L £c<strong>in</strong> , <strong>and</strong> pi , i = 1, . . . , n p ,be the poles of L <strong>in</strong> £c, all counted with multiplicities. It follows from ourprevious discussion that the q i ’s are the zeros of T £c<strong>in</strong> , <strong>and</strong> the pi ’s arethe zeros of S £c<strong>in</strong> . We <strong>in</strong>troduce the discrete Blaschke products of the zerosof S <strong>and</strong> T, given byB S (z) =n p∏i=1p i|p i |nz − p i1 − p i z , <strong>and</strong> B ∏ qT (z) =i=1q i|q i |z − q i1 − q i z . (3.44)11 Recall from Chapter 2 that, for discrete-time systems, a transfer function is nonm<strong>in</strong>imumphase if it has zeros outside the open unit disk, , <strong>and</strong> it is unstable if it has poles outside. Thus, L is free of unstable pole-zero cancelations if there are no cancelations of zeros <strong>and</strong>poles outside between the plant <strong>and</strong> controller whose cascade connection form L.12 Recall that, <strong>in</strong> the cont<strong>in</strong>uous-time case, only the zeros <strong>and</strong> poles of the plant <strong>in</strong> theORHP impose <strong>in</strong>tegral constra<strong>in</strong>ts. Those on the imag<strong>in</strong>ary axis do not contribute to thevalue of the <strong>in</strong>tegrals (see Lemma A.6.2 <strong>and</strong> Corollary A.6.3 <strong>in</strong> Appendix A).


76 3. SISO <strong>Control</strong>We will def<strong>in</strong>e B S (z) = 1 if L is stable, <strong>and</strong> B T (z) = 1 if L is m<strong>in</strong>imumphase. Similar to their cont<strong>in</strong>uous-time counterpart, the discrete Blaschkeproducts are “all-pass” functions, s<strong>in</strong>ce their magnitude is constant <strong>and</strong>equal to one on the unit circle |z| = 1. Us<strong>in</strong>g (3.44), <strong>and</strong> extract<strong>in</strong>g thezeros at <strong>in</strong>f<strong>in</strong>ity <strong>in</strong> a similar fashion as was done <strong>in</strong> (2.7) <strong>in</strong> Chapter 2, wecan factor the open-loop transfer function, L, asL(z) = B −1S (z) B T (z) ˜L(z) z −δ , (3.45)where ˜L is a stable, m<strong>in</strong>imum-phase transfer function, hav<strong>in</strong>g relative degreezero. It is then easy to see that S <strong>and</strong> T <strong>in</strong> (3.43) can be factored asS(z) = ˜S(z)B S (z) ,T(z) = ˜T(z)B T (z)z −δ ,(3.46)where ˜S <strong>and</strong> ˜T have no zeros <strong>in</strong> the region outside the closed unit disk (<strong>in</strong>clud<strong>in</strong>g<strong>in</strong>f<strong>in</strong>ity). Note that here the zeros at <strong>in</strong>f<strong>in</strong>ity of T (which are branchpo<strong>in</strong>ts of log T, see §A.9.2 <strong>in</strong> Appendix A) have to be factored explicitlys<strong>in</strong>ce they will contribute to the value of the Poisson <strong>in</strong>tegral of log T onthe unit circle. This is because the po<strong>in</strong>t at <strong>in</strong>f<strong>in</strong>ity is an <strong>in</strong>terior po<strong>in</strong>t ofthe region “encircled” by the unit £ccircle, , <strong>and</strong> thus it should be treatedas any other f<strong>in</strong>ite s<strong>in</strong>gularity of log T £c<strong>in</strong> . Note that this is not the casefor cont<strong>in</strong>uous-time systems, i.e., the s<strong>in</strong>gularities at <strong>in</strong>f<strong>in</strong>ity of log T aris<strong>in</strong>gfrom a strictly proper L do not add to the value of the Poisson <strong>in</strong>tegralof log T on the imag<strong>in</strong>ary axis (see Corollary A.6.3 <strong>in</strong> Appendix A, <strong>and</strong> theproof of Theorem 3.3.2).Assum<strong>in</strong>g stability of the closed-loop system, <strong>and</strong> s<strong>in</strong>ce, as just seen, ˜S<strong>and</strong> ˜T <strong>in</strong> (3.46) have no zeros £c<strong>in</strong> , then the functions log ˜S <strong>and</strong> log ˜T areanalyticc c<strong>in</strong> £ . The real parts of these functions are harmonic <strong>in</strong> £ , <strong>and</strong>they thus satisfy the conditions of Corollary A.6.5 <strong>in</strong> Appendix A. We thenobta<strong>in</strong> the follow<strong>in</strong>g results.Theorem 3.4.1 (Poisson Integral for S). Let S be the discrete sensitivityfunction def<strong>in</strong>ed by (3.43). Assume that the open-loop system L can befactored as <strong>in</strong> (3.45) <strong>and</strong> let q = r q e jθq be a zero of L £c<strong>in</strong> . Then, if theclosed-loop system is stable,∫ π−πlog |S(e jθ r 2 q − 1)|1 − 2r q cos(θ−θ q ) + r 2 qdθ = 2π log ∣ ∣ B−1S (q)∣ ∣ . (3.47)Proof. Under the assumptions of the theorem, S can be factored as <strong>in</strong> (3.46),where ˜S is a stable, m<strong>in</strong>imum-phase transfer function, hav<strong>in</strong>g relative degreezero. It follows that the function log | ˜S| is harmonic £c<strong>in</strong> . Us<strong>in</strong>gCorollary A.6.5 <strong>in</strong> Appendix A with u = log | ˜S| <strong>and</strong> s 0 = q, <strong>and</strong> not<strong>in</strong>gthat | ˜S(e jθ )| = |S(e jθ )| (s<strong>in</strong>ce ∣ BS (e jθ ) ∣ = 1, ∀θ) <strong>and</strong> that ˜S(q) = B −1S(q)(s<strong>in</strong>ce S(q) = 1), yields the desired result.□


3.4 Discrete Systems 77The correspond<strong>in</strong>g result for T is as follows.Theorem 3.4.2 (Poisson Integral for T). Let T be the complementary sensitivityfunction def<strong>in</strong>ed by (3.43). Assume that the open-loop system Lcan be factored as <strong>in</strong> (3.45) <strong>and</strong> let p = r p e jθp be a pole of L £c<strong>in</strong> . Then, ifthe closed-loop system is stable,∫ π−πlog |T(e jθ r 2 p − 1)|1 − 2r p cos(θ−θ p ) + r 2 pdθ = 2π log ∣ ∣B −1T (p)∣ ∣ + 2πδ log |p| .(3.48)Proof. The proof uses Corollary A.6.5 <strong>in</strong> Appendix A with u = log | ˜T|,where ˜T is given <strong>in</strong> (3.46), <strong>and</strong> s 0 = p. Then (3.48) follows by similar argumentsto those used <strong>in</strong> the proof of Theorem 3.4.1.□As <strong>in</strong> the cont<strong>in</strong>uous-time case, two technical comments are <strong>in</strong> order.We first note that Theorems 3.4.1 <strong>and</strong> 3.4.2 still hold when L has polesor zeros on the unit circle. Second, if the zeros <strong>and</strong>/or poles of L outsidethe unit disk have multiplicities greater than one, then additional <strong>in</strong>tegralconstra<strong>in</strong>ts on the derivatives of log ˜S <strong>and</strong> of log ˜T can be derived.Also similar to the Poisson <strong>in</strong>tegral constra<strong>in</strong>ts for cont<strong>in</strong>uous-time systems,the relations (3.47) <strong>and</strong> (3.48) represent a balance between weightedareas of sensitivity or complementary sensitivity attenuation <strong>and</strong> amplification.This is because: (i) for z 0 = r 0 e jθ 0with r 0 > 1, the weight<strong>in</strong>gfunction (or Poisson kernel for the unit disk)r 2 0W z0 (θ) − 11 − 2r 0 cos(θ − θ 0 ) + r 2 0(3.49)is positive for all θ, <strong>and</strong> (ii) the RHSs of both equations are nonnegatives<strong>in</strong>ce, for a discrete Blaschke product B, we have that ∣ B −1 (z) ∣ ≥ 1 for|z| > 1. These facts imply that the weighted area of sensitivity <strong>in</strong>creasemust be, at least, as large as the weighted area of sensitivity reduction.This situation is aggravated if the open-loop system is both unstable <strong>and</strong>nonm<strong>in</strong>imum phase, s<strong>in</strong>ce <strong>in</strong> this case both ∣ ∣B −1S(q)∣ ∣ <strong>and</strong> ∣ ∣B −1T(p)∣ ∣ on theRHSs of (3.47) <strong>and</strong> (3.48) are strictly greater than one. Moreover, the closerthe unstable zeros <strong>and</strong> poles are to each other, the larger the RHSs of bothequations become.Note also the similarities between Theorem 3.4.2 <strong>and</strong> its counterpart forcont<strong>in</strong>uous-time systems, Theorem 3.3.2. In particular, the term due to therelative degree of the plant <strong>in</strong> the discrete case <strong>in</strong> (3.48) is analogous tothat due to a pure time delay <strong>in</strong> (3.30). Hence, the constra<strong>in</strong>ts imposedon T by open loop time delays <strong>and</strong> unstable poles are also present <strong>in</strong> thediscrete case, <strong>and</strong> aga<strong>in</strong>, the “more unstable” the pole, <strong>and</strong> the larger thetime delay, the worse these constra<strong>in</strong>ts will be.In the follow<strong>in</strong>g section we will discuss some of the design trade-offsimplied by Theorems 3.4.1 <strong>and</strong> 3.4.2.


78 3. SISO <strong>Control</strong>3.4.2 Design InterpretationsThe Poisson <strong>in</strong>tegrals of Theorem 3.4.1 <strong>and</strong> Theorem 3.4.2 can be usedto derive lower bounds on the <strong>in</strong>f<strong>in</strong>ity norms of the discrete sensitivityfunctions. As <strong>in</strong> the cont<strong>in</strong>uous-time case, these lower bounds exhibit thewater-bed effect experienced by nonm<strong>in</strong>imum phase plants when sensitivityreduction is required over some frequency range.To analyze the water-bed effect, we require the weighted length of an<strong>in</strong>terval on the unit disk by the kernel (3.49) (cf. the correspond<strong>in</strong>g for theweight<strong>in</strong>g function for the half plane, given by (3.33)), i.e.,∫ θ1Θ z0 (θ 1 ) 1 2W z0 (θ) dθ−θ 1[r0 + 1= arctanr 0 − 1 tan θ ] [1 − θ 0r0 + 1+ arctan2r 0 − 1 tan θ ]1 + θ 0,2(3.50)where the <strong>in</strong>terval of <strong>in</strong>terest is Θ 1 = [−θ 1 , θ 1 ], θ 1 < π. It is possible toshow that Θ z0 (θ 1 ) is equal to m<strong>in</strong>us half the sum of the phases of the discreteBlaschke products correspond<strong>in</strong>g to z 0 <strong>and</strong> its conjugate, evaluatedat z = e jθ 1, i.e.,Θ z0 (θ 1 ) = − 1 [arg z 0 e jθ 1− z 02 r 0 1 − z 0 e jθ + arg z 0 e jθ 1]− z 01 r 0 1 − z 0 e jθ .1This was also the case for the weighted length of an <strong>in</strong>terval on the imag<strong>in</strong>aryaxis by the Poisson kernel for the right half plane, as can be seen fromequation (3.34).Suppose next that the discrete feedback control loop has been designedto achieve the goal|S(e jθ )| ≤ α < 1 , ∀θ ∈ Θ 1 [−θ 1 , θ 1 ] , θ 1 < π . (3.51)Let q be a zero of L £c<strong>in</strong> <strong>and</strong> consider the weighted length of the <strong>in</strong>tervalΘ 1 , i.e., Θ q (θ 1 ) as <strong>in</strong> (3.50). Then the <strong>in</strong>f<strong>in</strong>ity norm of the sensitivityfunction has a lower bound as shown <strong>in</strong> the follow<strong>in</strong>g result.Corollary 3.4.3. Let S be the sensitivity function def<strong>in</strong>ed by (3.43) <strong>and</strong> supposethat the open-loop system L can be factored as <strong>in</strong> (3.45). Assume thatthe closed-loop system is stable <strong>and</strong> that the goal (3.51) has been achieved.Then, for each zero of L £c<strong>in</strong> , q = rq e jθq ,( 1‖S‖ ∞ ≥α) Θq(θ 1)π−Θ q(θ 1 ) ∣ π∣B −1S(q)∣ ∣ π−Θ q(θ 1 ) , (3.52)where Θ q is the function def<strong>in</strong>ed <strong>in</strong> (3.50).


3.4 Discrete Systems 79Proof. The proof follows that of Corollary 3.3.3.□Similar comments to those follow<strong>in</strong>g Corollary 3.3.3 are also relevanthere. In particular, we note that the lower bound on the RHS of (3.52) isstrictly greater than one <strong>and</strong> it becomes larger when the open-loop systemhas both zeros <strong>and</strong> poles £c<strong>in</strong> . Also, parallel lower bounds can be derivedfor the complementary sensitivity function, as well as bounds aris<strong>in</strong>g fromcomb<strong>in</strong>ed specifications for S <strong>and</strong> T. The <strong>in</strong>terested reader is encouragedto restate the results <strong>in</strong> §3.3.2 for the discrete-time case.3.4.3 Bode Integrals for S <strong>and</strong> TIn this section we obta<strong>in</strong> Bode <strong>in</strong>tegral constra<strong>in</strong>ts for the discrete-timesensitivity <strong>and</strong> complementary sensitivity functions. Recall that, <strong>in</strong> Theorems3.1.4 <strong>and</strong> 3.1.5, we established the correspond<strong>in</strong>g results for cont<strong>in</strong>uoustimesystems by direct contour <strong>in</strong>tegration. We will not follow the sameprocedure here, but <strong>in</strong>stead we will derive the Bode <strong>in</strong>tegrals from thePoisson <strong>in</strong>tegral formula for the disk. This is done <strong>in</strong> the follow<strong>in</strong>g results.Theorem 3.4.4 (Bode Integral for S). Let S be the sensitivity function def<strong>in</strong>edby (3.43). Assume that the open-loop system, L, is strictly proper<strong>and</strong> let {p i : i = 1, . . . , n p } be the set of poles of L £c<strong>in</strong> . Then, assum<strong>in</strong>gclosed-loop stability,∫ π0n p∑log |S(e jθ )| dθ = π log |p i | . (3.53)i=1Proof. As <strong>in</strong> the proof of Theorem 3.4.1, we use the factorization (3.46) <strong>and</strong>apply Corollary A.6.5 <strong>in</strong> Appendix A with u = log | ˜S| <strong>and</strong> s 0 = r, r > 1.This yields∫1 π2π −πlog |S(e jθ r 2 − 1)|1 − 2r cos θ + r 2 dθ = log | ˜S(r)|= log ∣ ∣ B−1S (r)∣ ∣ + log |S(r)| . (3.54)Next, we take limits as r → ∞ on both sides of (3.54). Us<strong>in</strong>g the uniformconvergence theorem to take limits <strong>in</strong>side the <strong>in</strong>tegral (Lev<strong>in</strong>son <strong>and</strong> Redheffer,1970, p. 335), <strong>and</strong> the fact that |S(e −jθ )| = |S(e jθ )|, we have that theLHS <strong>in</strong> (3.54) tends to∫1 πlog |S(e jθ )| dθ .π 0


£80 3. SISO <strong>Control</strong>As for the RHS, note that lim r→∞ log |S(r)| = 0, s<strong>in</strong>ce L is strictly proper,<strong>and</strong>lim log ∣ n B−1r→∞S(r)∣ ∏ p∣ = lim logr→∞n pi=1∑= log |p i | .i=1∣1 − p i rr − p i∣ ∣∣∣The result (3.53) then follows.□The parallel result for T is given <strong>in</strong> the follow<strong>in</strong>g theorem.Theorem 3.4.5 (Bode Integral for T). Let T be the complementary sensitivityfunction def<strong>in</strong>ed by (3.43). Assume that the open-loop system L canbe factored as <strong>in</strong> (3.45) <strong>and</strong> let {q i : i = 1, . . . , n q } be the set of zeros of L <strong>in</strong>c. Suppose further that L(1) ≠ 0. Then, assum<strong>in</strong>g closed-loop stability,∫ π0logT(e −jθ )∣ T(1) ∣dθ1 − cos θ = πT(1) lim dT(z)z→1 dzwhere δ is the relative degree of L.n q∑+ πi=1|q i | 2 − 1|q i − 1| 2 + πδ ,(3.55)Proof. From the factorization (3.46), we write the Poisson Integral formula(Corollary A.6.5) for log( ˜T/T(1)) evaluated at a real po<strong>in</strong>t r > 1 (note thatT(1) ≠ 0 s<strong>in</strong>ce L(1) ≠ 0 by assumption). This gives∫ π0log∣˜T(e jθ )T(1)∣ r 2 − 1∣∣∣∣ 1 − 2r cos(θ) + r 2 dθ = π log T(r)T(1) ∣+ π log ∣ ∣ B−1T (r)∣ ∣ + πδ log r . (3.56)Next, we divide both sides of (3.56) by (r − 1) <strong>and</strong> take limits as r → 1. Wecompute each term <strong>in</strong> turn. First, by the uniform convergence theorem,the limit on the LHS of (3.56) can be brought <strong>in</strong>side the <strong>in</strong>tegral, <strong>and</strong> then,us<strong>in</strong>g the fact that | ˜T(e jθ )| = |T(e jθ )|, it is easy to check that the LHS of(3.55) is obta<strong>in</strong>ed.Then consider the first term on the RHS of (3.56). We have∣ 1 ∣∣∣π limr→1 (r − 1) log T(r)T(1) ∣ = π ( ) 22 lim 1 T(r)r→1 (r − 1) log T(1)(3.57)= πT(1) lim dT(z),z→1 dz(3.58)which follows by apply<strong>in</strong>g L’Hospital’s rule. This is the first term on theRHS of (3.55). The second term on the RHS of (3.55) is also obta<strong>in</strong>ed from


3.4 Discrete Systems 81a straightforward application of L’Hospital’s rule to the follow<strong>in</strong>g limitlimr→11(r − 1) log ∣ ∣B −1T(r)∣ ∣ = limr→112(r − 1)F<strong>in</strong>ally, the proof is concluded by not<strong>in</strong>g thatn q∑i=1log (1 − rq i)(1 − rq i )(r − q i )(r − q i )πδ log rlimr→1 r − 1 = πδ . □.We see <strong>in</strong> (3.55) that Theorem 3.4.5 is entirely analogous to its cont<strong>in</strong>uoustimecounterpart, Theorem 3.1.5. Indeed, the first term on the RHS of (3.55)is precisely m<strong>in</strong>us the reciprocal of the velocity constant of a discrete-timesystem equivalent to that <strong>in</strong> Figure 3.5. The effect of nonm<strong>in</strong>imum phasezeros of L is also equivalent to that <strong>in</strong> cont<strong>in</strong>uous-time, <strong>and</strong> so is the effectof the relative degree of L, which corresponds to a time delay of δ discreteunits. Hence, the same <strong>in</strong>terpretations that were given for Theorem 3.1.5apply to Theorem 3.4.5 for discrete-time systems.Nevertheless, an important difference between these <strong>in</strong>tegrals <strong>in</strong> thecont<strong>in</strong>uous- <strong>and</strong> discrete-time cases is that the latter <strong>in</strong>volves restrictionsover a f<strong>in</strong>ite <strong>in</strong>terval. We see this <strong>in</strong> more detail <strong>in</strong> the follow<strong>in</strong>g subsection,where we study the design trade-offs <strong>in</strong>duced by Bode’s discrete sensitivity<strong>in</strong>tegral.3.4.4 Design InterpretationsBode <strong>in</strong>tegral formulae, both <strong>in</strong> the discrete <strong>and</strong> cont<strong>in</strong>uous-time case, <strong>in</strong>dicatethat there is a balance of areas of sensitivity reduction <strong>and</strong> <strong>in</strong>crease.In the cont<strong>in</strong>uous-time case, however, the Bode <strong>in</strong>tegral formula does notimply a direct trade-off <strong>in</strong> design if additional b<strong>and</strong>width constra<strong>in</strong>ts arenot imposed (see §3.1.3). Indeed, s<strong>in</strong>ce <strong>in</strong>tegration is performed over an <strong>in</strong>f<strong>in</strong>iterange, the area of sensitivity reduction over a f<strong>in</strong>ite range of frequenciesmay be compensated by an area where |S| is allowed to be slightlygreater than one over an arbitrarily large range of frequencies.On the other h<strong>and</strong>, <strong>in</strong> the discrete-time case, the Bode <strong>in</strong>tegral impliesa nontrivial sensitivity trade-off even if no b<strong>and</strong>width constra<strong>in</strong>t is assumed.This is seen from the follow<strong>in</strong>g immediate corollary.Corollary 3.4.6. Suppose that the conditions of Theorem 3.4.4 hold. Assumefurther that the goal (3.51) has been achieved. Then necessarily( 1‖S‖ ∞ ≥α) θ 1∣π−θ 1 ∣∣∣∣ n∏ p∣ π∣∣∣∣ π−θ 1p i . (3.59)i=1


82 3. SISO <strong>Control</strong>Proof. The proof follows by splitt<strong>in</strong>g the range of <strong>in</strong>tegration <strong>in</strong> (3.53) <strong>and</strong>us<strong>in</strong>g the bounds (3.51) <strong>and</strong> |S(e jθ )| ≤ ‖S‖ ∞ , ∀θ.□Similar results can be derived from Theorem 3.4.5 for the discrete complementarysensitivity function, show<strong>in</strong>g that discrete nonm<strong>in</strong>imum phasezeros <strong>and</strong> time delays <strong>in</strong>duce design trade-offs even if no b<strong>and</strong>width constra<strong>in</strong>tsare imposed on the discrete system.However, the above conclusions might be irrelevant to the real designproblem if the discrete system corresponds to the discretization of a cont<strong>in</strong>uoustimesystem where the controller is implemented digitally. This is discussed<strong>in</strong> the follow<strong>in</strong>g remark.Remark 3.4.1. Consider the discrete system shown <strong>in</strong> Figure 3.15. In thissystem, K d is a digital controller <strong>and</strong> (GH) d denotes the discrete equivalentof the cascade of a hold, H, <strong>and</strong> a cont<strong>in</strong>uous-time plant, G. Thisdiscretized plant is depicted <strong>in</strong> Figure 3.16, <strong>and</strong> usually arises <strong>in</strong> the analysis<strong>and</strong> design of cont<strong>in</strong>uous-time systems where the controller is implementeddigitally. 13{r k } +u❜ ✲ ✐ ✲ k{yK d✲ ✲ k }(GH) d ❜− ✻FIGURE 3.15. Discrete-time representation of a sampled-data control system.The signals <strong>in</strong> Figures 3.15 <strong>and</strong> 3.16 represent discrete-time signals, where{u k } is the discrete output of the digital controller, <strong>and</strong> {r k }, {y k } are thesampled values of the cont<strong>in</strong>uous-time reference <strong>and</strong> output signals, respectively.If we def<strong>in</strong>e L as the product of discrete controller <strong>and</strong> discretized plant,L = (GH) d K d , then the system may be studied by analyz<strong>in</strong>g the correspond<strong>in</strong>gdiscrete sensitivity <strong>and</strong> complementary sensitivity functions S<strong>and</strong> T def<strong>in</strong>ed <strong>in</strong> (3.43). The model of the system obta<strong>in</strong>ed by discretizationis LTI, due to the periodicity of the sampl<strong>in</strong>g process, which greatlysimplifies the analysis. However, a limitation of this method is that suchmodels fail to represent the full response of the system, s<strong>in</strong>ce <strong>in</strong>tersamplebehavior is <strong>in</strong>herently lost or hidden. 14It is well known that the poles of the discretized plant are determ<strong>in</strong>edby those of the cont<strong>in</strong>uous-time plant via the mapp<strong>in</strong>g z = e sτ , where τis the sampl<strong>in</strong>g period (e.g., Middleton <strong>and</strong> Goodw<strong>in</strong>, 1990). The discrete13 We give a more thorough treatment of these systems <strong>in</strong> Chapter 6, to which we refer forfurther details.14 Some <strong>in</strong>tersample <strong>in</strong>formation can still be h<strong>and</strong>led <strong>in</strong> a discrete model by us<strong>in</strong>g the modifiedZ-transform. However, this l<strong>in</strong>e of work will not be pursued here.


3.5 Summary 83..............................................(GH) d (z)❜u k✲u y ✟✲ ✲ ❛ y❄❛✲ ❜ kH(s) G(s)τ.............................................FIGURE 3.16. Discrete equivalent of the cascade of hold <strong>and</strong> plant.zeros of (GH) d , however, bear no simple relation to those of G <strong>and</strong>, <strong>in</strong>fact, may be arbitrarily assigned by suitable choice of the hold device (Åström<strong>and</strong> Wittenmark, 1990, p. 74).From this observation, it may be tempt<strong>in</strong>g to conclude that design limitationsdue to nonm<strong>in</strong>imum phase zeros of the analogue plant may becircumvented by assign<strong>in</strong>g the zeros of the discretized plant to be m<strong>in</strong>imumphase.Unfortunately, as we will see <strong>in</strong> Chapter 6, the difficulties imposed bynonm<strong>in</strong>imum phase zeros of the analogue plant rema<strong>in</strong> when the controlleris implemented digitally <strong>and</strong>, moreover, are <strong>in</strong>dependent of thetype of hold used. It is important that the <strong>in</strong>tersample behavior be exam<strong>in</strong>edif the problems are to be detected, s<strong>in</strong>ce analyz<strong>in</strong>g the system responseonly at the sampl<strong>in</strong>g <strong>in</strong>stants may be mislead<strong>in</strong>g.◦3.5 SummaryIn this chapter, we have considered both cont<strong>in</strong>uous <strong>and</strong> discrete-timescalar systems <strong>in</strong> unity feedback control loops. For these systems, we havepresented <strong>in</strong>tegral relations on the frequency response of the sensitivity<strong>and</strong> complementary sensitivity functions. These <strong>in</strong>tegrals — of the Bode<strong>and</strong> Poisson type — follow from applications of Cauchy’s <strong>in</strong>tegral relations,<strong>and</strong> reveal constra<strong>in</strong>ts on the closed-loop system imposed by ORHPopen-loop poles <strong>and</strong> zeros. One can use these constra<strong>in</strong>ts to study fundamentallimitations on the achievable performance of the closed loop. Forexample, it is possible to show that, if the open-loop system is nonm<strong>in</strong>imumphase, then requir<strong>in</strong>g |S(jω)| to be small over a range of frequencieswill necessarily lead to a large sensitivity peak outside that range. Moreover,the relations allow one to quantify this peak <strong>in</strong> terms of parametersof the open-loop plant. Thus, it is possible to use the results presentedhere to establish bounds on the achievable frequency responses for S <strong>and</strong>T, which hold for all possible controller designs.


84 3. SISO <strong>Control</strong>Notes <strong>and</strong> ReferencesBode Integral Formulae§3.1.1 is taken ma<strong>in</strong>ly from Bode (1945); §3.1.2 is based on results of Horowitz(1963), Freudenberg <strong>and</strong> Looze 1985; 1987, Middleton <strong>and</strong> Goodw<strong>in</strong> (1990) <strong>and</strong>Middleton (1991). §3.1.3 follows Freudenberg <strong>and</strong> Looze (1988), but it is also possibleto obta<strong>in</strong> similar results assum<strong>in</strong>g functional bounds, as was done <strong>in</strong> Middleton(1991).The Water-Bed EffectThis result was first discussed by Francis <strong>and</strong> Zames (1984), <strong>and</strong> then revisited byFreudenberg <strong>and</strong> Looze 1987 <strong>and</strong> Doyle et al. (1992). The result presented here isthe one given <strong>in</strong> Francis <strong>and</strong> Zames (1984).Poisson Integral FormulaeThis section is largely taken from Freudenberg <strong>and</strong> Looze 1985; 1988.For an <strong>in</strong>terest<strong>in</strong>g discussion on the complementarity of S <strong>and</strong> T see Kwakernaak(1995).Discrete Systems§3.4 is based on Sung <strong>and</strong> Hara (1988). A unification of both cont<strong>in</strong>uous <strong>and</strong>discrete-time results has been made <strong>in</strong> Middleton (1991), where the Bode <strong>in</strong>tegralfor T is also derived.


4MIMO <strong>Control</strong>This chapter <strong>in</strong>vestigates sensitivity limitations <strong>in</strong> multivariable l<strong>in</strong>ear control.There are different ways of extend<strong>in</strong>g the scalar results to a multivariablesett<strong>in</strong>g. We follow here two approaches, namely, one that considers<strong>in</strong>tegral constra<strong>in</strong>ts on the s<strong>in</strong>gular values of the sensitivity functions, <strong>and</strong> asecond that develops <strong>in</strong>tegral constra<strong>in</strong>ts on sensitivity vectors. These approachescomplement each other, <strong>in</strong> the sense that they f<strong>in</strong>d application<strong>in</strong> different problems, <strong>and</strong> hence both are needed to obta<strong>in</strong> a general viewof multivariable design limitations imposed by ORHP zeros <strong>and</strong> poles. Inorder to avoid repetition, we use the first approach to derive the multivariableversion of Bode’s <strong>in</strong>tegral theorems, whilst the second approachis taken to obta<strong>in</strong> the multivariable extension of the Poisson <strong>in</strong>tegrals. Bothapproaches emphasize the multivariable aspects of the problem by tak<strong>in</strong>g<strong>in</strong>to account, <strong>in</strong> addition to location, the directions of zeros <strong>and</strong> poles.4.1 Interpolation Constra<strong>in</strong>tsConsider the unity feedback configuration of Figure 4.1, where the openloopsystem, L, is a square (i.e., n × n), proper transfer matrix. As <strong>in</strong> Chapter3, let the sensitivity <strong>and</strong> complementary sensitivity functions be givenbyS(s) = [I + L(s)] −1 , <strong>and</strong> T(s) = L(s) [I + L(s)] −1 . (4.1)


86 4. MIMO <strong>Control</strong>r + ey❜ ✲ ✐ ✲ L(s) ✲ ❜− ✻FIGURE 4.1. Feedback control system.We will assume that the open-loop system is formed by the cascade ofthe plant, G, <strong>and</strong> the controller, K, i.e.,L = GK .The sensitivities <strong>in</strong> (4.1) are def<strong>in</strong>ed at the plant output. A similar pair ofsensitivities can be def<strong>in</strong>ed at the plant <strong>in</strong>put. In this chapter we will focuson the sensitivities given <strong>in</strong> (4.1), but we po<strong>in</strong>t out that parallel results canbe derived for the sensitivities def<strong>in</strong>ed at the plant <strong>in</strong>put.In the sequel, we will impose the follow<strong>in</strong>g assumption, which precludesthe possibility of hidden pole-zero cancelations <strong>in</strong> the open-loopsystem. 1Assumption 4.1. The sets of frequency locations of CRHP zeros <strong>and</strong> polesof the open-loop system L are disjo<strong>in</strong>t.◦Let the plant <strong>and</strong> controller have the follow<strong>in</strong>g coprime factorizationsover the r<strong>in</strong>g of proper <strong>and</strong> stable transfer matrices:G = ˜D −1G ÑG = N G D −1G ,K = ˜D −1K ÑK = N K D −1K . (4.2)It then follows from Chapter 2 that q is a zero of G if <strong>and</strong> only if there existvectors Ψ i , Ψ o ∈ ¢ n such that Ñ G (q)Ψ i = 0 <strong>and</strong> Ψ ∗ oN G (q) = 0. Ψ i <strong>and</strong>Ψ o are the <strong>in</strong>put <strong>and</strong> output zero directions associated with q, <strong>and</strong> can benormalized to be unitary vectors, i.e., such that Ψ ∗ i Ψ i = 1 <strong>and</strong> Ψ ∗ oΨ o = 1.Similarly, p is a pole of G if <strong>and</strong> only if there exist vectors Φ i , Φ o ∈ ¢ n suchthat ˜D G (p)Φ i = 0 <strong>and</strong> Φ ∗ oD G (p) = 0; Φ i <strong>and</strong> Φ o are the <strong>in</strong>put <strong>and</strong> outputpole directions associated with p. 2 Recall that a zero (pole) direction is saidto be canonical if it has only one nonzero component.Us<strong>in</strong>g the factorizations of plant <strong>and</strong> controller given <strong>in</strong> (4.2), the sensitivity<strong>and</strong> complementary sensitivity functions (4.1) can be expressed asS = D K ( ˜D G D K + Ñ G N K ) −1 ˜D G ,T = N G (Ñ K N G + ˜D K D G ) −1 Ñ K .(4.3)Note that the zeros of S <strong>and</strong> T are easily identified from (4.3). This is formalized<strong>in</strong> the follow<strong>in</strong>g lemma.1 A relaxed form of this requirement will be considered <strong>in</strong> §4.3.5.2 Note that if G is <strong>in</strong>vertible, then p is a zero of G −1 .


4.2 Bode Integral Formulae 87Lemma 4.1.1 (Interpolation Constra<strong>in</strong>ts). Under Assumption 4.1, the sensitivity<strong>and</strong> complementary sensitivity functions must satisfy the follow<strong>in</strong>gconditions.(i) If p ∈ ¢ +is a pole of G with <strong>in</strong>put direction Φ ∈ ¢ n , thenS(p)Φ = 0, <strong>and</strong> T(p)Φ = Φ. (4.4)(ii) If q ∈ ¢ +is a zero of G with output direction Ψ ∈ ¢ n , thenΨ ∗ S(q) = Ψ ∗ , <strong>and</strong> Ψ ∗ T(q) = 0. (4.5)(iii) If p ∈ ¢ +is a pole of K with output direction Φ ∈ ¢ n , thenΦ ∗ S(p) = 0, <strong>and</strong> Φ ∗ T(p) = Φ ∗ .(iv) If q ∈ ¢ +is a zero of K with <strong>in</strong>put direction Ψ ∈ ¢ n , thenS(q)Ψ = Ψ, <strong>and</strong> T(q)Ψ = 0.Proof. Note first that Assumption 4.1 guarantees that there is no pole-zerocancelation <strong>in</strong> S <strong>and</strong> T. Next, let p ¢ ∈ +be a pole of G with <strong>in</strong>put directionΦ ¢ ∈ n . Then ˜D G (p)Φ = 0 <strong>and</strong> (4.4) thus holds from (4.3). The proof ofthe other cases if similar.□Lemma 4.1.1 is the multivariable generalization of Lemma 3.1.3 <strong>in</strong> Chapter3 <strong>and</strong>, as was the case with the latter lemma, it translates the open-loopcharacteristics of <strong>in</strong>stability <strong>and</strong> nonm<strong>in</strong>imum phaseness <strong>in</strong>to propertiesthat the functions S <strong>and</strong> T must satisfy <strong>in</strong> the CRHP. In the multivariableversion, these properties also <strong>in</strong>volve directions of zeros <strong>and</strong> poles. Notethat the constra<strong>in</strong>ts given <strong>in</strong> (4.4) <strong>and</strong> (4.5) depend only on poles <strong>and</strong> zerosof the plant <strong>and</strong> hence hold irrespective of the controller design.In §4.2 <strong>and</strong> §4.3 we will use Lemma 4.1.1 to derive <strong>in</strong>tegral constra<strong>in</strong>tsof the Bode <strong>and</strong> Poisson type, respectively.4.2 Bode Integral FormulaeIn this section we will derive Bode <strong>in</strong>tegral formulae for the s<strong>in</strong>gular valuesof the sensitivity function. An important obstacle <strong>in</strong> extend<strong>in</strong>g thescalar <strong>in</strong>tegral constra<strong>in</strong>ts to the multivariable case lies <strong>in</strong> the fact that thes<strong>in</strong>gular values of an analytic transfer matrix are not themselves analytic.A notable step towards the multivariable results was taken by Boyd <strong>and</strong>Desoer (1985), who established <strong>in</strong>equality versions of the Bode <strong>and</strong> Poisson<strong>in</strong>tegral formulae. These <strong>in</strong>equalities use the fact that the logarithm ofthe largest s<strong>in</strong>gular value of an analytic transfer matrix is a subharmonic


88 4. MIMO <strong>Control</strong>function. 3 By restrict<strong>in</strong>g the class of systems under consideration — tothose for which the multiplicity of the s<strong>in</strong>gular values of S is constant <strong>in</strong>the CRHP — Chen (1995) established equality versions of the Bode <strong>and</strong>Poisson <strong>in</strong>tegral formulae. These relations were derived us<strong>in</strong>g Green’s formulafor functions of two real variables (see §A.5.1 <strong>in</strong> Appendix A), whichholds for arbitrary functions hav<strong>in</strong>g cont<strong>in</strong>uous second derivatives over aregion. Bode’s <strong>in</strong>tegral formulae for the s<strong>in</strong>gular values of S will be given<strong>in</strong> §4.2.2, after some prelim<strong>in</strong>ary def<strong>in</strong>itions <strong>and</strong> results.4.2.1 Prelim<strong>in</strong>ariesGiven a unitary vector Φ ∈ ¢ n , we call the one-dimensional subspacespanned by Φ the direction of Φ. The angle between the directions of twounitary vectors Φ 1 <strong>and</strong> Φ 2 is def<strong>in</strong>ed to be the pr<strong>in</strong>cipal angle (Björck <strong>and</strong>Golub, 1973) between the two correspond<strong>in</strong>g subspaces spanned by thetwo vectors. This angle, denoted by ∠(Φ 1 , Φ 2 ), is given bycos∠(Φ 1 , Φ 2 ) |Φ ∗ 1Φ 2 | .As discussed <strong>in</strong> Björck <strong>and</strong> Golub (1973) <strong>and</strong> Golub <strong>and</strong> Van Loan (1983),the pr<strong>in</strong>cipal angle between two subspaces serves as a distance measure,<strong>and</strong> it quantifies how well the two subspaces are aligned.Our aim <strong>in</strong> the follow<strong>in</strong>g subsection is to apply Green’s formula to thelogarithm of the s<strong>in</strong>gular values of S. It is thus necessary to extract fromS all its ORHP zeros. It is well known that a nonm<strong>in</strong>imum phase transferfunction H admits a factorization that consists of a m<strong>in</strong>imum phase part<strong>and</strong> an all-pass factor. 4 This factorization can be obta<strong>in</strong>ed by the follow<strong>in</strong>gsequential procedure, which amounts to repeated use of a formula developed<strong>in</strong> Wall, Jr. et al. (1980). Let q i ∈ ¢ + , i = 1, · · · , n q , be the ORHPzeros of H. Then, H can be factorized as H = H 1 B 1 , whereB 1 (s) = I − 2 Re q 1(s + q 1 ) Φ 1Φ ∗ 1 ,<strong>and</strong> Φ 1 is the <strong>in</strong>put direction of q 1 . Note that after this factorization, q 1is no longer a zero of the transformed transfer matrix H 1 . This procedurecan be cont<strong>in</strong>ued to obta<strong>in</strong> H i−1 = H i B i , where B i (s) = I − 2 Re q i /(s +q i ) Φ i Φ ∗ i , <strong>and</strong> Φ i, satisfy<strong>in</strong>g Φ ∗ i Φ i = 1, is obta<strong>in</strong>ed as if it were the directionof q i but is computed from H i−1 rather than H. As such, Φ i need notco<strong>in</strong>cide with the <strong>in</strong>put direction of q i ; however, it is easy to see that it is a3 Recall (cf. §A.3.1 <strong>in</strong> Appendix A) that a cont<strong>in</strong>uous function f is subharmonic if its Laplacian∇ 2 f ∂ 2 f/∂σ 2 + ∂ 2 f/∂ω 2 is nonnegative.4 An all-pass transfer function is a stable transfer function such that the magnitude of itslargest s<strong>in</strong>gular value equals one at all po<strong>in</strong>ts on the imag<strong>in</strong>ary axis.


4.2 Bode Integral Formulae 89l<strong>in</strong>ear comb<strong>in</strong>ation of the zero directions. By repeat<strong>in</strong>g this procedure, weobta<strong>in</strong> a factorization of the formn q∏H = ˜H B i ,i=1where ˜H is m<strong>in</strong>imum phase <strong>and</strong> B i , obta<strong>in</strong>ed as described above, is anall-pass factor correspond<strong>in</strong>g to q i .Let p i , i = 1, . . . , n p , be the poles of the open-loop system L <strong>in</strong> theORHP, repeated accord<strong>in</strong>g to their geometric multiplicities. 5 Under Assumption4.1, it follows from Lemma 4.1.1 <strong>and</strong> the factorization describedabove that S can be expressed asn p∏S = ˜S B i , (4.6)i=1where ˜S is m<strong>in</strong>imum phase <strong>and</strong> B i is the all-pass factor correspond<strong>in</strong>g top i , given byB i (s) = I − 2 Re p i(s + p i ) Ψ iΨ ∗ i . (4.7)We note that the all-pass factors <strong>in</strong> the factorization (4.6) can be constructed<strong>in</strong>dependently of the controller if the controller is stable. In particular, supposethat Assumption 4.1 holds <strong>and</strong> that the plant <strong>and</strong> controller are expressedas <strong>in</strong> (4.2). Assume further that the controller denom<strong>in</strong>ator D K has noORHP zeros. Then, it is clear from the expression for S <strong>in</strong> (4.3) that theall-pass factors of the factorization (4.6) can be computed from ˜D G alone.This will be relevant <strong>in</strong> the follow<strong>in</strong>g section when the <strong>in</strong>terest is to obta<strong>in</strong>performance limitations that can be computed before select<strong>in</strong>g anyparticular controller design.In the rema<strong>in</strong><strong>in</strong>g of §4.2, we make the follow<strong>in</strong>g assumption.Assumption 4.2.(i) The closed-loop system of Figure 4.1 is stable.(ii)lim sup R σ(L(s)) = 0.R→∞s∈ +|s|≥R(iii) The s<strong>in</strong>gular values of ˜S <strong>in</strong> (4.6), i.e., σ i ( ˜S(s)), i = 1, · · · , n, havecont<strong>in</strong>uous second order derivatives for all s ∈ ¢ +.◦5 Recall that the geometric multiplicity of a pole of H = ˜D −1 Ñ is equal to the deficiency<strong>in</strong> rank of ˜D at the frequency of the pole.


90 4. MIMO <strong>Control</strong>Assumption 4.2-(i) is the same as for the SISO case, <strong>and</strong> amounts tothe analyticity of S <strong>in</strong> the CRHP. Assumption 4.2-(ii) states that the largests<strong>in</strong>gular value of the open-loop transfer function has a roll-off rate of morethan one pole-zero excess. It will hold, for example, if each entry of L hasrelative degree larger than one. 6 Assumption 4.2-(iii) is necessary for theuse of Green’s theorem. A sufficient condition for this assumption to holdis that the multiplicity of σ i ( ˜S(s)), i = 1, · · · , n, be constant for all s ∈¢ +(Chen, 1995) — which <strong>in</strong> fact guarantees that σ i ( ˜S) has cont<strong>in</strong>uousderivatives of all orders <strong>in</strong> the CRHP.It is important to note that, even if S is analytic <strong>in</strong> the CRHP <strong>and</strong> thes<strong>in</strong>gular values of ˜S have cont<strong>in</strong>uous derivatives of all orders <strong>in</strong> the CRHP,the function σ i ( ˜S(s)) is not, <strong>in</strong> general, a harmonic function (Boyd <strong>and</strong>Desoer, 1985; Freudenberg <strong>and</strong> Looze, 1987). This precludes the use ofCauchy’s <strong>and</strong> Poisson’s formulae, which apply to analytic <strong>and</strong> harmonicfunctions.We end this subsection with a technical lemma that will prove useful <strong>in</strong>the subsequent analysis.Lemma 4.2.1. For any s ∈ ¢ +such that s ≠ q i , we have that∣ σ(B −1i(s)) =s + q i ∣∣∣∣ , (4.8)s − q iwhere B i is given by (4.7).σ j (B −1i(s)) = 1 , j = 2, · · · , n , (4.9)Proof. Us<strong>in</strong>g the matrix <strong>in</strong>version lemma (Golub <strong>and</strong> Van Loan, 1983), wehave, for s ≠ q i ,(B −1i(s) = I − 2 Re q ) −1i(s + q i ) Φ iΦ ∗ i= I + 2 Re q i(s + q i ) Φ iΦ ∗ i= I + 2 Re q is − q iΦ i Φ ∗ i .11 − 2 Re q i Φ ∗ i Φ i/(s + q i )Consider next the matrix Υ i , such that [Φ i , Υ i ] form an orthonormal basisof ¢ n . Then I = Φ i Φ ∗ i + Υ iΥ ∗ i , <strong>and</strong>B −1i(s) = Φ i Φ ∗ i + Υ i Υ ∗ i + 2 Re q iΦ i Φ ∗ is − q⎡ ⎤ i= [ ]s + q i[ ]Φ i Υ i⎣0s − q i⎦ Φ∗iΥ0 I∗ .i6 This follows from the <strong>in</strong>equality σ(A) ≤ n max 1≤i,j≤n |a ij |, which holds for any A =[a ij ] ∈ n×n (see e.g., Golub <strong>and</strong> Van Loan, 1983).


4.2 Bode Integral Formulae 91Thus<strong>and</strong> the result follows.σ j (B −1i⎛⎡⎤⎞s + q i(s)) = σ j⎝⎣0s − q i⎦⎠ ,0 I□4.2.2 Bode Integrals for SIn this section we will state the Bode <strong>in</strong>tegral theorem for the logarithm ofthe s<strong>in</strong>gular values of the sensitivity function. The proof <strong>in</strong>volves the useof Green’s formula <strong>and</strong> a careful computation of contour <strong>in</strong>tegrals. S<strong>in</strong>ceit is rather long <strong>and</strong> tedious, we defer it to §B.1 of Appendix B.The ma<strong>in</strong> result of the section is the follow<strong>in</strong>g.Theorem 4.2.2 (Bode Integral for S). Let S be factored as <strong>in</strong> (4.6). Then,under Assumption 4.2,∫ ∞log σ j (S(jω)) dω = F j + K j , (4.10)0whereF j = 1 ∫∫σ ∇ 2 log σ j ( ˜S(σ + jω)) dσ dω ,2<strong>and</strong>+∫ π/2K j = limR→∞ −π/2(∏npR log σ ji=1B −1i(Re jθ ))cos θ dθ .Proof. The proof follows by apply<strong>in</strong>g Green’s formula (A.30) <strong>in</strong> Appendix Afor Ω equal to the typical semicircular doma<strong>in</strong> of radius R <strong>in</strong>to the ORHP,<strong>and</strong> the choices of functions f(s) = log σ j ( ˜S(s)) <strong>and</strong> g(s) = log(|η + s|/|η −s|), where η > R. The details are given <strong>in</strong> §B.1 of Appendix B. □Note that F j <strong>and</strong> K j are functions of the particular factorization of Sgiven <strong>in</strong> (4.6). As discussed before, this factorization will, <strong>in</strong> general, dependon the choice of controller. It is possible, however, to derive an <strong>in</strong>equalitythat is <strong>in</strong>dependent of the controller as long as the controller isstable. This is stated <strong>in</strong> the follow<strong>in</strong>g corollary.Corollary 4.2.3. Let the plant be given by G = ˜D −1G ÑG, <strong>and</strong> assume thatthe controller K is stable. Let S be factorized as <strong>in</strong> (4.6), where the all-passfactors are computed from ˜D G alone. Then, under Assumption 4.2,∫ ∞ ∫ (π/2np)∏log σ(S(jω)) dω ≥ lim R log σ B −1i(Re jθ ) cos θ dθ .0R→∞ −π/2i=1(4.11)


92 4. MIMO <strong>Control</strong>Proof. If the controller is stable, then it is possible to factorize S as <strong>in</strong> (4.6),where the all-pass factors are computed from ˜D G alone. Consider next(4.10) for j = 1, i.e., focus<strong>in</strong>g on the largest s<strong>in</strong>gular value of S. Then thefunction σ( ˜S) is subharmonic ¢ <strong>in</strong> +(Boyd <strong>and</strong> Desoer, 1985; Freudenberg<strong>and</strong> Looze, 1987). This implies that ∇ 2 log σ( ˜S) ≥ 0 <strong>and</strong> hence F 1 ≥ 0.Inequality (4.11) then follows.□The <strong>in</strong>tegral constra<strong>in</strong>t given <strong>in</strong> (4.10) is an extension of Bode’s <strong>in</strong>tegralgiven previously for scalar systems <strong>in</strong> Theorem 3.1.4 of Chapter 3. Notethat for SISO systems, the s<strong>in</strong>gular values, σ j ( ˜S), all collapse <strong>in</strong>to | ˜S|. S<strong>in</strong>ce,˜S is analytic <strong>in</strong> the CRHP it follows that log | ˜S| is harmonic <strong>in</strong> the CRHP.Hence ∇ 2 log | ˜S| = 0 <strong>in</strong> ¢ +, giv<strong>in</strong>g F j = 0 <strong>in</strong> (4.10). We next focus on K j<strong>in</strong> (4.10). Note that for SISO systems, the all-pass factor <strong>in</strong> (4.7) have theform B i (s) = (s − p i )/(s + p i ). Us<strong>in</strong>g the uniform convergence theorem(Lev<strong>in</strong>son <strong>and</strong> Redheffer, 1970, p. 335), <strong>and</strong> Example A.8.7 of Appendix A,K j <strong>in</strong> (4.10) can be written as∑K j =n p ∫ π/2i=1−π/2n p ∫ π/2∑=i=1−π/2n p ∫ π/2∑=i=1−π/2n p∑∫ π/2= 2 Re p ii=1n p∑= π Re p i .i=1∣ ∣ ∣∣∣lim R log Re jθ + p i ∣∣∣R→∞ Re jθ cos θ dθ− p i[Re lim R log(1+ p ie −jθR→∞RRe(p i e −jθ + p i e −jθ ) cos θ dθ−π/2cos 2 θ dθ)−R log(1− p ie −jθR)]cos θ dθHence, for scalar systems, (4.10) reduces to (3.13) (particularized to thecase S(∞) = 1).We next discuss the terms F j <strong>and</strong> K j <strong>in</strong> (4.10) for the general multivariablecase. It is easy to see from the proof of Theorem 4.2.2 (see §B.1 <strong>in</strong>Appendix B) that F j will be present for both stable <strong>and</strong> unstable openloopsystems. In particular, for j = 1, i.e., focus<strong>in</strong>g on the largest s<strong>in</strong>gularvalue of S, we have seen <strong>in</strong> the proof of Corollary 4.2.3 that F 1 ≥ 0.We thus conclude that, even for open-loop stable systems (satisfy<strong>in</strong>g Assumption4.2), the Bode <strong>in</strong>tegral of the largest s<strong>in</strong>gular value of S has anonnegative value.It is also clear from the proof of Theorem 4.2.2 that the terms K j completelyquantify the effect of open-loop unstable systems on the Bode <strong>in</strong>tegralsfor S. It is easy to see that K j ≥ 0. Indeed, this follows from the


4.2 Bode Integral Formulae 93<strong>in</strong>equality( np)∏σ j B −1i(Re jθ )i=1n p∏≥i=1σ ( B −1i(Re jθ ) ) ,<strong>and</strong> the fact that the RHS above equals 1 by Lemma 4.2.1. Thus, open-loopunstable poles contribute an additional nonnegative term, which has thepotential for additional limitations upon sensitivity properties.In order to obta<strong>in</strong> a better underst<strong>and</strong><strong>in</strong>g of the multivariable natureof the result <strong>in</strong> Theorem 4.2.2, we will focus on some particular cases, forwhich the term K j has a simpler explicit expression <strong>and</strong> bound. It is <strong>in</strong>structiveto first exam<strong>in</strong>e two extreme cases. The first case corresponds toΨ ∗ i Ψ j = 0 for all i, j = 1, · · · , n p , i ≠ j, <strong>and</strong> the second case correspondsto Ψ 1 = Ψ 2 = · · · = Ψ np . For simplicity, we assume that n p ≤ n. We thenhave the follow<strong>in</strong>g result.Proposition 4.2.4. Let S be factorized as <strong>in</strong> (4.6) <strong>and</strong> assume, without lossof generality, that Re p 1 ≥ · · · ≥ Re p np . Also, let n p ≤ n. Then, underAssumption 4.2,(i) if Ψ ∗ i Ψ j = 0 for all i, j = 1, · · · , n p , i ≠ j, we have that∫ ∞0log σ j (S(jω)) dω = π Re p j + F j , j = 1, · · · , n p , (4.12)<strong>and</strong>∫ ∞0log σ j (S(jω)) dω = F j , j = n p + 1, · · · , n ; (4.13)(ii) if Ψ i = Ψ for all i = 1, · · · , n p , we have that∫ ∞0n p∑log σ(S(jω)) dω = π Re p i + F 1 , (4.14)i=1<strong>and</strong> ∫ ∞0log σ j (S(jω)) dω = F j , j = 2, · · · , n . (4.15)Proof. In case (i), it is easy to show thatn∏ pB −1i=1n p∑i(s) = I +i=12 Re p is − p iΨ i Ψ ∗ i .Consider next the matrix Υ, such that [Ψ 1 , · · · , Ψ np , Υ] form an orthonormalbasis of ¢ n . Then I = ∑ n pi=1 Ψ iΨ ∗ i + ΥΥ∗ , <strong>and</strong> by a similar argument


94 4. MIMO <strong>Control</strong>to that used <strong>in</strong> the proof of Lemma 4.2.1, it follows that⎛⎡⎤⎞s + p 1(∏np)s − p 1 .σ j B −1.. i(s) = σ j .s + pi=1⎜⎢np⎥⎟⎝⎣s − p np⎦⎠I n−npIt is then clear from (4.10) <strong>and</strong> the above relation that (4.13) holds. As for(4.12), a straightforward manipulation ¢ shows that for s ∈ +<strong>and</strong> |s| ≥ Rwith R be<strong>in</strong>g sufficiently large, the <strong>in</strong>equality∣ ∣ s + p 1 ∣∣∣ ∣ ≥s + p 2 ∣∣∣s − p 1∣s − p 2holds if Re p 1 ≥ Re p 2 , as was assumed. Then, as R → ∞( np)∣∏σ j B −1 s + pi(s) =j ∣∣∣∣ , j = 1, · · · , n p .s − p ji=1This gives K j <strong>in</strong> (4.10) equal to π Re p j , thus show<strong>in</strong>g (4.12).For case (ii), one can construct a matrix Υ whose columns form an orthonormalbasis of n together with Ψ. This leads ¢ to the expression⎡n p⎤n∏ pB −1i(s) = [ Ψ Υ ] ∏ s + p⎢i[ ]⎥ Ψ∗⎣ s − p i ⎦i=1Υ ∗ ,i=1I n−1<strong>and</strong> the rest of the proof is similar to that of case (i).Proposition 4.2.4 suggests that the limitations imposed by open-loopunstable poles <strong>in</strong> multivariable systems are related not only to the location<strong>in</strong> frequency, but also to the directions of poles (or rather a l<strong>in</strong>ear comb<strong>in</strong>ationof such directions) <strong>and</strong> further to the relative geometric configuration ofthese directions. Case (i) corresponds to the situation where the pole directionsare mutually orthonormal, for which the <strong>in</strong>tegral perta<strong>in</strong><strong>in</strong>g to eachs<strong>in</strong>gular value is related solely to one unstable pole (with a correspond<strong>in</strong>gdistance to the imag<strong>in</strong>ary axis), as if each “channel” of the system is decoupledfrom the others. Case (ii) corresponds to the situation where allthe pole directions are parallel, for which the unstable poles affect only the<strong>in</strong>tegral of the largest s<strong>in</strong>gular value, as if the channel correspond<strong>in</strong>g tothis s<strong>in</strong>gular value conta<strong>in</strong>s all unstable poles.The follow<strong>in</strong>g result — stated without proof — specializes Theorem 4.2.2to the case of two open-loop unstable poles, <strong>and</strong> it further shows that therelative geometry of the directions of these poles is <strong>in</strong>deed crucial <strong>in</strong> thestudy of sensitivity limitations.□


4.2 Bode Integral Formulae 95Theorem 4.2.5 (Chen, 1995). Let n p = 2 <strong>and</strong> let the conditions of Theorem4.2.2 hold. Then∫ ∞log σ(S(jω)) dω = a(1, 2) + F 1 , (4.16)0∫ ∞log σ 2 (S(jω)) dω = b(1, 2) + F 2 , (4.17)0<strong>and</strong> ∫ ∞0log σ j (S(jω)) dω = F j , j = 3, · · · , n , (4.18)wherea(i, j) π (√)Re(p i +p j )+ [Re(p i −p j )]22 +4 Re p i Re p j cos 2 ∠(Ψ i , Ψ j ) ,b(i, j) π (√)Re(p i +p j )− [Re(p i −p j )]22 +4 Re p i Re p j cos 2 ∠(Ψ i , Ψ j ) .The above result fully characterizes the limitation imposed by a pairof open-loop unstable poles on the sensitivity reduction properties. Thislimitation depends, not only on the distances of the poles to the imag<strong>in</strong>aryaxis, but also on the pr<strong>in</strong>cipal angle between the pole directions.Us<strong>in</strong>g similar arguments as <strong>in</strong> the proof of Theorem 4.2.5, Chen (1995)obta<strong>in</strong>ed the follow<strong>in</strong>g lower bounds for the Bode <strong>in</strong>tegral of the largests<strong>in</strong>gular value of the sensitivity function.Corollary 4.2.6 (Chen, 1995). Let the conditions of Theorem 4.2.2 hold.Then, for any Φ ∈ ¢ n satisfy<strong>in</strong>g Φ ∗ Φ = 1, we have∫ ∞0n p∑log σ(S(jω)) dω ≥ π Re p i cos 2 ∠(Φ, Ψ i ) + F 1 , (4.19)i=1<strong>and</strong>, <strong>in</strong> particular, for any j = 1, · · · , n p ,∫ ∞0n p∑log σ(S(jω)) dω ≥ π Re p i cos 2 ∠(Ψ i , Ψ j ) + F 1 . (4.20)i=1From Theorem 4.2.2 <strong>and</strong> Corollary 4.2.6, it can be <strong>in</strong>ferred that there willexist a frequency range over which the largest s<strong>in</strong>gular value of the sensitivityfunction exceeds one if it is to be kept below one at other frequencies.We will further see <strong>in</strong> §4.2.3 below that, <strong>in</strong> the presence of b<strong>and</strong>widthconstra<strong>in</strong>ts, this will result <strong>in</strong> a sensitivity trade-off <strong>in</strong> different frequencyranges. The above results also show unique features of multivariable feedbacksystems. First, ow<strong>in</strong>g to the fact that log σ( ˜S(jω)) is a subharmonic◦◦


96 4. MIMO <strong>Control</strong>function <strong>in</strong> the CRHP, the results show (via the terms F j ) that design limitationsdue to b<strong>and</strong>width constra<strong>in</strong>ts are, <strong>in</strong> a sense, more str<strong>in</strong>gent than<strong>in</strong> scalar systems. 7 Moreover, how str<strong>in</strong>gent these limitations are dependson the Laplacian of log σ( ˜S(jω)). The second multivariable feature highlightedby the results, is that the additional cost associated with open-loopunstable poles (captured <strong>in</strong> the terms K j ) is a function not only of the positionof the poles but also of the pole directions. Moreover, Proposition 4.2.4<strong>and</strong> Theorem 4.2.5 show that the relative geometry of the pole directionsis also important.A multivariable version of the Poisson <strong>in</strong>tegral for the largest s<strong>in</strong>gularvalue of the sensitivity function can be used to study the effect of openloopnonm<strong>in</strong>imum phase zeros upon sensitivity limitations. This resultwas given <strong>in</strong> Chen (1995), who developed an extension of the Poisson <strong>in</strong>tegralformula for real functions f : ¢ → ¡ with cont<strong>in</strong>uous derivativesof all orders <strong>in</strong> ¢ +, but that are not necessarily harmonic functions. Asexpected, this new formula has an additional term <strong>in</strong>volv<strong>in</strong>g the Laplacianof f. Accord<strong>in</strong>gly, the Poisson <strong>in</strong>tegral theorem for the largest s<strong>in</strong>gularvalue of the sensitivity function displays a term depend<strong>in</strong>g on the Laplacianof log σ( ˜S(jω)). This Poisson <strong>in</strong>tegral also shows that the sensitivityreduction ability of the system may be severely limited if the system hasboth open-loop unstable poles <strong>and</strong> nonm<strong>in</strong>imum phase zeros, especiallywhen these poles <strong>and</strong> zeros are close to each other <strong>and</strong> the pr<strong>in</strong>cipal anglebetween their directions are small. More discussion on this version of thePoisson <strong>in</strong>tegral for multivariable systems can be found <strong>in</strong> Chen (1995).As a f<strong>in</strong>al comment, we remark that us<strong>in</strong>g st<strong>and</strong>ard modifications tothe proof of Theorem 4.2.2, similar to the ones used <strong>in</strong> the proof of Theorem3.1.4 of Chapter 3, it is possible to show that open-loop poles onthe imag<strong>in</strong>ary axis do not contribute to the values of the Bode <strong>and</strong> Poisson<strong>in</strong>tegrals. Also, parallel results can be obta<strong>in</strong>ed for the complementarysensitivity function.4.2.3 Design InterpretationsIn this section we briefly show the utility of the <strong>in</strong>tegral constra<strong>in</strong>t (4.10)to study design trade-offs imposed on sensitivity reduction by open-loopunstable poles <strong>in</strong> conjunction with b<strong>and</strong>width constra<strong>in</strong>ts. These tradeoffsare similar to those obta<strong>in</strong>ed <strong>in</strong> §3.1.3 of Chapter 3 for scalar systems.For the purpose of illustration, we will consider limitations related to thelargest s<strong>in</strong>gular value of S, which we have seen <strong>in</strong> Chapter 2 has relevance<strong>in</strong> quantify<strong>in</strong>g feedback properties of the closed-loop system of Figure 4.1.7 This conclusion holds if the multivariable problem is approached from a s<strong>in</strong>gular valueperspective. Yet we will see <strong>in</strong> §4.3 that, when study<strong>in</strong>g <strong>in</strong>tegral constra<strong>in</strong>ts on sensitivityvectors, the trade-offs appear to be alleviated with respect to the scalar case.


4.2 Bode Integral Formulae 97Follow<strong>in</strong>g §3.1.3, we will assume that the largest s<strong>in</strong>gular value of theopen-loop system satisfies the boundσ(L(jω)) ≤δω k+1 ≤ ɛ < 1 , ∀ω ∈ [ω c, ∞] , (4.21)where δ > 0 <strong>and</strong> k > 0 are given constants. A b<strong>and</strong>width constra<strong>in</strong>t like(4.21) may be necessary to ensure robust stability aga<strong>in</strong>st uncerta<strong>in</strong>ty <strong>in</strong>the plant model (see Chapter 2). Note that a different upper bound of theform (4.21) may hold for each s<strong>in</strong>gular value.Next, assume that the feedback loop of Figure 4.1 has been designed toachieve the follow<strong>in</strong>g reduction specificationσ(S(jω)) ≤ α , ∀ω ∈ [−ω 1 , ω 1 ] , (4.22)where ω 1 < ω c <strong>and</strong> α > 0 is a given small constant. The follow<strong>in</strong>g resultshows that the reduction specification (4.22), together with the b<strong>and</strong>widthconstra<strong>in</strong>t (4.21), can be achieved only at the expense of sensitivity<strong>in</strong>crease over the range [ω 1 , ω c ].Corollary 4.2.7. Let the conditions of Theorem 4.2.2 hold. In addition,suppose that (4.21) <strong>and</strong> (4.22) are satisfied for some ω 1 <strong>and</strong> ω c such thatω 1 < ω c . Then,sup log σ(S(jω)) ≥ω∈(ω 1 ,ω c)[1F 1 + K 1 + ω 1 log 1 ]ω c −ω 1 α + ω c log(1−ɛ) .(4.23)Proof. The proof is similar to that of (3.21) <strong>in</strong> §3.1.3, <strong>and</strong> follows us<strong>in</strong>g(4.10) together with the bounds (4.21) <strong>and</strong> (4.22), <strong>and</strong> the observation thatfor ω ≥ ω c1σ(S(jω)) =σ(I + L(jω))1≤1 − σ(L(jω))1≤1 − δ/ω k+1 . □Similar to its scalar counterpart, given <strong>in</strong> (3.21) <strong>in</strong> Chapter 3, the bound(4.23) shows that any attempt to <strong>in</strong>crease the area of sensitivity reduction,by requir<strong>in</strong>g α to be small <strong>and</strong>/or ω 1 to be close to ω c , will necessarily result<strong>in</strong> a large sensitivity peak <strong>in</strong> the range (ω 1 , ω c ). Note that the constantK 1 — which is the multivariable version of the term π ∑ Re p i <strong>in</strong> (3.21) —can be precisely evaluated for some particular cases as shown <strong>in</strong> Proposition4.2.4 <strong>and</strong> Theorem 4.2.5. In the general case K 1 is nonnegative, thus


98 4. MIMO <strong>Control</strong>potentially <strong>in</strong>creas<strong>in</strong>g the sensitivity peak. The ma<strong>in</strong> multivariable featureappears <strong>in</strong> the term F 1 . As we have seen, the constant F 1 is also nonnegative,lead<strong>in</strong>g to an additional possibility of an <strong>in</strong>crease <strong>in</strong> the sensitivitypeak.We remark that similar trade-offs can be analyzed by means of the Poisson<strong>in</strong>tegral formula for the largest s<strong>in</strong>gular value of the sensitivity function(Chen, 1995).4.3 Poisson Integral FormulaeIn this section, we use the approach of Gómez <strong>and</strong> Goodw<strong>in</strong> (1995) toobta<strong>in</strong> <strong>in</strong>tegral constra<strong>in</strong>ts — of the Poisson type — on columns of the sensitivityfunction. These constra<strong>in</strong>ts are useful <strong>in</strong> the study of performancelimitations <strong>in</strong> multivariable feedback problems where structural features— such as diagonalization, triangularization, etc.— are of <strong>in</strong>terest. The resultuses the Poisson <strong>in</strong>tegral formula developed <strong>in</strong> §A.6.1 of Appendix A.Similar <strong>in</strong>tegral constra<strong>in</strong>ts can be obta<strong>in</strong>ed on rows of the complementarysensitivity function. The <strong>in</strong>terested reader is referred to Gómez <strong>and</strong> Goodw<strong>in</strong>(1995) for this latter analysis.4.3.1 Prelim<strong>in</strong>ariesIn this subsection we <strong>in</strong>troduce some prelim<strong>in</strong>ary notions. We denote byS ik , the element <strong>in</strong> the i-row <strong>and</strong> k-column of the n × n square transfermatrix S. If Ψ is a vector <strong>in</strong> ¢ n , we denote its elements by ψ i , i =1, . . . , n, i.e., Ψ = [ψ ∗ 1 , ψ∗ 2 , . . . , ψ∗ n] ∗ . Also, we <strong>in</strong>troduce the <strong>in</strong>dex set I Ψ {i ∈ : ψ i ≠ 0} as the set of <strong>in</strong>dices of the nonzero elements of Ψ.As <strong>in</strong> Chapter 3, given a set Z k = {s i , i = 1, . . . , n k } of complex numbers<strong>in</strong> the ORHP, we def<strong>in</strong>e its Blaschke product as the functionB k (s) =∏n ki=1s − s is + ¯s i.If Z k is empty, we def<strong>in</strong>e B k (s) = 1, ∀s.In the rema<strong>in</strong>der of §4.3, we make the follow<strong>in</strong>g assumption.Assumption 4.3.(i) The closed-loop system of Figure 4.1 is stable.(ii) The open-loop system L is a proper transfer matrix.Assumption 4.3-(i) is the same as for the Bode <strong>in</strong>tegral given <strong>in</strong> §4.2.Assumption 4.3-(ii) is necessary to restrict the behavior at <strong>in</strong>f<strong>in</strong>ity of the◦


4.3 Poisson Integral Formulae 99sensitivity function, <strong>and</strong> is more relaxed than Assumption 4.2-(ii) used toderive the Bode <strong>in</strong>tegral.Next, let Ψ ∈ ¢ n , Ψ ≠ 0, be the output direction of an ORHP zero ofthe plant G, as described <strong>in</strong> Lemma 4.1.1 (ii). Consider the vector functionΨ ∗ S : ¢ → ¢ n , whose n elements are proper, stable, scalar rationalfunctions. Pick one of these elements, sayρ k (s) n∑ψ ∗ i S ik (s) , (4.24)i=1where k is <strong>in</strong> I Ψ , <strong>and</strong> let B k be the Blaschke product of the zeros of ρ k <strong>in</strong>the ORHP. Then, the function ˜ρ k (s) B −1k(s)ρ k(s) is proper, stable, <strong>and</strong>m<strong>in</strong>imum phase, which implies that log ˜ρ k (s) is analytic <strong>in</strong> the ORHP <strong>and</strong>satisfies the conditions of the Poisson <strong>in</strong>tegral formula (see Theorem A.6.1<strong>in</strong> Appendix A). In the follow<strong>in</strong>g subsection we will use this <strong>in</strong>formationto derive Poisson <strong>in</strong>tegrals for S.4.3.2 Poisson Integrals for SIn the follow<strong>in</strong>g theorem, we translate the <strong>in</strong>terpolation constra<strong>in</strong>t given<strong>in</strong> (4.5) <strong>in</strong>to Poisson <strong>in</strong>tegrals on the columns of S.Theorem 4.3.1 (Poisson Integral for S). Let q = σ q + jω q , σ q > 0, bea zero of the plant G, <strong>and</strong> let Ψ ¢ ∈ n , Ψ ≠ 0, be its output direction, asdescribed <strong>in</strong> Lemma 4.1.1 (ii). Then, under Assumption 4.3, for each <strong>in</strong>dexk <strong>in</strong> I Ψ ,1π∫ ∞−∞n∑log∣ψ ∗ iψ ∗ i=1 kS ik (jω)∣σ qσ 2 q + (ω q − ω)dω = log |B−12 k(q)| , (4.25)where B k is the Blaschke product of the ORHP zeros of ρ k <strong>in</strong> (4.24).Proof. The proof is an application of the Poisson <strong>in</strong>tegral formula, as <strong>in</strong>Theorem 3.3.1 <strong>in</strong> Chapter 3, to the scalar elements ρ k of the vector functionΨ ∗ S : ¢ → ¢ n , which, under Assumption 4.3, are proper, stable, rationalfunctions.Let ρ k <strong>in</strong> (4.24), where k is <strong>in</strong> I Ψ , be one of these elements, <strong>and</strong> let B kbe the Blaschke product of the zeros of ρ k <strong>in</strong> the ORHP. As noted before,we can then apply the Poisson <strong>in</strong>tegral formula to the function log ˜ρ k (s),where ˜ρ k (s) B −1k(s)ρ k(s). Evaluat<strong>in</strong>g this <strong>in</strong>tegral at s = q, <strong>and</strong> us<strong>in</strong>gthe fact that | ˜ρ k (jω)| = |ρ k (jω)| yields∫ ∣ 1 ∞ n∑logψ ∗ σ ∣∣∣∣ nqi S ik (jω)π −∞ ∣∣ σ 2 q+(ω q −ω) 2 dω = log B −1k(q) ∑ψ ∗ i S ik (q)∣i=1i=1= log |B −1k (q)| + log |ψ∗ k| ,(4.26)


100 4. MIMO <strong>Control</strong>where the last step follows from the <strong>in</strong>terpolation condition Ψ ∗ S(q) = Ψ ∗given <strong>in</strong> (4.5) (notice that ψ k ≠ 0 by assumption). S<strong>in</strong>ce1π∫ ∞−∞σ qσ 2 q + (ω q − ω) 2 dω = 1 ,subtract<strong>in</strong>g log |ψ ∗ k| from both sides of (4.26) yields (4.25), thus complet<strong>in</strong>gthe proof.□Note that the result <strong>in</strong> Theorem 4.3.1 depends on the particular choiceof controller. This is due to the presence of the Blaschke product B k ofORHP zeros of ρ k on the RHS of (4.25). Indeed, ρ k <strong>in</strong> (4.24) is formedby comb<strong>in</strong><strong>in</strong>g entries of S, which clearly are functions of both plant <strong>and</strong>controller. However, the follow<strong>in</strong>g corollary establishes a constra<strong>in</strong>t thatholds for any controller, <strong>and</strong> can therefore be used to identify, a priori,design limitations imposed by plant characteristics.Corollary 4.3.2. Under the conditions of Theorem 4.3.1, the sensitivityfunction S satisfies, for each <strong>in</strong>dex k <strong>in</strong> I Ψ ,n∑ ψ ∗ iσ qlogS∣ik (jω)∣ σ 2 dω ≥ 0. (4.27)q + (ω q − ω)2∫ ∞−∞ψ ∗ i=1 kProof. The proof follows from (4.25), on not<strong>in</strong>g that |B −1k(s)| ≥ 1 at anypo<strong>in</strong>t s ¢ <strong>in</strong> + , <strong>and</strong> so log |B −1k(q)| ≥ 0. □Theorem 4.3.1 <strong>and</strong> Corollary 4.3.2 establish that, for each nonm<strong>in</strong>imumphase zero of the plant, q, there is a set of <strong>in</strong>tegral constra<strong>in</strong>ts that limitthe values of S on the jω-axis <strong>in</strong> an <strong>in</strong>tr<strong>in</strong>sically vectorial fashion. Observethat, depend<strong>in</strong>g on the number of nonzero elements of the output directionΨ associated with the zero of the plant — i.e., the number of nonzeroelements <strong>in</strong> the <strong>in</strong>dex set I Ψ — up to n <strong>in</strong>tegral constra<strong>in</strong>ts of the form(4.25) can be stated, each correspond<strong>in</strong>g to a column of S. Moreover, if νis the geometric multiplicity of the ORHP zero of the plant, i.e., the dimensionof the (left) null space of the matrix N(q), then up to ν <strong>in</strong>tegralconstra<strong>in</strong>ts of the form (4.25) arise for each column of S. S<strong>in</strong>ce the componentsof the null space are l<strong>in</strong>early <strong>in</strong>dependent, it is expected that each<strong>in</strong>tegral constra<strong>in</strong>t will give different <strong>in</strong>formation. Hence, it is reasonableto expect that, the greater the drop <strong>in</strong> rank caused by a particular zero, themore restrictive the constra<strong>in</strong>t becomes. 8Note that, if the zero direction happens to be canonical, i.e., the <strong>in</strong>dexset I Ψ has only one element, k say, then the constra<strong>in</strong>ts reduce to those8 If the geometric multiplicity of a zero is greater than one, then there is no unique way ofchoos<strong>in</strong>g a basis for the correspond<strong>in</strong>g null space. Each choice of basis, <strong>in</strong> pr<strong>in</strong>ciple, generatesalternative constra<strong>in</strong>ts. However, these constra<strong>in</strong>ts are <strong>in</strong>terrelated <strong>and</strong> do not provide<strong>in</strong>dependent <strong>in</strong>formation regard<strong>in</strong>g performance limitations.


4.3 Poisson Integral Formulae 101of the SISO case for the element S kk . This is seen more clearly from thefollow<strong>in</strong>g corollary.Corollary 4.3.3. Under the conditions of Theorem 4.3.1, the diagonal elements,S kk , of the sensitivity function satisfy, for each <strong>in</strong>dex k <strong>in</strong> I Ψ ,∫ ∞−∞∫ ∞σ qlog |S kk (jω)|−∞σ 2 q + (ω q − ω) 2 dω ≥logψ ∗ k S (4.28)kk(jω)∣∑ ni=1 ψ∗ i S σ q∣ik(jω) σ 2 q + (ω q − ω) 2 dω .Proof. The LHS of (4.27) can be alternatively written as∫ ∞−∞⎛⎞∫ ∞ ⎜n∑ ψ ∗ ilogS kk ⎝1 +S ik ⎟−∞ψ∣∗ i=1 k S ⎠kk∣i≠k∣ n∑ ∣∣∣∫σ ∞ ∣ ψ ∗ i S ikqlog |S kk |σ 2 q+(ω q −ω) 2 dω+ i=1log−∞ |ψ ∗ k S kk|σ qσ 2 q + (ω q − ω) 2 dω =σ qσ 2 q+(ω q −ω) 2 dω ,from which (4.28) follows.□As discussed before, if the zero direction is canonical, then the <strong>in</strong>dexset I Ψ has only one element, k say. In this particular case, Corollary 4.3.3gives,∫ ∞log |S kk (jω)|−∞σ qσ 2 q + (ω q − ω) 2 dω ≥ 0 ,which is the same constra<strong>in</strong>t that would hold for a scalar system hav<strong>in</strong>ga nonm<strong>in</strong>imum phase zero at s = q <strong>and</strong> a sensitivity function equal toS kk . Corollary 4.3.3 also shows that, if the zero direction is not canonical,i.e., the set I Ψ has more than one element, then additional degrees of freedomarise <strong>in</strong> multivariable systems, which can potentially be exploited toreduce the cost associated with nonm<strong>in</strong>imum phase zeros. This will beanalyzed further <strong>in</strong> the follow<strong>in</strong>g two sections.It is clear from the previous discussion that the results given above displaythe essential multivariable nature of the problem, <strong>in</strong> the sense that theconstra<strong>in</strong>ts stress the importance of directions of ORHP open-loop zeros<strong>and</strong> poles <strong>in</strong> connection with spatial — i.e., related to the structure of thetransfer matrix — sensitivity allocation. In the follow<strong>in</strong>g section we analyzedesign implications <strong>and</strong> trade-offs <strong>in</strong>duced by the <strong>in</strong>tegral constra<strong>in</strong>tsjust given.


102 4. MIMO <strong>Control</strong>4.3.3 Design InterpretationsAs <strong>in</strong> the case of scalar systems, Theorem 4.3.1 <strong>and</strong> Corollary 4.3.2 can beused to give <strong>in</strong>sights <strong>in</strong>to the frequency doma<strong>in</strong> trade-offs <strong>in</strong> sensitivity.A straightforward corollary of these results emphasizes the vectorial natureof the associated trade-offs. Let Ω 1 [0, ω 1 ] denote a given rangeof frequencies of <strong>in</strong>terest, <strong>and</strong> assume that the k-column of S satisfies thefollow<strong>in</strong>g design specifications:|S ik (jω)| ≤ α ik for ω <strong>in</strong> [−ω 1 , ω 1 ], i = 1, . . . , n , (4.29)where α ik , i = 1, . . . , n, are small positive numbers. Let Θ q (ω 1 ) be theweighted length of the <strong>in</strong>terval [−ω 1 , ω 1 ], as def<strong>in</strong>ed <strong>in</strong> (3.33) <strong>in</strong> Chapter3, which can be expressed asΘ q (ω 1 ) =∫ ω1We then have the follow<strong>in</strong>g corollary.0σ qσ 2 q + (ω q − ω) 2 + σ qσ 2 q + (ω q + ω) 2 dω.Corollary 4.3.4. Assume that all the conditions of Theorem 4.3.1 hold.Then, if the k-column of S, where k is <strong>in</strong> I Ψ , achieves the specificationsgiven <strong>in</strong> (4.29), the follow<strong>in</strong>g <strong>in</strong>equality must be satisfied,‖S kk ‖ ∞ +n∑∣i=1i≠kψ ∗ iψ ∗ k⎛∣ ‖S ⎜ 1ik‖ ∞ ≥ ⎝α kk + ∑ n ∣i=1∣ ψ∗ iψi≠k∗ k⎞⎟⎠∣ α ikΘ q(ω 1 )π−Θ q(ω 1 ).(4.30)Proof. Divid<strong>in</strong>g the range of <strong>in</strong>tegration <strong>and</strong> us<strong>in</strong>g the weighted length ofthe <strong>in</strong>terval [−ω 1 , ω 1 ], the <strong>in</strong>equality (4.27) impliesn∑ ψ ∗ ilog maxω∈[−ω 1 ,ω 1 ] ∣ ψ ∗ S ik (jω)i=1 k∣ Θ q(ω 1 )+n∑ ψ ∗ ilogS∥ik (jω) ∥ [π − Θ q (ω 1 )] ≥ 0 .ψ ∗ i=1 k∥∞Exponentiat<strong>in</strong>g both sides above yields[] n∑ ψ ∗ Θq(ω 1 ) ∥ ∥∥∥∥i∑ nmaxSω∈[−ω 1 ,ω 1 ] ∣ik (jω)∣ψ ∗ i=1 kψ ∗ iψ ∗ i=1 kS ik (jω)∥[π−Θ q(ω 1 )]∞≥ 1 .Us<strong>in</strong>g the specifications (4.29) <strong>and</strong> the triangular <strong>in</strong>equality, we have⎛⎞Θ q(ω 1 )⎛⎞[π−Θ q(ω 1 )]⎜n∑⎝α kk +ψ ∗ i∣ ∣ α ⎟ ⎜n∑ik⎠⎝‖S kk ‖ ∞ +ψ ∗ i∣ ∣ ‖S ⎟ik‖ ∞ ⎠ ≥ 1 ,i=1i≠kψ ∗ ki=1i≠kψ ∗ k


4.3 Poisson Integral Formulae 103from which (4.30) follows immediately.□As <strong>in</strong> the SISO case, the above corollary shows that <strong>in</strong>tegral relation(4.25) implies lower bounds on the <strong>in</strong>f<strong>in</strong>ity norm of elements of S. Noticethat the exponent on the RHS of (4.30) is a positive number <strong>and</strong> its base islikely to be larger than one if the coefficients α ik are small enough; hence,the more dem<strong>and</strong><strong>in</strong>g the specifications, the larger these lower bounds are.If the zero direction is not canonical, an important difference <strong>in</strong> theMIMO case is that the lower bounds apply to a comb<strong>in</strong>ation of norms of elements,which somehow relaxes the constra<strong>in</strong>t over the SISO case (wherethere is only one element). Also, the lower bounds are smaller than <strong>in</strong> theSISO case due to the presence of extra positive terms <strong>in</strong> the denom<strong>in</strong>ator ofthe RHS of (4.30). As a consequence, when the zero direction is not canonical,there is an alleviation of the cost associated with the correspond<strong>in</strong>gzero as compared with the scalar case. To ga<strong>in</strong> further <strong>in</strong>sight <strong>in</strong>to theseideas, we will next consider the special case of a diagonally decoupleddesign.4.3.4 The Cost of Decoupl<strong>in</strong>gMultivariable systems are, by their <strong>in</strong>tr<strong>in</strong>sic nature, subject to coupl<strong>in</strong>gbetween different outputs <strong>and</strong> <strong>in</strong>puts. This means that, <strong>in</strong> general, one <strong>in</strong>putaffects more than one output <strong>and</strong>, conversely, one output is affectedby more than one <strong>in</strong>put. A natural approach to tackle the additional designdifficulty aris<strong>in</strong>g from this <strong>in</strong>teraction is to devise methodologies thattranslate — or approximate — the MIMO problem <strong>in</strong>to a collection ofSISO problems. There are alternative ways to do this, but they all <strong>in</strong>volveachiev<strong>in</strong>g special structures for the closed-loop transfer matrices <strong>in</strong> such away that the coupl<strong>in</strong>g is elim<strong>in</strong>ated or is easier to h<strong>and</strong>le. Examples ofthese decoupl<strong>in</strong>g methodologies <strong>in</strong>clude diagonalization, triangularization,diagonal dom<strong>in</strong>ance, etc., which have been frequently reported <strong>in</strong>the literature (e.g., Hung <strong>and</strong> Anderson, 1979; Rosenbrock, 1969; Weller<strong>and</strong> Goodw<strong>in</strong>, 1993; Desoer <strong>and</strong> Gündes, 1986).In all of the above references, limitations imposed by nonm<strong>in</strong>imum phasezeros are discussed as an additional complication. Desoer <strong>and</strong> Gündes(1986) have made this more precise by show<strong>in</strong>g that, <strong>in</strong> order to achievediagonal decoupl<strong>in</strong>g, the multiplicity of nonm<strong>in</strong>imum phase zeros mayneed to be <strong>in</strong>creased. This, <strong>in</strong> turn, is associated with performance penaltiessuch as <strong>in</strong>creased undershoot <strong>and</strong> rise times.Follow<strong>in</strong>g Gómez <strong>and</strong> Goodw<strong>in</strong> (1995), we next study the cost of decoupl<strong>in</strong>g<strong>in</strong> the context of <strong>in</strong>tegral constra<strong>in</strong>ts. Note that, for a diagonallydecoupled system, the elements S ik , for i ≠ k, are zero. It then followsfrom Corollary 4.3.3 that, for each k <strong>in</strong> I Ψ , the diagonal element S kk satis-


104 4. MIMO <strong>Control</strong>fies the <strong>in</strong>tegral constra<strong>in</strong>t∫ ∞σ qlog |S kk (jω)|−∞σ 2 dω ≥ 0 . (4.31)q + (ω q − ω)2Also, if the sensitivity function is diagonal, <strong>and</strong> the diagonal element S kk ,where k is <strong>in</strong> I Ψ , is designed to satisfy |S kk (jω)| ≤ α kk for ω ∈ [−ω 1 , ω 1 ],Corollary 4.3.4 gives‖S kk ‖ ∞ ≥( ) Θq(ω 1)1 π−Θ q(ω 1 ). (4.32)α kkTo <strong>in</strong>vestigate the cost of diagonal decoupl<strong>in</strong>g, we compare the constra<strong>in</strong>t(4.28) with (4.31), <strong>and</strong> the lower bound (4.30) with (4.32).We first study the case where the dimension of the null space correspond<strong>in</strong>gto the zero at s = q is one. In this case we note that, if the zerodirection Ψ has more than one nonzero element, then it seems feasible toachieve a negative value for the RHS of (4.28) by mak<strong>in</strong>g off-diagonal elements<strong>in</strong> the sensitivity function nonzero. Compar<strong>in</strong>g this with the result<strong>in</strong> (4.31), where the RHS is zero, <strong>in</strong>dicates that when Ψ has more than onenonzero element, it is possible to exploit nondiagonal sensitivity entriesso as to ameliorate the constra<strong>in</strong>ts on the diagonal element. Thus, if Ψ has morethan one nonzero element, it is possible to have a jo<strong>in</strong>t spatial (i.e., nondiagonal)<strong>and</strong> frequency trade-off <strong>in</strong> sensitivity allocation. Similar remarksapply to the bounds on ‖S kk ‖ ∞ shown <strong>in</strong> (4.30) <strong>and</strong> (4.32). In the lattercase, nondiagonal decoupl<strong>in</strong>g allows α ik , for i ≠ k, to be greater thanzero <strong>and</strong> aga<strong>in</strong>, provided the vector Ψ has more than one nonzero element,then the RHS of (4.30) can be made less than the RHS of (4.32). Moreover,as already noted <strong>in</strong> §4.3.3, the LHS of (4.30) also conta<strong>in</strong>s positive termsrelated to the norms of the off-diagonal elements, which do not appear <strong>in</strong>(4.32). In this way, the cost <strong>in</strong>duced by the ORHP zero is somehow sharedamong various elements of the column.Interest<strong>in</strong>gly, hav<strong>in</strong>g more than one nonzero entry <strong>in</strong> the zero directionΨ means that the particular nonm<strong>in</strong>imum phase zero <strong>in</strong> question has itseffects associated with a l<strong>in</strong>ear comb<strong>in</strong>ation of more than one output. Conversely,if Ψ is canonical, i.e., it has only one nonzero element, then the zeroaffects only one output. In the latter case, one might expect that it is unhelpfulto make the sensitivity nondiagonal <strong>and</strong> this is precisely the po<strong>in</strong>tmade above.The preced<strong>in</strong>g arguments have focused on the case where the dimensionof the null space is one. For cases where the dimension of the nullspace is greater than one, then the issue arises as to whether or not thereexists a basis for the null space that is canonical, i.e., it uses only unit vectorswith only one nonzero entry. In cases where such a canonical basis


4.3 Poisson Integral Formulae 105exists, 9 it would seem, by extension of the arguments used above for theone dimensional case, that there is no apparent benefit to be ga<strong>in</strong>ed byexploit<strong>in</strong>g spatial sensitivity trade-offs.As a f<strong>in</strong>al remark, note that the <strong>in</strong>equality (4.32) is the same as for thescalar case (see §3.3.2 <strong>in</strong> Chapter 3), <strong>and</strong> can be used to describe the waterbedeffect for nonm<strong>in</strong>imum phase systems: reduc<strong>in</strong>g the sensitivity <strong>in</strong> onefrequency range causes it to rise <strong>in</strong> other ranges. However, (4.30) showsthat, if decoupl<strong>in</strong>g is removed as a constra<strong>in</strong>t, then spatial as well as frequencytrade-offs are possible. Thus, the water-bed effect on the diagonal elementsof the sensitivity function may be ameliorated by allow<strong>in</strong>g off-diagonalelements to be nonzero. Nevertheless, this requires directional conditions(noncanonical zero directions) so that the cost associated with the <strong>in</strong>tegralconstra<strong>in</strong>ts can be shared by different outputs.4.3.5 The Impact of Near Pole-Zero CancelationsCancelations of ORHP zeros <strong>and</strong> poles between plant <strong>and</strong> controller of thefeedback loop of Figure 4.1 is highly undesirable because it leads to loss of<strong>in</strong>ternal stability (cf. §2.2.1 <strong>in</strong> Chapter 2). In the multivariable case, thesecancelations <strong>in</strong>volve not only frequency locations of zeros <strong>and</strong> poles, butalso their directions. For example, if the plant <strong>and</strong> controller are expressedas <strong>in</strong> (4.2), then the closed loop will not be <strong>in</strong>ternally stable if ˜D G <strong>and</strong> Ñ Kshare an ORHP zero with the same <strong>in</strong>put direction (Gómez <strong>and</strong> Goodw<strong>in</strong>,1995). It is easy to see that, if this is the case, then ˜D G <strong>and</strong> Ñ K are notright coprime. Loss of <strong>in</strong>ternal stability also occurs if Ñ G <strong>and</strong> ˜D K are notright coprime, if N G <strong>and</strong> D K are not left coprime, <strong>and</strong> if D G <strong>and</strong> N K arenot left coprime (Vidyasagar, 1985). All these four cases are multivariableextensions of unstable pole-zero cancelations. Note that Assumption 4.1precludes any of these cancelations by not allow<strong>in</strong>g even frequency co<strong>in</strong>cidenceof zeros <strong>and</strong> poles.Approximate unstable cancelations are also problematic. Indeed, wehave seen <strong>in</strong> Chapter 3 that, for scalar systems, severe sensitivity peaksarise if there exist open-loop zeros <strong>and</strong> poles at near frequencies. This canbe seen clearly from the values of the Poisson <strong>in</strong>tegral constra<strong>in</strong>ts (3.29)<strong>and</strong> (3.30) <strong>in</strong> §3.3, where the Blaschke products on the RHSs tend to <strong>in</strong>f<strong>in</strong>ityas ORHP zeros approach ORHP poles. We will see next that similarconclusions hold <strong>in</strong> the multivariable case regard<strong>in</strong>g approximate unstablecancelations — keep<strong>in</strong>g <strong>in</strong> m<strong>in</strong>d that these cancelations <strong>in</strong>volve bothfrequency <strong>and</strong> directions. To analyze the effect of these unstable multivariablecancelations, we consider separately the case where the directions are9 One <strong>in</strong>stance for the left null space of the plant to have a canonical basis is when theq-<strong>in</strong>teractor (Weller <strong>and</strong> Goodw<strong>in</strong>, 1993) associated with the zero q is diagonal (Gómez <strong>and</strong>Goodw<strong>in</strong>, 1995).


106 4. MIMO <strong>Control</strong>the same but the poles <strong>and</strong> zeros are close <strong>in</strong> frequency, <strong>and</strong> the case wherethe locations of poles <strong>and</strong> zeros are the same but the directions are close.Let q ∈ ¢ +be a zero of G with output direction Ψ ∈ ¢ n . It followsfrom Theorem 4.3.1 that, under Assumption 4.3, the sensitivity functionsatisfies the <strong>in</strong>tegral constra<strong>in</strong>t (4.25). Consider first the case that the vectorΨ is also an output direction of a zero q 1 ≠ q of D K , i.e., the controller hasa pole at s = q 1 such that Ψ ∗ D K (q 1 ) = 0. Inspection of the expression forS <strong>in</strong> (4.3) clearly shows that s = q 1 is a zero of the vector function Ψ ∗ S,i.e., each of its elements ρ k def<strong>in</strong>ed <strong>in</strong> (4.24) has a zero at s = q 1 . Thus, theBlaschke product of zeros of ρ k , B k , can be expressed asB k (s) =( s − q1s + q 1) nk˜B k (s) ,where n k ≥ 1, <strong>and</strong> where ˜B k (s) is the Blaschke product of the rema<strong>in</strong><strong>in</strong>gzeros of ρ k . Us<strong>in</strong>g the above expression <strong>in</strong> (4.25), <strong>and</strong> the fact that| ˜B −1k(s)| ≥ 1 at any po<strong>in</strong>t s <strong>in</strong> + , we have that¢∫ 1 ∞ n∑ ψ ∗ ∣iσ qlogπ −∞ ∣ ψ ∗ S ik (jω)i=1 k∣ σ 2 q + (ω q − ω) 2 dω ≥ n k logq + q 1 ∣∣∣∣ ,q − q 1(4.33)for each k <strong>in</strong> I Ψ , i.e., such that ψ k ≠ 0. We can conclude from (4.33) that,similar to the scalar case, the lower bound on the RHS becomes arbitrarilylarge when q 1 approaches q. This, <strong>in</strong> turn, would lead to high peaks onthe magnitudes of the elements of S (<strong>and</strong> of T s<strong>in</strong>ce S + T = I) on thek-column. Similar arguments apply when an unstable pole of the planthas the same direction as a nonm<strong>in</strong>imum phase zero of the controller. Inthis case, it is also possible to show that, as the location of the zero of thecontroller becomes close to that of the pole of the plant, large peaks <strong>in</strong> Twill occur.Another alternative for a near multivariable cancelation is when thecontroller has an unstable pole at the same frequency, q say, at which theplant has a nonm<strong>in</strong>imum phase zero, but with a slightly different direction.Let Ψ ∈ n be the output direction of the zero of the ¢ plant, <strong>and</strong> letΨ + ɛ be the direction of the pole of the controller. Then, cases (ii) <strong>and</strong> (iii)of Lemma 4.1.1 hold, i.e.,Ψ ∗ S(q) = Ψ ∗ , <strong>and</strong> Ψ ∗ T(q) = 0 ;(Ψ ∗ + ɛ ∗ )S(q) = 0 , <strong>and</strong> (Ψ ∗ + ɛ ∗ )T(q) = Ψ ∗ + ɛ ∗ .The above <strong>in</strong>terpolation conditions lead toɛ ∗ S(q) = −Ψ ∗ , <strong>and</strong> ɛ ∗ T(q) = Ψ ∗ + ɛ ∗ ,


4.3 Poisson Integral Formulae 107which, us<strong>in</strong>g any compatible matrix <strong>and</strong> vector norm, imply that‖S(q)‖ ≥ |Ψ|/|ɛ| ,‖T(q)‖ ≥ |Ψ + ɛ|/|ɛ| .Hence, if |ɛ| ≪ |Ψ| then ‖S(q)‖ <strong>and</strong> ‖T(q)‖ must be large. Consider, for example,the Euclidean norm for vectors <strong>and</strong> the <strong>in</strong>f<strong>in</strong>ity norm for matrices,i.e., the maximum s<strong>in</strong>gular value of the matrix. Then the bounds aboveshow that σ(S(q)) <strong>and</strong> σ(T(q)) will become very large if |ɛ| approacheszero. This, <strong>in</strong> turn, will harden the <strong>in</strong>tegral constra<strong>in</strong>ts on the whole frequencyresponses of σ(S) <strong>and</strong> σ(T), as can be seen from the Poisson <strong>in</strong>tegral<strong>in</strong>equalities obta<strong>in</strong>ed by Chen (1995), e.g.,∫1 ∞σ qlog σ(S(jω))π −∞σ 2 dω ≥ log σ(S(q)) . (4.34)q + (ω q − ω)2Summariz<strong>in</strong>g, the analysis <strong>in</strong> this section has shown that unstable multivariablenear cancelations lead to poor sensitivity properties. On the oneh<strong>and</strong>, ORHP zeros <strong>and</strong> poles of the open-loop system hav<strong>in</strong>g the samedirections <strong>and</strong> near frequencies cause the elements of the sensitivity functionsto exhibit large peaks, as demonstrated by (4.33). On the other h<strong>and</strong>,ORHP zeros <strong>and</strong> poles of the open-loop system at the same frequency <strong>and</strong>hav<strong>in</strong>g directions close <strong>in</strong> norm produce peaks on the maximum s<strong>in</strong>gularvalue of the sensitivity functions, as seen from (4.34) <strong>and</strong> the above discussion.This suggests that multivariable approximate cancelations of ORHPzeros <strong>and</strong> poles should be avoided if possible.4.3.6 ExamplesIn this subsection, we give examples that illustrate the issues discussedbefore for cont<strong>in</strong>uous-time multivariable systems.Example 4.3.1. Consider the plant⎡⎢G(s) = ⎣1 − s s + 3⎤(s + 1) 2 (s + 1)(s + 2) ⎥1 − s s + 4 ⎦ , (4.35)(s + 1)(s + 2) (s + 2) 2which has an ORHP zero at s = 1 with output direction equal to Ψ ∗ =[5, −6]. S<strong>in</strong>ce this direction is not canonical, we can argue from §4.3.4 thatthere will be a cost <strong>in</strong> sensitivity associated with achiev<strong>in</strong>g diagonal decoupl<strong>in</strong>g.We will <strong>in</strong>vestigate this cost.Direct application of Corollary 4.3.2 leads to the follow<strong>in</strong>g <strong>in</strong>equalitiesthat must be satisfied by the entries of the sensitivity function, regardless


108 4. MIMO <strong>Control</strong>of the design methodology used:∫1 ∞ logπ ∣ S 11(jω) − 6 −∞5 S 21(jω)∣1log5π ∣6 S 12(jω) − S 22 (jω)∣∫ ∞−∞1dω ≥ 0 , (4.36)1 + ω2 1dω ≥ 0 . (4.37)1 + ω2 Assum<strong>in</strong>g further that the sensitivity elements are required to satisfy (4.29),then Corollary 4.3.4 gives the follow<strong>in</strong>g lower bounds on l<strong>in</strong>ear comb<strong>in</strong>ationsof element norms:‖S 11 ‖ ∞ + 6 () Θq(ω 1)5 ‖S 1π−Θ q(ω 1 )21‖ ∞ ≥, (4.38)α 11 + 6/5 α 21‖S 22 ‖ ∞ + 5 () Θq(ω 1)6 ‖S 1π−Θ q(ω 1 )12‖ ∞ ≥. (4.39)α 22 + 5/6 α 12Next, we add the restriction of diagonal decoupl<strong>in</strong>g. Then, us<strong>in</strong>g (4.31)<strong>and</strong> (4.32), we have, for k = 1, 2,∫ ∞1log |S kk (jω)| dω ≥ 0 , (4.40)−∞1 + ω2 ( ) Θq(ω 1)1 π−Θ q(ω 1 )‖S kk ‖ ∞ ≥. (4.41)α kkCompar<strong>in</strong>g (4.36) <strong>and</strong> (4.37) with (4.40), for k = 1 <strong>and</strong> k = 2, respectively,it is clear that the constra<strong>in</strong>t on the diagonal elements is alleviated if theoff-diagonal entries are nonzero. More <strong>in</strong>terest<strong>in</strong>g, however, is to comparethe lower bounds on the peak sensitivity given by (4.38) <strong>and</strong> (4.39) with(4.41) for k = 1 <strong>and</strong> k = 2, respectively. Consider, for example, (4.38) <strong>and</strong>(4.41) for k = 1. These bounds are shown <strong>in</strong> Figure 4.2 as a function of thereduction level α 11 , for ω 1 = 0.3 <strong>and</strong> for different values of α 21 . Observethat α 21 = 0 <strong>in</strong> (4.38) (curve 1 <strong>in</strong> Figure 4.2) corresponds to the lowerbound <strong>in</strong> (4.41) for k = 1. Compar<strong>in</strong>g curve 1 with the other curves <strong>in</strong>Figure 4.2, we can see that relax<strong>in</strong>g the requirement of diagonal decoupl<strong>in</strong>g<strong>and</strong> allow<strong>in</strong>g the level of sensitivity reduction <strong>in</strong> the off-diagonalelements (S 21 <strong>in</strong> this case) to be greater than zero, significantly reducesthe constra<strong>in</strong>t on the comb<strong>in</strong>ation of peak sensitivity values.To further analyze the cost of decoupl<strong>in</strong>g, we next apply the particulardesign methodology proposed <strong>in</strong> Weller <strong>and</strong> Goodw<strong>in</strong> (1993). Thisdesign technique allows one to achieve different degrees of decoupl<strong>in</strong>gwhile keep<strong>in</strong>g other design parameters essentially constant. Follow<strong>in</strong>gthis method with the parameter a n — which very roughly def<strong>in</strong>es the


4.3 Poisson Integral Formulae 1093 122.52341.510 0.05 0.1 0.15 0.2α 11FIGURE 4.2. RHS of (4.38) as a function of α 11 , for ω 1 = 0.3 <strong>and</strong> for: 1- α 21 = 0;2- α 21 = 0.01; 3- α 21 = 0.05; 4- α 21 = 0.1.b<strong>and</strong>width — equal to 1, the resultant closed-loop sensitivity is⎡s 3 + s 2 ⎤(2 + λ) + s(1 + 3λ)S(s) = ⎢ (s + 1) 3 0⎥⎣(λ − 1)(3s 4 − 16s 3 − 21s 2 ) s 3 + 3s 2 + 4s⎦ , (4.42)3(s + 1) 5 (s + 1) 3where λ = 1 gives diagonal decoupl<strong>in</strong>g <strong>and</strong> λ < 1 gives partial <strong>and</strong> staticdecoupl<strong>in</strong>g. 10Because of the structure of this particular design, which leads to S 12 = 0,the <strong>in</strong>tegral constra<strong>in</strong>ts that must be satisfied are, for λ < 1: (4.40) fork = 1, 2, <strong>and</strong> for λ = 0: (4.36) <strong>and</strong> (4.40) for k = 2. Compar<strong>in</strong>g (4.36) <strong>and</strong>(4.40) for k = 1 <strong>in</strong>dicates that there should be an <strong>in</strong>crease <strong>in</strong> the sensitivitypeak of S 11 if diagonalization is required. This is evident from Figure 4.3,where |S 11 (jω)| is plotted for different values of λ <strong>in</strong> the range 0 ≤ λ ≤ 1.Turn<strong>in</strong>g to the s<strong>in</strong>gular value approach, note that the <strong>in</strong>tegral constra<strong>in</strong>t(4.34) holds due to the presence of the ORHP open-loop zero q = 1, i.e.,1π∫ ∞−∞1log σ(S(jω)) dω ≥ log σ(S(1)) .1 + ω2 The above constra<strong>in</strong>t, as it is stated, depends on the particular designmethodology. However, it is easy to show that σ(S(1)) ≥ 1 for any design,11 <strong>and</strong> hence the weighted <strong>in</strong>tegral of the logarithm of the maximum10 The closed loop of Figure 4.1 is said to be: statically decoupled if it is <strong>in</strong>ternally stable<strong>and</strong> the complementary sensitivity function T is nons<strong>in</strong>gular <strong>and</strong> satisfies T(0) = I; partiallydecoupled if it is <strong>in</strong>ternally stable <strong>and</strong> the complementary sensitivity function is nons<strong>in</strong>gular<strong>and</strong> lower triangular.11 This is because of norm properties <strong>and</strong> the fact that, if the open-loop system has anORHP zero q, then there exists a vector Ψ such that Ψ ∗ S(q) = Ψ ∗ , see Theorem 13.1.1 <strong>in</strong>Chapter 13 for more details.


110 4. MIMO <strong>Control</strong>21.5λ=1|S 11 (jω)|10.5λ=0010 −2 10 −1 10 0ω10 1 10 2FIGURE 4.3. Effect of decoupl<strong>in</strong>g on S 11 (jω) for Example 4.3.1.s<strong>in</strong>gular value of S is nonnegative. This implies, for example, that, forany design, the frequency response of σ(S) will necessarily achieve valuesgreater than one.We next consider aga<strong>in</strong> the (partial) decoupl<strong>in</strong>g design of Weller <strong>and</strong>Goodw<strong>in</strong> (1993), which led to the sensitivity function given <strong>in</strong> (4.42). Figure4.4 shows the maximum (left) <strong>and</strong> m<strong>in</strong>imum (right) s<strong>in</strong>gular values ofS for different values of λ <strong>in</strong> the range 0 ≤ λ ≤ 1.2.52.52λ=02σ 1 (jω)1.51λ=1σ 2 (jω)1.51λ=10.50.5λ=0010 −2 10 −1 10 0ω10 1 10 2010 −2 10 −1 10 0ω10 1 10 2FIGURE 4.4. Maximum (left) <strong>and</strong> m<strong>in</strong>imum (right) s<strong>in</strong>gular values of S <strong>in</strong> Example4.3.1, for different degrees of decoupl<strong>in</strong>g.In particular, the plot on the left confirms our prediction that σ(S(jω))would peak above one. An <strong>in</strong>terest<strong>in</strong>g observation is that total (diagonal)decoupl<strong>in</strong>g (λ = 1) is the best situation for the maximum s<strong>in</strong>gular value,<strong>in</strong> the sense that λ = 1 gives the smallest peak value for its frequencyresponse. This situation is desirable s<strong>in</strong>ce, as seen <strong>in</strong> §2.2.4 of Chapter 2,hav<strong>in</strong>g σ(S(jω)) small is a requirement for robust performance aga<strong>in</strong>ste.g., unstructured divisive perturbations of the plant. On the other h<strong>and</strong>,decreas<strong>in</strong>g λ from 1 to 0, i.e., relax<strong>in</strong>g the degree of decoupl<strong>in</strong>g, <strong>in</strong>creasesthe peak value of the maximum s<strong>in</strong>gular value. Compar<strong>in</strong>g these conclusionswith the previous ones about the harden<strong>in</strong>g of the constra<strong>in</strong>ts on


4.3 Poisson Integral Formulae 111the element S 11 as the degree of decoupl<strong>in</strong>g is <strong>in</strong>creased, we can arguethat, for this particular design methodology, the greater the degree of decoupl<strong>in</strong>gachieved, the worse the disturbance rejection properties of thediagonal elements become, while the performance is made more robustaga<strong>in</strong>st unstructured perturbations of the plant.◦Example 4.3.2. Consider the plant⎡⎤(1 − s)(s + 2)(1 − s)(s + 2)G(s) = ⎢ (2s + 1)(s + 1)(s + 4) (2s + 1)(s + 1)(s + 3)⎥⎣ −(s 4 + 7s 3 + 14s 2 + 6s − 4) −(s + 2)(s 3 + 4s 2 + 3s + 4) ⎦ ,(2s + 1) 2 (s + 1)(s + 3)(s + 4) (2s + 1) 2 (s + 1)(s + 3)(s + 5)(4.43)which has an ORHP zero at s = 1 with output direction equal to Ψ ∗ =[1, 0]. This plant was considered <strong>in</strong> Weller <strong>and</strong> Goodw<strong>in</strong> (1993), where itwas shown that the correspond<strong>in</strong>g <strong>in</strong>teractor matrix is nondiagonal <strong>and</strong>hence diagonal decoupl<strong>in</strong>g was achieved at the expense of <strong>in</strong>troduc<strong>in</strong>gadditional nonm<strong>in</strong>imum phase zeros <strong>and</strong> <strong>in</strong>creas<strong>in</strong>g the relative degree ofthe closed-loop transfer functions.From the po<strong>in</strong>t of view of sensitivity constra<strong>in</strong>ts, however, no <strong>in</strong>crease<strong>in</strong> the peak value of |S 11 (jω)| is expected due to diagonal decoupl<strong>in</strong>g s<strong>in</strong>cethe zero direction is canonical. Indeed, as discussed <strong>in</strong> §4.3.2, the elementS 11 satisfies the <strong>in</strong>tegral constra<strong>in</strong>t∫ ∞1log |S 11 (jω)|−∞1 + ω 2 dω ≥ 0 ,whether or not the system is decoupled. Thus, <strong>in</strong> terms of the sensitivitybounds presented <strong>in</strong> this chapter, there is no apparent cost associated withdecoupl<strong>in</strong>g.Us<strong>in</strong>g the methodology of Weller <strong>and</strong> Goodw<strong>in</strong> (1993), leads to the follow<strong>in</strong>gsensitivity function:S(s) =⎡⎢⎣4s 4 + s 3 (14 − 2λ) + 14s 2 + s(4 + 2λ)(s + 1) 3 0(1 − λ)s4s 3 + 8s 2 + 6s(2s + 1) 2 (2s + 1) 2 (s + 1)where, as before, λ = 1 gives diagonal decoupl<strong>in</strong>g <strong>and</strong> λ < 1 gives partial<strong>and</strong> static decoupl<strong>in</strong>g. Figure 4.5 shows the plot of |S 11 (jω)| versus frequency,for different values of λ <strong>in</strong> the range 0 ≤ λ ≤ 1. It can be observedthat the change <strong>in</strong> the peak of S 11 for different values of λ is m<strong>in</strong>imal <strong>and</strong>,<strong>in</strong> any case, is accompanied by a change <strong>in</strong> b<strong>and</strong>width. Compar<strong>in</strong>g theresults with those of Example 4.3.1 we conclude that here we essentiallyhave only a frequency trade-off <strong>in</strong> sensitivity, whereas <strong>in</strong> Example 4.3.1we also had a spatial trade-off; yet both have a nonm<strong>in</strong>imum phase zero⎤⎥⎦ ,


112 4. MIMO <strong>Control</strong>21.5λ=1 λ=0|S 11 (jω)|10.5010 −2 10 −1 10 0ω10 1 10 2FIGURE 4.5. Effect of decoupl<strong>in</strong>g on S 11 (jω) for Example 4.3.2.at the same location. The difference <strong>in</strong> these examples can be traced backto the directions associated with the zero.◦Example 4.3.3. Here we study the importance of directions when evaluat<strong>in</strong>gthe impact of near cancelation of ORHP zeros <strong>and</strong> poles. Consider theplant G = ˜D −1G ÑG = N G D −1G , where⎡⎤s − 1⎢0˜D G (s) = ⎣s + 1 ⎥s + 1⎦ = D G (s) ,1s + 2⎡⎤2⎢0Ñ G (s) = ⎣s + 2 ⎥2 ⎦ = N G (s) .0s + 2The matrix ˜D G has a zero at s = 1 with <strong>in</strong>put direction [2/3, −1] ∗ , whichcorresponds to an unstable pole of the plant.We will use the parametrization of all stabiliz<strong>in</strong>g controllers for thegiven plant. This parametrization possesses a free parameter, which willbe used to study the effect of directions <strong>in</strong> near pole-zero cancelations. Astraightforward calculation shows that the matrices⎡s + 2⎢X(s) = Y(s) =s + 1⎣ (s + 2)2−(s + 1)(s + 3)⎤0⎥s + 2⎦ ,s + 3satisfy XD + YN = I. Follow<strong>in</strong>g Vidyasagar (1985, Theorem 5.2.1), wehave that all controllers that stabilize the closed loop have the form K =(X + RÑ G ) −1 (Y − R ˜D G ), where R is a proper <strong>and</strong> stable transfer matrixsuch that X + RÑ G is nons<strong>in</strong>gular <strong>and</strong> biproper. We next choose R to have


4.4 Discrete Systems 113the follow<strong>in</strong>g special form⎡r 1 (s)⎢R(s) = ⎣s + 1r 2 (s)s + 1⎤r 3 (s)s + 1⎥r 4 (s)⎦ ,s + 1where the polynomials r i , i = 1, . . . , 4, have degree one <strong>and</strong> are such thatthe matrix R satisfies the follow<strong>in</strong>g <strong>in</strong>terpolation constra<strong>in</strong>ts⎡[ ]27ψ⎤1−2 00R(0) = 4− 2 ⎢, <strong>and</strong> R(1) =2(6ψ⎣2 + 9ψ 1 ) ⎥93 308 − 81ψ ⎦ .14(6ψ 2 + 9ψ 1 )It is possible to verify that this choice of R has the follow<strong>in</strong>g properties:• The controller stabilizes the closed loop.• The closed loop achieves static decoupl<strong>in</strong>g.• The controller has a nonm<strong>in</strong>imum phase zero at s = 1 with <strong>in</strong>putdirection Ψ = [ψ 1 , ψ 2 ] ∗ .Thus, we can freely specify the direction of the zero of the controller, whichis at the same frequency location as a pole of the plant.First note that, as discussed <strong>in</strong> §4.3.5, <strong>in</strong>ternal stability will be lost if Ψis aligned with [2/3, −1] ∗ . To cont<strong>in</strong>ue, we will use Ψ to show that the sensitivitydoes not necessarily behave badly for co<strong>in</strong>cident ORHP poles <strong>and</strong>zeros unless their directions are also (nearly) coll<strong>in</strong>ear. To do this, we considerseveral choices of Ψ, start<strong>in</strong>g with Ψ = [−1.94, 3.04] ∗ , which is almostcoll<strong>in</strong>ear with the direction of the pole, <strong>and</strong> then rotat<strong>in</strong>g Ψ clockwise upto Ψ = [3, 2] ∗ , which is orthogonal to the plant pole direction.Figure 4.6 shows the peak values of |T ij (jω)|, i, j = 1, 2 (<strong>in</strong> dB), as a functionof the angle between zero <strong>and</strong> pole directions, which ranges from−0.99 π (almost coll<strong>in</strong>ear) to -π/2 (orthogonal). It can be seen from thisfigure that the peaks <strong>in</strong>crease significantly when the direction of the controllerzero gets close to the correspond<strong>in</strong>g direction of the unstable poleof the plant. Note, however, that the peak values are not large when thedirection of the zero is not close to that of the pole. These results are <strong>in</strong>accord with the conclusions drawn <strong>in</strong> §4.3.5 <strong>and</strong> emphasize the role of directions<strong>in</strong> quantify<strong>in</strong>g performance limits.◦4.4 Discrete SystemsIn this section, we briefly address discrete-time systems. In particular, wegive the discrete-time counterpart of the Poisson sensitivity <strong>in</strong>tegral formultivariable systems developed for cont<strong>in</strong>uous-time systems <strong>in</strong> §4.3.2.


114 4. MIMO <strong>Control</strong>806040Magnitude [dB]200−20−40T 21T 22T 12T 11−60−80−100−3 −2.5 −2 −1.5Zero−Pole Angle [rad]FIGURE 4.6. Peaks <strong>in</strong> complementary sensitivity versus angle between zero <strong>and</strong>pole directions.4.4.1 Poisson Integral for SConsider the unity feedback configuration of Figure 4.7, where the openloopsystem, L, is an n × n, proper transfer matrix of the complex variablez. As before, assume that L is formed by the cascade of plant, G, <strong>and</strong> controller,K, i.e., L = GK. Let the plant <strong>and</strong> controller have coprime factorizations12 as <strong>in</strong> (4.2). Then, the sensitivity function <strong>in</strong> (3.43) can be expressedas <strong>in</strong> (4.3).r + e y❜ ✲ ✐ ✲ L(z) ✲ ❜− ✻FIGURE 4.7. Discrete-time feedback control system.To preclude unstable pole-zero cancelations <strong>in</strong> the open-loop system,we make the follow<strong>in</strong>g assumption.Assumption 4.4. The sets of frequency locations of zeros <strong>and</strong> poles of theopen-loop system L outside the unit disk are disjo<strong>in</strong>t.◦Under Assumption 4.4, if q £c∈ is a zero of NG (i.e., a nonm<strong>in</strong>imumphase zero of G) with output direction Ψ ¢ ∈ n , then S <strong>and</strong> T satisfy thefollow<strong>in</strong>g <strong>in</strong>terpolation constra<strong>in</strong>tΨ ∗ S(q) = Ψ ∗ , <strong>and</strong> Ψ ∗ T(q) = 0 . (4.44)Other <strong>in</strong>terpolation constra<strong>in</strong>ts hold at the plant’s unstable poles as wellas at zeros <strong>and</strong> poles of the controller £c<strong>in</strong> (see Lemma 4.1.1).12 Over the r<strong>in</strong>g of proper transfer matrices hav<strong>in</strong>g poles <strong>in</strong>side the unit disk (i.e., stable).


4.4 Discrete Systems 115Assume that S is proper <strong>and</strong> stable, <strong>and</strong> consider the notation of §4.3.1.In particular, ρ k <strong>in</strong> (4.24) is now a proper, stable, scalar rational functionof z. Let B k be the Blaschke product of the zeros of ρ k £c<strong>in</strong> , def<strong>in</strong>ed asThen, ρ k can be factored asB k (z) =∏n ki=1z i|z i |z − z i1 − z i z .ρ k (z) = B k (z) ˜ρ k (z) z −δ k, (4.45)where δ k is the relative degree of ρ k , <strong>and</strong> ˜ρ k is stable, m<strong>in</strong>imum phase,<strong>and</strong> has relative degree zero. Note that, for the common situation of astrictly proper plant, the diagonal elements of S are biproper, <strong>and</strong> henceδ k <strong>in</strong> (4.45) is zero <strong>in</strong> this case.We then have the follow<strong>in</strong>g result.Theorem 4.4.1 (Poisson Integral for S). Let q = r q e jθq , r q > 1, be a zeroof the plant G, <strong>and</strong> let Ψ ¢ ∈ n , Ψ ≠ 0, be its output direction. Assume thatthe sensitivity function, S, is proper <strong>and</strong> stable. Then, for each <strong>in</strong>dex k <strong>in</strong>I Ψ ,12π∫ π−πn∑log∣ψ ∗ iψ ∗ i=1 kwhere B k <strong>and</strong> δ k are as <strong>in</strong> (4.45).S ik (e jθ r 2 q − 1)∣ 1 − 2r q cos(θ − θ q ) + r 2 qdθ = log |B −1k(q)|+δ k log |q| ,(4.46)Proof. Note that log | ˜ρ k |, with ˜ρ k def<strong>in</strong>ed <strong>in</strong> (4.45), satisfies the assumptionsof Corollary A.6.4 <strong>in</strong> Appendix A. The result then follows by us<strong>in</strong>gthis corollary on log | ˜ρ k | as <strong>in</strong> the proof of Theorem 3.4.2 <strong>in</strong> Chapter 3, <strong>and</strong>then us<strong>in</strong>g the <strong>in</strong>terpolation constra<strong>in</strong>t (4.44) <strong>in</strong> a similar fashion to theproof of Theorem 4.3.1.□The follow<strong>in</strong>g corollary establishes a constra<strong>in</strong>t that is <strong>in</strong>dependent ofthe controller.Corollary 4.4.2. Under the conditions of Theorem 4.4.1, the sensitivityfunction S satisfies, for each <strong>in</strong>dex k <strong>in</strong> I Ψ ,∫ 1 π n∑ ψ ∗ ilogS ik (e jθ r 2 q − 1)2π −π ∣∣ 1 − 2r q cos(θ − θ q ) + r 2 dθ ≥ 0 . (4.47)qψ ∗ i=1 kProof. The proof follows from (4.46), on not<strong>in</strong>g that |B −1k(q)| ≥ 1 <strong>and</strong>δ k log |q| ≥ 0.□The above results will be used <strong>in</strong> the follow<strong>in</strong>g chapter to study limitationsfor periodic systems.


116 4. MIMO <strong>Control</strong>4.5 SummaryThis chapter has discussed two different approaches to the <strong>in</strong>vestigationof sensitivity limitations <strong>in</strong> multivariable l<strong>in</strong>ear control. One of these approachesuses <strong>in</strong>tegral constra<strong>in</strong>ts on sensitivity vectors, whilst the otherapproach considers <strong>in</strong>tegral constra<strong>in</strong>ts on the logarithm of the s<strong>in</strong>gularvalues of the sensitivity functions. Each of these approaches to the problememphasizes different aspects of multivariable feedback design, <strong>and</strong>, <strong>in</strong>comb<strong>in</strong>ation, they provide a general view of multivariable design limitationsimposed by open-loop ORHP zeros <strong>and</strong> poles. On the one h<strong>and</strong>, theanalysis by means of <strong>in</strong>tegral constra<strong>in</strong>ts on sensitivity vectors has provenparticularly useful to give <strong>in</strong>sights <strong>in</strong>to multivariable issues especially relat<strong>in</strong>gto the cost of decoupl<strong>in</strong>g, spatial <strong>and</strong> frequency doma<strong>in</strong> trade-offs <strong>in</strong>sensitivity reduction, <strong>and</strong> the effect of directionality on the impact of nearunstable pole-zero cancelations on performance. On the other h<strong>and</strong>, the<strong>in</strong>tegral relations on the logarithm of the s<strong>in</strong>gular values of the sensitivityfunction suggest a l<strong>in</strong>k between the directionality properties — <strong>in</strong> termsof s<strong>in</strong>gular vectors, see Chapter 2 — <strong>and</strong> sensitivity reduction. Indeed, allthese <strong>in</strong>tegral relations exhibit a term <strong>in</strong>volv<strong>in</strong>g the Laplacian of the logarithmof sensitivity, which, <strong>in</strong> turn, us<strong>in</strong>g a technique due to Freudenberg<strong>and</strong> Looze (1987, p. 184), can be expressed <strong>in</strong> a way that highlights thedirectionality properties of the sensitivity function. The underst<strong>and</strong><strong>in</strong>g ofthe role of these Laplacians <strong>and</strong> their implications toward feedback designprobably merits further <strong>in</strong>vestigation.Notes <strong>and</strong> ReferencesBode Integral Formulae§4.1 <strong>and</strong> §4.2 are largely based on Chen (1995). This work also obta<strong>in</strong>ed Poisson<strong>in</strong>tegral constra<strong>in</strong>ts on the s<strong>in</strong>gular values of the sensitivity function.Poisson Integral Formulae§4.3 is based on Gómez <strong>and</strong> Goodw<strong>in</strong> (1995). This latter work also presented designlimitations for distributed systems, which allows one to consider transfer matriceswith different time delays affect<strong>in</strong>g each entry. The extension to this class ofsystems required additional technical results regard<strong>in</strong>g zeros <strong>and</strong> the behavior at<strong>in</strong>f<strong>in</strong>ity of entire functions. The examples <strong>in</strong> §4.3.6 were also taken from Gómez<strong>and</strong> Goodw<strong>in</strong> (1995).Other results on design limitations for multivariable systems were given byBoyd <strong>and</strong> Desoer (1985). This work obta<strong>in</strong>ed <strong>in</strong>equality versions of the Bode <strong>and</strong>Poisson <strong>in</strong>tegral formulae based on the recognition that the logarithm of the largests<strong>in</strong>gular value of an analytic transfer function is a subharmonic function.


4.5 Summary 117Related work has been reported by Freudenberg <strong>and</strong> Looze who convert themultivariable problem <strong>in</strong>to a scalar one by pre <strong>and</strong> post multiply<strong>in</strong>g the sensitivityfunction by vectors (Freudenberg <strong>and</strong> Looze, 1985) or by the use of determ<strong>in</strong>ants(Freudenberg <strong>and</strong> Looze, 1987). Similar ideas appear <strong>in</strong> the work of Sule <strong>and</strong>Athani (1991), who use directions associated with poles <strong>and</strong> zeros of the system,result<strong>in</strong>g <strong>in</strong> a directional study of trade-offs.§4.4 follows Gómez <strong>and</strong> Goodw<strong>in</strong> (1995). Alternative approaches to the problemof <strong>in</strong>tegral constra<strong>in</strong>ts for MIMO discrete systems can be found <strong>in</strong> Chen <strong>and</strong>Nett (1995), <strong>and</strong> Hara <strong>and</strong> Sung (1989).


5Extensions to Periodic SystemsPeriodic dynamical systems frequently arise <strong>in</strong> applications. Examples<strong>in</strong>clude batch processes that are taken through a periodic operat<strong>in</strong>g cycle,<strong>and</strong> systems where periodic or multirate sampl<strong>in</strong>g strategies are employed(Feuer <strong>and</strong> Goodw<strong>in</strong>, 1996). A periodic system is time-vary<strong>in</strong>g <strong>in</strong>nature; however, by us<strong>in</strong>g time or frequency doma<strong>in</strong> rais<strong>in</strong>g techniques,it is possible to reduce the analysis to that of a special LTI multivariablesystem.Here we focus on discrete-time l<strong>in</strong>ear periodic feedback systems. Forthese systems, the <strong>in</strong>tegral constra<strong>in</strong>ts for MIMO discrete-time systems of§4.4 <strong>in</strong> Chapter 4, can be used to quantify sensitivity limitations. However,the implications of the latter results <strong>in</strong> the present context are different <strong>and</strong>reflect the <strong>in</strong>tr<strong>in</strong>sic structure of periodic systems.5.1 Periodic Discrete-Time SystemsConsider a state space description for an n-<strong>in</strong>put n-output discrete-timeperiodic system given byx k+1 = A k x k + B k u k ,y k = C k x k + D k u k ,(5.1)where {A k }, {B k }, {C k } <strong>and</strong> {D k } are periodic sequences of period N. Witheach of these periodic sequences, {A k } for example, we associate its Fourier


120 5. Extensions to Periodic Systemsseries expansion, 1 given byĀ n =N−1∑k=0A k e −jθ Nkn , n = 0, 1, . . . , N − 1 , (5.2)where θ N = 2π/N. For future use, we arrange the Fourier coefficients Ā n<strong>in</strong> (5.2) to form the follow<strong>in</strong>g matrix⎡⎤Ā 0 Ā 1 . . . Ā N−1Ā 1 Ā N−1 Ā 0 . . . Ā N−2⎢N ⎣... ... ..⎥⎦ . (5.3)Ā 1 Ā 2 . . . Ā 05.1.1 Modulation RepresentationWe will study the system (5.1) <strong>in</strong> the frequency doma<strong>in</strong>. To do this, wemake use of the double-sided Z-transform 2 , which, for a sequence {x k }, isdef<strong>in</strong>ed by∞∑X(z) = x k z −k . (5.4)k=−∞Let U(z) <strong>and</strong> Y(z) be the Z-transforms of the <strong>in</strong>put <strong>and</strong> output sequences<strong>in</strong> (5.1), {u k } <strong>and</strong> {y k }, respectively. We <strong>in</strong>troduce the modulation representationof U <strong>and</strong> Y, denoted by Ū <strong>and</strong> Ȳ, <strong>and</strong> given by⎡⎤⎡U(z)U(z e −jθ N)Ū(z) = ⎢⎥⎣ . ⎦ , Ȳ(z) =⎢⎣U(z e −j(N−1)θ N)Y(z)Y(z e −jθ N).Y(z e −j(N−1)θ N)⎤⎥⎦ . (5.5)I.e., the modulation representation of a signal is obta<strong>in</strong>ed by “stack<strong>in</strong>g”<strong>in</strong>to a vector the signal <strong>and</strong> its N − 1 versions modulated by the roots ofunity of order N (except the root equal to 1). The modulation representationis sometimes called frequency doma<strong>in</strong> rais<strong>in</strong>g of a signal, as opposed tothe more common time doma<strong>in</strong> rais<strong>in</strong>g.Let Ā, ¯B, ¯C, ¯D, def<strong>in</strong>ed as <strong>in</strong> (5.3), be the matrices of Fourier series coefficientsof the sequences {A k }, {B k }, {C k } <strong>and</strong> {D k } <strong>in</strong> (5.1). Apply<strong>in</strong>g theZ-transform to (5.1), <strong>and</strong> us<strong>in</strong>g the matrices Ā, ¯B, ¯C, ¯D, it is easy to showthat Ū <strong>and</strong> Ȳ def<strong>in</strong>ed <strong>in</strong> (5.5) are related byȲ(z) = ¯H(z)Ū(z) , (5.6)1 Note that the Fourier series expansion of a periodic sequence is another periodic sequence.2 Throughout the rest of this chapter, the double-sided Z-transform will be called just theZ-transform.


5.1 Periodic Discrete-Time Systems 121where¯H(z) = ¯C ¯W −1 (zI − Ā ¯W −1 ) −1 ¯B + ¯D , (5.7)<strong>and</strong> where⎡⎤I 0 . . . 00 e −jθ NI 0 . . .¯W ⎢⎣... ... ..⎥⎦ . (5.8)0 . . . . . . e −j(N−1)θ NWe will refer to the matrix ¯H <strong>in</strong> (5.7) as the modulated transfer matrix ofsystem (5.1). Note that the use of this matrix allows one to treat a periodicsystem as a particular MIMO LTI system.An alternative to the frequency doma<strong>in</strong> <strong>in</strong>put-output representation given<strong>in</strong> (5.6) is found by us<strong>in</strong>g a time doma<strong>in</strong> rais<strong>in</strong>g technique. This latter techniquetakes the system (5.1) <strong>in</strong>to an equivalent time-<strong>in</strong>variant system bystack<strong>in</strong>g a set of N samples of the appropriate signal <strong>in</strong>to a vector. This isa form of series to parallel conversion. The resultant vector can be (s<strong>in</strong>glesided)Z-transformed <strong>in</strong> the usual way to obta<strong>in</strong> an <strong>in</strong>put-output transfermatrix, H R say. It is shown <strong>in</strong> e.g., Feuer <strong>and</strong> Goodw<strong>in</strong> (1996) that ¯H <strong>in</strong>(5.7) is related to this “time doma<strong>in</strong> raised” transfer matrix H R by a l<strong>in</strong>eartransformation of the formwhere Λ <strong>and</strong> W s are given by¯H(z) = W −1s Λ −1 (z)H R (z N )Λ(z)W s , (5.9)⎡⎤I 0 . . . 0Λ(z) =0 zI 0.⎢⎣.. . ... ⎥⎦ ,0 . . . . . . z N−1 I⎡⎤I I . . . . . . II W W 2 . . . W N−1W s = ⎢⎣.... ... .. ⎥⎦ ,I W N−1 W 2(N−1) . . . W (N−1)2with W = e −jθ NI <strong>and</strong> θ N = 2π/N.In view of (5.9) we may conclude that both ¯H <strong>and</strong> H R are equally validrepresentations of periodic systems <strong>and</strong>, hence, conta<strong>in</strong> all the relevantfrequency doma<strong>in</strong> <strong>in</strong>formation. Yet, the time doma<strong>in</strong> raised transfer functionhas a major disadvantage from a frequency doma<strong>in</strong> po<strong>in</strong>t of view,because it describes the output sequence at each N samples, rather than ateach sample. However, we are usually <strong>in</strong>terested <strong>in</strong> the response at eachsample, which requires us to (somehow) comb<strong>in</strong>e the <strong>in</strong>formation coded<strong>in</strong> the successive rows of the time doma<strong>in</strong> raised signals. This is precisely


122 5. Extensions to Periodic Systemsthe <strong>in</strong>terleav<strong>in</strong>g operation captured <strong>in</strong> (5.9). Note that H R will, <strong>in</strong> general,be difficult to <strong>in</strong>terpret whereas the response of the modulated transfermatrix ¯H <strong>in</strong> (5.7) can be directly <strong>in</strong>terpreted as the normal frequency responseat unity sampl<strong>in</strong>g period. 3Our analysis <strong>in</strong> the follow<strong>in</strong>g sections will depend on the “poles” <strong>and</strong>“zeros” of functions that are modulated transfer matrices such as ¯H <strong>in</strong>(5.7). Note that these are not st<strong>and</strong>ard transfer functions due to the frequencyshift<strong>in</strong>g. Not withst<strong>and</strong><strong>in</strong>g this, we def<strong>in</strong>e poles, zeros <strong>and</strong> theirassociated directions <strong>in</strong> the usual way as would be done if ¯H were a transferfunction. For example, we say that p is a nonm<strong>in</strong>imum phase zero of¯H if |p| > 1 (i.e., p £c∈ ) <strong>and</strong> rank ¯H(p) < rank ¯H(z) for almost all z.We end this <strong>in</strong>troductory section with a property of modulated transfermatrices that will be useful <strong>in</strong> subsequent analysis.Proposition 5.1.1. The modulated transfer matrix ¯H <strong>in</strong> (5.7) has the follow<strong>in</strong>gpropertyQ ¯H(z e −jθ N) = ¯H(z)Q ,where Q is given by⎡⎤0 . . . . . . . . . II 0 . . . . . . 0Q =. 0 I .. . ... ...⎢⎣.. .. . .. . ⎥ .. . ⎦0 . . . . . . I 0Proof. From (5.9), the matrix ¯H(z e −jθ N) can be expressed as¯H(z e −jθ N) = W −1s Λ −1 (z e −jθ N)H R (z N e −jNθ N)Λ(z e −jθ N)W s .But Nθ N = 2π <strong>and</strong> thus H R (z N e −jNθ N) = H R (z). It is also straightforwardto verify that Λ(z e −jθ N)W s = Λ(z)W s Q. The result then follows. □Note that Proposition 5.1.1 shows that the matrix Q ¯H(z e −jθ N) is obta<strong>in</strong>edby shift<strong>in</strong>g the columns of ¯H(z) to the left.5.2 Sensitivity FunctionsConsider the feedback configuration of Figure 5.1, where the plant <strong>and</strong> thecontroller are n-<strong>in</strong>put n-output discrete-time N-periodic systems.3 To further illustrate this claim, say that the N-periodic system happens to be time<strong>in</strong>variant.In this case ¯H <strong>in</strong> (5.7) turns out to be a diagonal matrix formed from the usualfrequency response of the time-<strong>in</strong>variant system, whereas the time doma<strong>in</strong> raised transferfunction will be related to the usual frequency response <strong>in</strong> a very <strong>in</strong>direct fashion (see Goodw<strong>in</strong><strong>and</strong> Gómez (1995) for more details).


5.2 Sensitivity Functions 123r k + e❜ ✲ ✐ ✲ k✲✲ y k¯K Ḡ ❜− ✻FIGURE 5.1. Feedback control system.We represent the plant <strong>and</strong> controller by their correspond<strong>in</strong>g modulatedtransfer matrices, Ḡ <strong>and</strong> ¯K, respectively. The signals {r k }, {e k } <strong>and</strong> {y k } aresequences represent<strong>in</strong>g the reference <strong>in</strong>put, error, <strong>and</strong> output, respectively,which have modulated representation ¯R, Ē, <strong>and</strong> Ȳ.We write Ḡ <strong>and</strong> ¯K us<strong>in</strong>g coprime factorizations, over the r<strong>in</strong>g of proper<strong>and</strong> stable transfer functions, as 4Ḡ = ˜D −1G ÑG = N G D −1G ,¯K = ˜D −1K ÑK = N K D −1K . (5.10)By an argument similar to the one used to establish (5.9), it can be shownthat N G , for example, is related to a coprime factor N R Gcorrespond<strong>in</strong>g tothe time doma<strong>in</strong> raised transfer function G R byN G (z) = W −1s Λ −1 (z)N R G(z N )Λ(z)W s , (5.11)<strong>and</strong> similar relations hold for the rema<strong>in</strong><strong>in</strong>g factors of the coprime factorizationsof Ḡ <strong>and</strong> ¯K <strong>in</strong> (5.10).We next consider the mapp<strong>in</strong>gs, <strong>in</strong> the modulated signal space, between¯R <strong>and</strong> Ē, <strong>and</strong> between ¯R <strong>and</strong> Ȳ. We will refer to these matrices as the modulatedsensitivity, ¯S, <strong>and</strong> the modulated complementary sensitivity, ¯T. In termsof the coprime factors <strong>in</strong> (5.10), ¯S <strong>and</strong> ¯T are given by¯S = D K ( ˜D G D K + Ñ G N K ) −1 ˜D G ,¯T = N G (Ñ K N G + ˜D K D G ) −1 Ñ K .(5.12)Note that these are the same expressions used <strong>in</strong> Chapter 4, but now ¯S <strong>and</strong>¯T are Nn × Nn transfer matrices that map modulated representations ofsignals.In the sequel, we make the follow<strong>in</strong>g assumption.Assumption 5.1. The sets of frequency locations of zeros <strong>and</strong> poles of theopen-loop system Ḡ ¯K outside the unit disk are disjo<strong>in</strong>t.◦Under Assumption 5.1, if q £c∈ is a zero of NG (i.e., a nonm<strong>in</strong>imumphase zero of Ḡ) with output direction Ψ ¢ ∈ Nn , then ¯S <strong>and</strong> ¯T satisfy the4 To simplify notation, we elim<strong>in</strong>ate the use of “bars” on the coprime factors of modulatedtransfer matrices.


£124 5. Extensions to Periodic Systemsfollow<strong>in</strong>g <strong>in</strong>terpolation constra<strong>in</strong>tΨ ∗ ¯S(q) = Ψ ∗ , <strong>and</strong> Ψ ∗ ¯T(q) = 0 . (5.13)Other <strong>in</strong>terpolation constra<strong>in</strong>ts hold at plant unstable poles as well as atzeros <strong>and</strong> poles of the controller £c<strong>in</strong> (see Lemma 4.1.1 <strong>in</strong> Chapter 4).It is <strong>in</strong>terest<strong>in</strong>g at this stage to reflect on the nature of the zeros ofN G . From (5.11), we have that N G (q) = Ws−1 Λ −1 (q)N R G (qN )Λ(q)W s <strong>and</strong>,therefore, the nonm<strong>in</strong>imum phase zeros of N G are the N th -roots of thenonm<strong>in</strong>imum phase zeros of N R G . Thus, if qN = r N q e jNθq is a nonm<strong>in</strong>imumphase zero of N R G, then the setZ {r q e jθq , r q e j(θq−θ N) , . . . , r q e j[θq−(N−1)θ N] } , (5.14)is a set of nonm<strong>in</strong>imum phase zeros of N G . We will see <strong>in</strong> the follow<strong>in</strong>gsection, however, that every element <strong>in</strong> Z gives the same <strong>in</strong>formation <strong>in</strong>terms of <strong>in</strong>tegral constra<strong>in</strong>ts on the modulated sensitivity function.5.3 Integral Constra<strong>in</strong>tsWe will use the ideas of §4.4 <strong>in</strong> Chapter 4, applied here to the modulatedsensitivity function. For convenience, we first recall some notation. Considerthe modulated sensitivity function, ¯S, <strong>in</strong> (5.12). We denote by ¯S ik , theelement <strong>in</strong> the i-row <strong>and</strong> k-column of ¯S. If Ψ is a vector ¢ <strong>in</strong> Nn , we denoteits elements by ψ i , i = 1, . . . , Nn, i.e., Ψ = [ψ ∗ 1 , ψ∗ 2 , . . . , ψ∗ Nn ]∗ . Also, weuse the <strong>in</strong>dex set I Ψ {i ∈ : ψ i ≠ 0} as the set of <strong>in</strong>dices of the nonzeroelements of Ψ.Given a set Z k = {z i , i = 1, . . . , n k } of complex numbers £c<strong>in</strong> , we def<strong>in</strong>eits Blaschke product as the functionB k (z) =∏n ki=1z i|z i |z − z i1 − z i z . (5.15)If Z k is empty, we def<strong>in</strong>e B k (z) = 1, ∀z.Next, let Ψ ∈ ¢ Nn , Ψ ≠ 0, be the output direction of a nonm<strong>in</strong>imumphase zero of the plant Ḡ, which thus satisfies (5.13). Assume that ¯S isproper <strong>and</strong> stable, <strong>and</strong> consider the vector function Ψ ∗ ¯S : ¢ → ¢ Nn ,whose Nn elements are proper, stable, scalar rational functions. Pick oneof these elements, say∑Nnρ k (z) ψ ∗ ¯S i ik (z) , (5.16)i=1where k is <strong>in</strong> I Ψ , <strong>and</strong> let B k be the Blaschke product of the zeros of ρ k <strong>in</strong>c, def<strong>in</strong>ed as <strong>in</strong> (5.15). Then, ρk can be factored asρ k (z) = B k (z) ˜ρ k (z) z −δ k, (5.17)


5.3 Integral Constra<strong>in</strong>ts 125where δ k is the relative degree of ρ k , <strong>and</strong> ˜ρ k is stable, m<strong>in</strong>imum phase,<strong>and</strong> has relative degree zero.We then have the follow<strong>in</strong>g result.Theorem 5.3.1 (Poisson Integral for ¯S). Let q = r q e jθq , r q > 1, be a zeroof the plant Ḡ, <strong>and</strong> let Ψ ∈ ¢ Nn , Ψ ≠ 0, be its output direction. Assumethat the modulated sensitivity function, ¯S, is proper <strong>and</strong> stable. Then, foreach <strong>in</strong>dex k <strong>in</strong> I Ψ ,∫1 π2π −π∑Nnlog∣ψ ∗ iψ ∗ i=1 kwhere B k <strong>and</strong> δ k are as <strong>in</strong> (5.17).¯S ik (e jθ r 2 q − 1)∣ 1 − 2r q cos(θ − θ q ) + r 2 qdθ = log |B −1k(q)|+δ k log |q| ,(5.18)Proof. Follows by direct application of Theorem 4.4.1 <strong>in</strong> Chapter 4.□Note that the above result makes use of B k <strong>and</strong> δ k , which are <strong>in</strong> generalnot known unless a design has already been carried out. Regard<strong>in</strong>g the relativedegree δ k , it can be said that it is zero for strictly proper plants, but<strong>in</strong> the general case, all that can be said based on knowledge of the plantis that it will be nonnegative. S<strong>in</strong>ce we are <strong>in</strong>terested <strong>in</strong> fundamental constra<strong>in</strong>tsthat apply regardless of the controller used, we give the follow<strong>in</strong>gcorollary that establishes a constra<strong>in</strong>t that is <strong>in</strong>dependent of the controller.Corollary 5.3.2. Under the conditions of Theorem 4.4.1, the sensitivityfunction ¯S satisfies, for each <strong>in</strong>dex k <strong>in</strong> I Ψ ,∫1 π2π −π∑Nnlog∣ψ ∗ iψ ∗ i=1 k¯S ik (e jθ r 2 q − 1)∣ 1 − 2r q cos(θ − θ q ) + r 2 dθ ≥ 0 . (5.19)qProof. The proof follows from (5.18), on not<strong>in</strong>g that |B −1k(q)| ≥ 1 <strong>and</strong>δ k log |q| ≥ 0.□As discussed at the end of §5.2, each nonm<strong>in</strong>imum phase zero r q e jθqof N G def<strong>in</strong>es the set Z <strong>in</strong> (5.14) of nonm<strong>in</strong>imum phase zeros of N G . Inpr<strong>in</strong>ciple, we should use all of them to compute the <strong>in</strong>tegral constra<strong>in</strong>tsassociated with the system. However, we will show next that every element<strong>in</strong> Z gives the same set of constra<strong>in</strong>ts.Suppose that the zero q = r q e jθq , with output direction Ψ, is used tocompute the <strong>in</strong>tegral constra<strong>in</strong>ts. Then there is an <strong>in</strong>tegral of the form(5.18) correspond<strong>in</strong>g to each ρ k <strong>in</strong> (5.16), such that k <strong>in</strong> I Ψ . Let˜q = r q e j(θq−θ N)


126 5. Extensions to Periodic Systemsbe used to compute another set of constra<strong>in</strong>ts. Because of Proposition 5.1.1,<strong>and</strong> we see thatQN G ( ˜q) = N G (q)Q ,˜Ψ = (Ψ ∗ Q) ∗ (5.20)is an output direction associated with ˜q.Proposition 5.1.1 also shows that Q ¯S(e jθ ) = ¯S(e j(θ+θ N) )Q, <strong>and</strong> thus,us<strong>in</strong>g (5.20),˜Ψ ∗ ¯S(e jθ ) = Ψ ∗ Q ¯S(e jθ )= Ψ ∗ ¯S(e j(θ+θ N) )Q .(5.21)Consider any element ρ ˜k , ˜k <strong>in</strong> I ˜Ψ , of ˜Ψ ∗ ¯S, i.e., ρ ˜k (z) ∑ Nn ˜ψ i=1 ∗ ¯S i i ˜k (z). Itthen follows us<strong>in</strong>g (5.21) thatρ ˜k (ejθ ) = ρ k (e j(θ+θ N) ) ,where k = ˜k − n if n < ˜k ≤ Nn, <strong>and</strong> k = (N − 1)n + ˜k if 1 ≤ ˜k ≤ n.It is now a matter of a simple computation to show that the <strong>in</strong>tegralconstra<strong>in</strong>t of the form (5.18) correspond<strong>in</strong>g to ρ ˜k <strong>and</strong> the zero ˜q is the sameas that correspond<strong>in</strong>g to ρ k <strong>and</strong> the zero q. Furthermore, the relationshipbetween k <strong>and</strong> ˜k is clearly bijective <strong>and</strong>, hence, every one of the <strong>in</strong>tegralsobta<strong>in</strong>ed from the zero ˜q can be obta<strong>in</strong>ed from the zero q. The converseis also true. The same also happens between the <strong>in</strong>tegrals associated withthe zeros r q e j(θq−θN) <strong>and</strong> r q e j(θq−2θN) , <strong>and</strong> so on. In conclusion, only oneof the zeros is needed to compute the <strong>in</strong>tegral constra<strong>in</strong>ts.An alternative to compute the <strong>in</strong>tegral constra<strong>in</strong>ts on ¯S for only one ofthe zeros <strong>in</strong> the set Z is to compute the <strong>in</strong>tegral constra<strong>in</strong>ts on the firstn columns of ¯S but for all the zeros <strong>in</strong> Z. This also follows from Proposition5.1.1. The importance of this fact is that, unlike time-<strong>in</strong>variant systems,fundamental limitations of periodic feedback control systems expressed<strong>in</strong> terms of <strong>in</strong>tegral constra<strong>in</strong>ts <strong>in</strong>volve different terms, each oneof them associated with how one s<strong>in</strong>gle frequency of the output is affectedby multiple frequencies of the <strong>in</strong>put, or how multiple frequencies of theoutput are affected by a s<strong>in</strong>gle frequency of the <strong>in</strong>put.F<strong>in</strong>ally, we po<strong>in</strong>t out that similar results can be obta<strong>in</strong>ed for the modulatedcomplementary sensitivity function.5.4 Design InterpretationsIn this section we use the <strong>in</strong>tegral constra<strong>in</strong>ts developed above to exploretwo aspects of the performance of l<strong>in</strong>ear periodic feedback systems. Inparticular, we apply the <strong>in</strong>tegrals to the problem of control of periodicsystems where it is required to have a time-<strong>in</strong>variant map for the closedloop, <strong>and</strong> to the problem of periodic control of a LTI system.


5.4 Design Interpretations 1275.4.1 Time-Invariant Map as a Design ObjectiveWe have remarked earlier (see the footnote on page 122) that a time-<strong>in</strong>variantsystem has a diagonal modulated transfer matrix. Hence, there isa clear connection between hav<strong>in</strong>g a time-<strong>in</strong>variant system as a designobjective <strong>and</strong> achiev<strong>in</strong>g diagonal decoupl<strong>in</strong>g for a MIMO time-<strong>in</strong>variantsystem. The latter problem was studied <strong>in</strong> §4.3.4 of Chapter 4, where itwas shown that there is usually a sensitivity cost associated with diagonaldecoupl<strong>in</strong>g. We will see next that the same is true for the problem ofhav<strong>in</strong>g a time-<strong>in</strong>variant closed loop for periodic systems.Consider the periodic feedback system of Figure 5.1, which, for simplicity,is assumed to be a SISO system. Assume that, <strong>in</strong>ter-alia, the controlobjective is to achieve a time-<strong>in</strong>variant map from the reference to the output.This implies that, <strong>in</strong> the frequency doma<strong>in</strong> raised system, ¯T(e jθ ) <strong>and</strong>¯S(e jθ ) are required to be block diagonal. We then have the follow<strong>in</strong>g result.Theorem 5.4.1. Let q = r q e jθq , r q > 1, be a zero of the plant Ḡ, <strong>and</strong> let Ψ ∈¢ Nn , Ψ ≠ 0, be its output direction. Assume that the modulated sensitivityfunction, ¯S, is proper <strong>and</strong> stable, <strong>and</strong> such that ¯S(e jθ ) is diagonal. Then, foreach <strong>in</strong>dex k <strong>in</strong> I Ψ ,∫1 π2π −πor equivalentlylog | ¯S kk (e jθ r 2 q − 1)|1 − 2r q cos(θ − θ q ) + r 2 dθ ≥ 0 , (5.22)q∫1 πlog | ¯S 11 (e jθ r 2 q − 1)|2π −π1 − 2r q cos(θ − θ q + (k − 1)θ N ) + r 2 dθ ≥ 0 ,q(5.23)for each <strong>in</strong>dex k <strong>in</strong> I Ψ .Proof. The constra<strong>in</strong>t (5.22) is immediate from (5.19), on not<strong>in</strong>g that, if¯S(e jθ ) is diagonal, then ¯S ik (e jθ ) = 0 for i ≠ k.For the constra<strong>in</strong>t (5.23), note that ¯S kk (e jθ ) = ¯S 11 (e j(θ−(k−1)θ N) ), whichfollows from Proposition 5.1.1. We then obta<strong>in</strong>, us<strong>in</strong>g (5.22), that∫1 π2π −πlog | ¯S 11 (e j(θ−(k−1)θN) r 2 q − 1)|1 − 2r q cos(θ − θ q ) + r 2 dθ ≥ 0 .qMak<strong>in</strong>g the change of variables φ = θ−(k−1)θ N , <strong>and</strong> <strong>in</strong>vok<strong>in</strong>g periodicityof the <strong>in</strong>tegr<strong>and</strong>, we f<strong>in</strong>ally arrive at the constra<strong>in</strong>t (5.23).□Note that ¯S 11 (e jθ ) is the frequency response of the error to the referencesignal, i.e., E(e jθ ) = ¯S 11 (e jθ )R(e jθ ), <strong>and</strong> there are no terms <strong>in</strong>volv<strong>in</strong>gshifted functions of R(e jθ ) because it is assumed that the closed loop hasa time-<strong>in</strong>variant <strong>in</strong>put-output map.


128 5. Extensions to Periodic SystemsThe <strong>in</strong>equality (5.23) implies that ¯S 11 satisfies as many <strong>in</strong>tegral constra<strong>in</strong>tsas there are nonzero elements of the zero direction Ψ (i.e., for eachk <strong>in</strong> I Ψ ). Moreover, <strong>in</strong> each of this <strong>in</strong>tegrals, the weight<strong>in</strong>g functionr 2 q − 1W q (θ + (k − 1)θ N ) 1 − 2r q cos(θ − θ q + (k − 1)θ N ) + r 2 q(5.24)is shifted <strong>in</strong> angle for different values of k, <strong>and</strong> thus, their correspond<strong>in</strong>gmaximum values are located at different angles.The impact of this constra<strong>in</strong>t is more evident if we assume also that| ¯S 11 (e jθ )| is <strong>in</strong>tended to be reduced <strong>in</strong> some set Θ 1 ⊂ (−π, π), i.e.,| ¯S 11 (e jθ )| ≤ α < 1 , ∀θ ∈ Θ 1 [−θ 1 , θ 1 ] , θ 1 < π . (5.25)Let the weighted length of the <strong>in</strong>terval Θ 1 be def<strong>in</strong>ed, for each k <strong>in</strong> I Ψ , as(cf. (3.50) <strong>in</strong> Chapter 3)Θ k q(θ 1 ) Then, from (5.23), we obta<strong>in</strong>∫ θ1−θ 1W q (θ + (k − 1)θ N ) dθ .[2π − Θ k q(θ 1 )] log ‖ ¯S 11 ‖ ∞ + Θ k q(θ 1 ) log α ≥ 0 , ∀k ∈ I Ψ ,which is equivalent to‖ ¯S 11 ‖ ∞ ≥( 1α) Θk q (θ 1)2π−Θ k q (θ 1), ∀k ∈ IΨ . (5.26)Note that the value of Θ k q(θ 1 ) (for some k ∈ I Ψ ) will, <strong>in</strong> general, be largeif the frequency range Θ 1 where sensitivity reduction is required <strong>in</strong>cludesthe maximum value of the correspond<strong>in</strong>g weight<strong>in</strong>g function W q <strong>in</strong> (5.24).Therefore, <strong>in</strong> view of (5.26), peaks <strong>in</strong> | ¯S 11 (e jθ )| are more likely to be largewhen sensitivity reduction is required over ranges that <strong>in</strong>clude the maximumvalue of some of the different weight<strong>in</strong>g functions.We then note that this situation is, <strong>in</strong> general, worse for periodic systemsthan it is for time-<strong>in</strong>variant systems. Indeed, <strong>in</strong> the latter case, we needconsider only one weight<strong>in</strong>g function for each nonm<strong>in</strong>imum phase zerorather than as many as there are nonzero elements <strong>in</strong> the zero direction.We also note that the situation becomes potentially worse as the numberof elements <strong>in</strong> I Ψ <strong>in</strong>creases or as the dimension of the null space associatedwith the zero grows (see §4.3.4 <strong>in</strong> Chapter 4).We conclude from the previous analysis that, for periodic systems, theproblem of sensitivity reduction with the additional requirement that theclosed-loop map from reference to output be time-<strong>in</strong>variant is very likelyto result, <strong>in</strong> general, <strong>in</strong> large peaks for | ¯S(e jθ )|.


5.4 Design Interpretations 129As an illustration of the above results we study a simple situation whereit will turn out that there is no extra sensitivity cost associated with hav<strong>in</strong>ga time-<strong>in</strong>variant closed-loop map.Example 5.4.1. Consider a SISO plant described byx k+1 = Ax k + B k u k ,y k = Cx k + D k u k ,(5.27)where A <strong>and</strong> C are constant matrices, <strong>and</strong> {B k } <strong>and</strong> {D k } are N-periodicsequences. Assume further that the very special condition holds that thereexists a periodic scalar ga<strong>in</strong> K k ≠ 0, ∀k, such thatB k K k = B , <strong>and</strong> D k K k = D ,where B <strong>and</strong> D are constant. It is then clear that one can achieve time<strong>in</strong>varianceby simply redef<strong>in</strong><strong>in</strong>g the <strong>in</strong>put. We will use our previous analysisto show that, <strong>in</strong> this particular case, there is no penalty <strong>in</strong> achiev<strong>in</strong>gtime-<strong>in</strong>variance for the closed-loop system.Let Ḡ be as <strong>in</strong> (5.7), i.e., Ḡ is the modulated transfer matrix associatedwith the plant (5.27). Similarly, let ¯K be the modulated transfer matrix associatedwith the periodic ga<strong>in</strong> K k . Us<strong>in</strong>g the relationship (5.9) betweentime <strong>and</strong> frequency doma<strong>in</strong> rais<strong>in</strong>g, it is easy to see that ¯K is given by 5⎡K 0 0 . . . 0¯K = W −1s⎤0 K 1 . . ..⎢⎣.. .. . ... ⎥⎦ W s ,0 . . . . . . K N−1<strong>and</strong> we see that ¯K is nons<strong>in</strong>gular s<strong>in</strong>ce K k ≠ 0 for all k.Let N G D −1Gbe a right coprime factorization of Ḡ. We will show that theoutput zero directions of N G are canonical, i.e., for each zero q of N G , thereis a basis for the left null space of N G (q) constructed of vectors hav<strong>in</strong>gonly one nonzero element.We first note that there are no unstable pole-zero cancelations <strong>in</strong> Ḡ ¯K,<strong>and</strong>, clearly N G ( ¯K −1 D G ) −1 is a right coprime factor of Ḡ ¯K. Moreover,s<strong>in</strong>ce K k was chosen so that B k K k <strong>and</strong> D k K k are constant, it is clear thatḠ ¯K is diagonal. Because of this, it is possible to build a right coprime fraction¯N ¯D −1 = Ḡ ¯K where ¯N <strong>and</strong> ¯D are diagonal. We then conclude that 6N G = ¯NR , (5.28)5 The time doma<strong>in</strong> raised transfer matrix correspond<strong>in</strong>g to the scalar ga<strong>in</strong> K k is a diagonalmatrix hav<strong>in</strong>g the elements K 0 , K 1 , . . . , K N−1 <strong>in</strong> its diagonal (see Feuer <strong>and</strong> Goodw<strong>in</strong>(1996)).6 This is because any two coprime factorizations are related by units of the r<strong>in</strong>g of proper<strong>and</strong> stable transfer matrices, i.e., biproper <strong>and</strong> bistable matrices (Vidyasagar, 1985, Theorem4.1.43, p.75).


130 5. Extensions to Periodic Systemswhere R is biproper <strong>and</strong> bistable.S<strong>in</strong>ce ¯N is diagonal, it is straightforward to build a canonical basis forthe left null space of ¯N(q) <strong>and</strong>, because of (5.28), this basis will also be acanonical basis for the left null space of N G (q).Let Ψ be a vector <strong>in</strong> the left null space of N G (q); it follows that Ψ iscanonical, i.e., I Ψ has only one nonzero element. Hence, there is only oneweight<strong>in</strong>g function of the form (5.24), which implies that there is only one<strong>in</strong>tegral constra<strong>in</strong>t (5.23) associated with the zero with direction Ψ. Thissituation is no worse than that correspond<strong>in</strong>g to a time-<strong>in</strong>variant system.◦5.4.2 Periodic <strong>Control</strong> of Time-<strong>in</strong>variant PlantA question that has been the subject of an ongo<strong>in</strong>g discussion is whetheror not it is advantageous to use l<strong>in</strong>ear time-vary<strong>in</strong>g controllers (e.g., periodiccontrollers) for time-<strong>in</strong>variant plants. From some po<strong>in</strong>ts of view,periodic control of time-<strong>in</strong>variant plants seems to yield improved performanceover time-<strong>in</strong>variant control, whilst from other po<strong>in</strong>ts of view thereappear to be <strong>in</strong>herent disadvantages. 7 We give here a particular view ofthis problem based on the <strong>in</strong>tegral constra<strong>in</strong>ts derived <strong>in</strong> §5.3.Let G be the (usual) transfer function of the time-<strong>in</strong>variant plant, <strong>and</strong> letḠ be the modulated transfer matrix correspond<strong>in</strong>g to G, based on a periodof N, i.e., assum<strong>in</strong>g that an N-periodic controller will be used. Because ofthe time-<strong>in</strong>variant characteristic of the plant, Ḡ will have the follow<strong>in</strong>gstructure⎡⎤G(z) 0 . . . 0Ḡ(z) =0 G(ze −jθ N) . . ..⎢⎣.. .. . .. .⎥⎦ .0 . . . . . . G(ze −j(N−1)θ N)Then, if ˜d −1g ñ g <strong>and</strong> n g d −1g are left <strong>and</strong> right coprime factorizations 8 ofthe plant G, respectively, we can construct coprime factorizations of Ḡ =˜D −1G ÑG = N G D −1G, where the different factors conta<strong>in</strong> shifted versions of˜d g , ñ g , n g , <strong>and</strong> d g . For example, N G <strong>and</strong> ˜D G can be constructed as⎡⎤n g (z) 0 . . . 0N G (z) =0 n g (ze −jθ N) . . ..⎢⎣.. .. . ... ..⎥⎦ , (5.29)0 . . . . . . n g (ze −j(N−1)θ N)7 See section on notes <strong>and</strong> references at the end of the chapter.8 We use here lower case letters to avoid confusion with the factorizations of the modulatedmatrices.


5.4 Design Interpretations 131<strong>and</strong>⎡⎤˜d g (z) 0 . . . 0˜D G (z) =0 ˜d g (ze −jθ N) . . ..⎢⎣.. .. . ... ⎥⎦ . (5.30)0 . . . . . . ˜d g (ze −j(N−1)θ N)Let q = r q e jθq be a nonm<strong>in</strong>imum phase zero of n g , with output directionΨ ¢ ∈ Nn . Then every element <strong>in</strong> Z def<strong>in</strong>ed <strong>in</strong> the set (5.14) is anonm<strong>in</strong>imum phase zero of N G <strong>in</strong> (5.29). However, we know from §5.3that only one of the zeros <strong>in</strong> this set is needed to obta<strong>in</strong> the associated <strong>in</strong>tegralconstra<strong>in</strong>ts, because the rest lead to the same <strong>in</strong>tegrals. We will usethe zero q <strong>and</strong> its correspond<strong>in</strong>g direction Ψ. We observe that, a particularfeature of the modulated representation of the plant is that the direction,¯Ψ say, of q as a zero of N G <strong>in</strong> (5.29) has the form¯Ψ ∗ = [Ψ ∗ , 0, . . . , 0] . (5.31)We next study the structure of the modulated sensitivity function. Firstnote that, under the assumption of <strong>in</strong>ternal stability of the closed loop,the modulated form of the controller has a left coprime factorization (lcf)satisfy<strong>in</strong>g ˜D G D K + Ñ G N K = I (Vidyasagar, 1985). Hence, from (5.12), themodulated sensitivity is simply ¯S = D K ˜D G . Thus, splitt<strong>in</strong>g D K <strong>in</strong>to blocksof appropriate dimensions, <strong>and</strong> us<strong>in</strong>g the expression for ˜D G <strong>in</strong> (5.30), weobta<strong>in</strong> that ¯S(z) =⎡⎤D (11)K (z) ˜d g (z) D (12)K (z) ˜d g (ze −jθ N) . . . D (1N)K (z) ˜d g (ze −j(N−1)θ N)D (21)K (z) ˜d g (z) D (22)K (z) ˜d g (ze −jθ N) . . ..⎢.⎣ ... . ... .⎥⎦D (N1)K (z) ˜d g (z) . . . . . . D (NN)K(z) ˜d g (ze −j(N−1)θ N)Accord<strong>in</strong>g to Theorem 5.3.1, we have only to consider those columns of ¯Scorrespond<strong>in</strong>g to nonzero entries of ¯Ψ. Then it is clear from the structureof ¯Ψ given <strong>in</strong> (5.31) that the problem is equivalent to the multivariable<strong>in</strong>tegral constra<strong>in</strong>ts for ¯S 11 = D (11)K˜d g . This, <strong>in</strong> turn, generates the same<strong>in</strong>tegral constra<strong>in</strong>ts as if one uses a time-<strong>in</strong>variant controller hav<strong>in</strong>g D (11)Kas the denom<strong>in</strong>ator of a lcf. Thus, the use of periodic control is no lessconstra<strong>in</strong>ed than the use of time-<strong>in</strong>variant control.In conclusion, we see that periodic controllers offer no apparent advantageover time-<strong>in</strong>variant ones from the po<strong>in</strong>t of view of fundamental <strong>in</strong>tegralconstra<strong>in</strong>ts that must be satisfied by the sensitivity function. Weremark that similar conclusions hold, mutatis-mut<strong>and</strong>is, for the complementarysensitivity function.


132 5. Extensions to Periodic Systems5.5 SummaryThis chapter has studied Poisson <strong>in</strong>tegral constra<strong>in</strong>ts for discrete periodicfeedback systems. The <strong>in</strong>tegrals have been developed on a modulated representationof the sensitivity function. This modulated representation associatesa transfer function with periodic systems, which can then be dealtwith as one would a MIMO time-<strong>in</strong>variant system. One has to pay particularattention, however, to the structure of these modulated matrices <strong>and</strong>the nature of its zeros <strong>and</strong> poles.Us<strong>in</strong>g the resultant <strong>in</strong>tegral constra<strong>in</strong>ts, it is possible to analyze designlimitations <strong>in</strong>herent to l<strong>in</strong>ear, periodically time-vary<strong>in</strong>g systems. In particular,we have shown that there is generally an additional cost associatedwith hav<strong>in</strong>g a time-<strong>in</strong>variant target closed loop for a periodic open-loopplant. It was also shown that nonm<strong>in</strong>imum phase zeros <strong>and</strong>/or unstablepoles of a discrete LTI plant cont<strong>in</strong>ue to impose design limitations even ifa periodic time-vary<strong>in</strong>g controller is used.Notes <strong>and</strong> ReferencesThis chapter is ma<strong>in</strong>ly based on Goodw<strong>in</strong> <strong>and</strong> Gómez (1995).Frequency Doma<strong>in</strong> Rais<strong>in</strong>gThe orig<strong>in</strong>s of this tool can be related to early work of Zadeh (1950), who gave afrequency doma<strong>in</strong> description of general time-vary<strong>in</strong>g systems. The modulationrepresentation has been extensively used <strong>in</strong> the signal process<strong>in</strong>g literature, seee.g., Shenoy et al. (1994), Vetterli (1987) <strong>and</strong> Vetterli (1989).The modulated transfer matrix, when evaluated along the unit circle, is sometimesreferred to as the alias component matrix (Smith <strong>and</strong> Barnwell, 1987; Ramstad,1984; Vetterli, 1989).Time Doma<strong>in</strong> Rais<strong>in</strong>gFor a description of time doma<strong>in</strong> rais<strong>in</strong>g <strong>and</strong> its utility see e.g., Khargonekar et al.(1985), Meyer (1990), Ravi et al. (1990) <strong>and</strong> Feuer <strong>and</strong> Goodw<strong>in</strong> (1996). This latterreference establishes the relation between time <strong>and</strong> frequency doma<strong>in</strong> rais<strong>in</strong>g.Transform TechniquesFor a more detailed exposition of Fourier techniques see e.g., Feuer <strong>and</strong> Goodw<strong>in</strong>(1996). The double-sided Z-transform <strong>and</strong> its properties is studied, for example, <strong>in</strong>Frankl<strong>in</strong> et al. (1990).


5.5 Summary 133Periodic <strong>Control</strong> of LTI SystemsKhargonekar et al. (1985) argue that, for an important class of robustness problems,discrete periodic compensators are superior to time-<strong>in</strong>variant ones. Thissuperiority is expla<strong>in</strong>ed <strong>in</strong> terms of improved ga<strong>in</strong> <strong>and</strong> phase marg<strong>in</strong>s. Similarideas are presented <strong>in</strong> Lee et al. (1987) for cont<strong>in</strong>uous-time systems, <strong>and</strong> <strong>in</strong> Francis<strong>and</strong> Georgiou (1988) for sampled-data systems. On the other h<strong>and</strong>, Khargonekaret al. (1985) showed that time-vary<strong>in</strong>g controllers offer no advantage over time<strong>in</strong>variantones <strong>in</strong> the problem of weighted sensitivity m<strong>in</strong>imization. Furthermore,<strong>in</strong> Shamma <strong>and</strong> Dahleh (1991) it is argued that time-vary<strong>in</strong>g compensation doesnot improve optimal rejection of persistent bounded disturbances, <strong>and</strong> also it doesnot help <strong>in</strong> the bounded-<strong>in</strong>put bounded-output robust stabilization of time-<strong>in</strong>variantplants with unstructured uncerta<strong>in</strong>ty.An analysis of the use of periodic controllers, based on frequency doma<strong>in</strong> arguments,has been given by Goodw<strong>in</strong> <strong>and</strong> Feuer (1992) <strong>and</strong> Feuer (1993). Theseworks showed that the use of periodic control faces two problems: <strong>in</strong>herent presenceof high frequency components, <strong>and</strong> sensitivity to high frequency system uncerta<strong>in</strong>ty.


6Extensions to Sampled-DataSystemsThis chapter deals with fundamental limitations for sampled-data (SD)feedback systems. By the term SD, we refer to a system with both cont<strong>in</strong>uous-time<strong>and</strong> discrete-time signals — as is the case of digital control of ananalogue plant — but which is studied <strong>in</strong> cont<strong>in</strong>uous-time. This contrastswith the approach taken <strong>in</strong> §3.4 <strong>in</strong> Chapter 3, where we were concernedonly with the sampled behavior of the system. In this chapter, the full <strong>in</strong>tersamplebehavior will be taken <strong>in</strong>to account.Unfortunately, the discrete results <strong>in</strong> Chapter 3 are <strong>in</strong>sufficient to describefundamental limitations <strong>in</strong> SD systems, s<strong>in</strong>ce good sampled behavioris clearly necessary but not sufficient for good overall behavior.Because a SD system is <strong>in</strong>tr<strong>in</strong>sically time-vary<strong>in</strong>g due to the sampl<strong>in</strong>gprocess, one cannot use transfer functions to describe its <strong>in</strong>put-outputproperties. However, it is possible to calculate the Laplace transform ofthe response of a SD system to a particular <strong>in</strong>put, <strong>and</strong> hence one may evaluatethe steady-state response of a stable SD system to a s<strong>in</strong>usoidal <strong>in</strong>putof given frequency. For analogue systems, the response to such an <strong>in</strong>putis a s<strong>in</strong>usoid of the same frequency as the <strong>in</strong>put, but with amplitude <strong>and</strong>phase modified accord<strong>in</strong>g to the transfer function of the system evaluatedat the <strong>in</strong>put frequency (see §2.1.3 <strong>in</strong> Chapter 2). The response of a stableSD system to an <strong>in</strong>put s<strong>in</strong>usoidal signal, on the other h<strong>and</strong>, consists of asum of <strong>in</strong>f<strong>in</strong>itely many s<strong>in</strong>usoids spaced at <strong>in</strong>teger multiples of the sampl<strong>in</strong>gfrequency away from the frequency of the <strong>in</strong>put. We will refer tothe component hav<strong>in</strong>g the same frequency as the <strong>in</strong>put as the fundamental,<strong>and</strong> the other components as the harmonics. In fact, the fundamentalcorresponds to the first harmonic, which will be predom<strong>in</strong>ant <strong>in</strong> most ap-


136 6. Extensions to Sampled-Data Systemsplications. This is because higher frequency components will normally besuppressed <strong>in</strong> a well-designed SD system.The fundamental, <strong>and</strong> each of the harmonics, is governed by what weterm the frequency response function, which has many properties similar tothose of a transfer function. In particular, these frequency response functionshave sufficient structure to allow complex analysis to be applied toderive a set of formulae analogous to the Bode <strong>and</strong> Poisson <strong>in</strong>tegrals. As <strong>in</strong>the LTI case, these <strong>in</strong>tegrals describe trade-offs between system properties<strong>in</strong> different frequency ranges. Our analysis <strong>in</strong> this chapter concentrates onthe fundamental components of the SD response, but similar results maybe derived for the harmonics (Braslavsky, 1995).6.1 Prelim<strong>in</strong>aries.6.1.1 Signals <strong>and</strong> SystemWe consider the SISO SD feedback system shown <strong>in</strong> Figure 6.1, where G<strong>and</strong> F are the transfer functions of the plant <strong>and</strong> anti-alias<strong>in</strong>g filter, K d isthe digital controller, <strong>and</strong> S τ <strong>and</strong> H represent the sampler <strong>and</strong> hold devicerespectively.The plant <strong>and</strong> controller are assumed to be proper, <strong>and</strong> the filter strictlyproper <strong>and</strong> stable, 1 <strong>and</strong> they are all free of unstable hidden modes. Theplant may <strong>in</strong>clude a pure time delay, denoted by τ G ≥ 0. The relativedegree of the controller is denoted by RD K d .The signals <strong>in</strong> Figure 6.1 are as follows:r : reference <strong>in</strong>put,y : system output,d : output disturbancew : measurement noise,{u k } : discrete control sequence,u : analogue control <strong>in</strong>put,v : analogue output of the filter,{v k } : sampled output of the filter.Cont<strong>in</strong>uous-time signals are assumed scalar functions from [0, ∞) to ¡ ,<strong>and</strong> discrete sequences are def<strong>in</strong>ed for k = 0, 1, 2, . . ., tak<strong>in</strong>g values on ¡ .A class of signals that will be quite useful for our purposes is connectedwith the class of functions of bounded variation.Def<strong>in</strong>ition 6.1.1 (Function of Bounded Variation). A function h, def<strong>in</strong>edon the closed real <strong>in</strong>terval [a, b], is of bounded variation if the total variation1 The assumption that the filter is strictly proper is st<strong>and</strong>ard <strong>and</strong> guarantees that thesampl<strong>in</strong>g operation is well def<strong>in</strong>ed (cf. Sivashankar <strong>and</strong> Khargonekar, 1993; Dullerud <strong>and</strong>Glover, 1993). The assumption of stability is only made for simplicity of exposition <strong>and</strong> maybe removed.


6.1 Prelim<strong>in</strong>aries. 137d❜r.yFS τ{u❜ ✲ ❤ ✲ ❜ ❜ ✲ ✲ ❤❄ k }✲uK d H✲ G❜−✻❄ w❤✛✲❜of h on [a, b],FIGURE 6.1. SD control system.V h (a, b) supn∑|h(t k ) − h(t k−1 )| (6.1)k=1is f<strong>in</strong>ite. The supremum here is taken over every n ∈ <strong>and</strong> every partitionof the <strong>in</strong>terval [a, b] <strong>in</strong>to sub<strong>in</strong>tervals {t k , t k−1 }, where k = 1, 2, . . . n, <strong>and</strong>a = t 0 < t 1 < · · · < t n = b.A function h def<strong>in</strong>ed on [0, ∞) is of uniform bounded variation (UBV)(Braslavsky et al., 1995a) if, given δ > 0, the total variation V h (t, t + δ)on <strong>in</strong>tervals [t, t + δ] of length δ is uniformly bounded, that is, ifsup V h (t, t + δ) < ∞ . (6.2)t∈[0,∞)A function of BV is not necessarily cont<strong>in</strong>uous, but it is differentiablealmost everywhere. Moreover, the limitsh(t ± ) limɛ↓0h(t ± ɛ), ɛ > 0 (6.3)are well def<strong>in</strong>ed at every t, which means that h can have discont<strong>in</strong>uitiesof at most the “f<strong>in</strong>ite-jump” type. This is particularly appropriated for signals<strong>in</strong>volved <strong>in</strong> sampl<strong>in</strong>g operations, as we will see.We will assume throughout that the <strong>in</strong>put signals r, d, <strong>and</strong> w, are suchthat, when multiplied by some exponentially decay<strong>in</strong>g term e −σt , are functionsof UBV. It is straightforward to verify that steps, ramps, s<strong>in</strong>usoids<strong>and</strong> exponentials are all signals satisfy<strong>in</strong>g this condition. However, signalslike s<strong>in</strong>(e t2 ), <strong>and</strong> signals that conta<strong>in</strong> impulses are excluded.◦6.1.2 Sampler, Hold <strong>and</strong> Discretized SystemThe implementation of a controller for a cont<strong>in</strong>uous-time system by meansof a digital device, such as a computer, implies the process of sampl<strong>in</strong>g<strong>and</strong> reconstruction of analogue signals. By the sampl<strong>in</strong>g process, an analoguesignal is converted <strong>in</strong>to a sequence of numbers that can then be digitallymanipulated. The hold device performs the <strong>in</strong>verse operation, translat<strong>in</strong>gthe output of the digital controller <strong>in</strong>to a cont<strong>in</strong>uous-time signal. We


138 6. Extensions to Sampled-Data Systemswill assume throughout that nonl<strong>in</strong>earities associated with the process ofdiscretization, such as f<strong>in</strong>ite memory word-length, quantization, etc., haveno significant effect on the sampled-data system.We assume also that the sampl<strong>in</strong>g is regular, i.e., if τ is the sampl<strong>in</strong>gperiod, sampl<strong>in</strong>g is performed at <strong>in</strong>stants t = kτ, with k = 0, ±1, ±2, . . ..Associated with τ, we def<strong>in</strong>e the sampl<strong>in</strong>g frequency ω s = 2π/τ. By ω N =ω s /2 we denote the Nyquist frequency, <strong>and</strong> by Ω N the Nyquist range offrequencies [−ω N , ω N ].We consider an idealized model of the sampler. If v is a cont<strong>in</strong>uous-timesignal, we def<strong>in</strong>e the sampl<strong>in</strong>g operation with period τ byS τ v = {v k } ∞ k=0,where the sequence {v k } ∞ k=0 represents the sampled version of v, with v k =v(kτ + ), <strong>and</strong> k ∈ 0 . The z-transform operator is denoted by Z, i.e.,Z{u k } ∞∑u k z −k ,k=0<strong>and</strong> the Laplace transform operator is denoted by L, i.e., Lu = U, whereU(s) =∫ ∞0e −st u(t)dt .The hold device, H, is a generalized SD hold function (GSHF) a la Kabamba(1987), def<strong>in</strong>ed byu(t) = h(t − kτ)u k , kτ ≤ t < (k + 1)τ, k ∈ 0 . (6.4)The function h <strong>in</strong> the above def<strong>in</strong>ition, which characterizes the hold, isassumed to be of bounded variation, <strong>and</strong> with support on the <strong>in</strong>terval[0, τ]. In particular, the choice h(t) = 1 for t ∈ [0, T] yields the zero orderhold (ZOH). By consider<strong>in</strong>g GSHFs we extend the scope of our analysisto a wide class of SD control schemes that do not necessarily rely on ZOHreconstruction.Associated with the hold we def<strong>in</strong>e its frequency response function byH = Lh. S<strong>in</strong>ce h is supported on a f<strong>in</strong>ite <strong>in</strong>terval, it follows that H is anentire function, i.e., it has no s<strong>in</strong>gularities at any f<strong>in</strong>ite s <strong>in</strong> ¢ . For example,<strong>in</strong> the case of the ZOH we get the familiar response H(s) = (1 − e −sτ )/s.Frequency responses for other types of hold functions are derived <strong>in</strong> Middleton<strong>and</strong> Freudenberg (1995).The frequency response of the hold is useful <strong>in</strong> comput<strong>in</strong>g the Laplacetransform of the output of the hold device.Lemma 6.1.1. Consider a hold device as def<strong>in</strong>ed <strong>in</strong> (6.4) <strong>and</strong> its frequencyresponse function H. Let U d denote the z-transform of the <strong>in</strong>put sequence


6.1 Prelim<strong>in</strong>aries. 139{u k }, <strong>and</strong> U the Laplace transform of the hold output to this sequence.ThenU(s) = H(s) U d (e sτ ) . (6.5)Proof. Us<strong>in</strong>g (6.4), we can express the output of the hold device to {u k } asu(t) =∞∑h(t − kτ) u k(1 ( t − (k + 1) ) − 1 ( t − kτ )) , (6.6)k=0where 1 denotes the unit step function 1(t) = 1 if t ∈ [0, τ), <strong>and</strong> 0 otherwise.Hence,U(s) =∫ ∞0e −sτ u(t) dt=∞∑k=0∫ (k+1)τkτe −sτ h(t − kτ) u k dt∞∑∫ τ= e −skτ u k e −st h(t) dt .k=00The result then follows from the def<strong>in</strong>ition of the frequency response functionof the hold, H, <strong>and</strong> the def<strong>in</strong>ition of the z-transform.□We denote by (FGH) d the discretized plant, def<strong>in</strong>ed as(FGH) d ZS τ L −1 FGH,where L −1 FGH denotes the <strong>in</strong>verse Laplace transform of FGH. The assumptionson G, H <strong>and</strong> F stated above are sufficient to guarantee (e.g.,Freudenberg et al., 1995; Braslavsky et al., 1995a) that the discretized plant(FGH) d satisfies the well-known formula 2(FGH) d (e jωτ ) = 1 τ∞∑H k (jω)G k (jω)F k (jω), (6.7)k=−∞where the notation Y k (·) represents Y(· + jkω s ), with k an <strong>in</strong>teger, <strong>and</strong>ω s = 2π/τ. This notation will be frequently used <strong>in</strong> the sequel.In relation to (FGH) d , we also recall the def<strong>in</strong>itions of the discrete sensitivity<strong>and</strong> complementary sensitivity functions, here given byS d =11 + K d (FGH) d<strong>and</strong> T d = K d(FGH) d1 + K d (FGH) d.2 Sometimes called the impulse modulation formula (e.g., Araki et al., 1993), this identity isclosely related to the Poisson summation formula (e.g., Rud<strong>in</strong>, 1987).


140 6. Extensions to Sampled-Data SystemsAn important feature of SD systems is evident from (6.7), namely, the responseof the discretized plant at a frequency ω ∈ Ω N depends upon theresponse of the analogue plant, filter, <strong>and</strong> hold function at an <strong>in</strong>f<strong>in</strong>ite numberof frequencies. Indeed, it is well known that the steady-state responseof a stable SD system to a s<strong>in</strong>usoidal <strong>in</strong>put consists of a fundamental component<strong>and</strong> <strong>in</strong>f<strong>in</strong>itely many aliases shifted by multiples of the sampl<strong>in</strong>gfrequency. As we will see <strong>in</strong> §6.2, analogous expressions can also be obta<strong>in</strong>edfor the response to more general <strong>in</strong>puts.6.1.3 Closed-loop StabilityAs with the case of a ZOH, closed-loop stability is guaranteed by the assumptionsthat sampl<strong>in</strong>g is nonpathological <strong>and</strong> that the discretized systemis closed-loop stable. The next lemma is a generalization of the wellknownresult of Kalman et al. (1963) to the case of GSHFs (Middleton <strong>and</strong>Freudenberg, 1995). 3Lemma 6.1.2 (Nonpathological Sampl<strong>in</strong>g). Suppose that G <strong>and</strong> F are asdef<strong>in</strong>ed <strong>in</strong> §6.1.1 <strong>and</strong> assume further that(i) if p i <strong>and</strong> p k are CRHP poles of G, thenp i ≠ p k + jnω s , n = ±1, ±2, · · · (6.8)(ii) if p i is a CRHP pole of G, then H(s) has no zeros at s = p i .Then the discretized plant (6.7) is free of unstable hidden modes.S<strong>in</strong>ce the response of a GSHF may have zeros <strong>in</strong> ¢ +, Lemma 6.1.2 says,<strong>in</strong> particular, that it may be necessary to ensure that none of these zerosco<strong>in</strong>cides with an unstable pole of the analogue plant. 4 Under the nonpathologicalsampl<strong>in</strong>g hypothesis, it is straightforward to extend the exponential<strong>and</strong> L 2 <strong>in</strong>put-output stability results of Francis <strong>and</strong> Georgiou(1988) <strong>and</strong> Chen <strong>and</strong> Francis (1991) to the case of GSHF.Lemma 6.1.3. Suppose that G, F, K d , <strong>and</strong> H are as def<strong>in</strong>ed <strong>in</strong> §6.1.1 <strong>and</strong>§6.1.2, that the nonpathological sampl<strong>in</strong>g conditions (i) - (ii) are satisfied,that the product K d (FGH) d has no pole-zero cancelations <strong>in</strong> £ c , <strong>and</strong> thatall poles of S d lie <strong>in</strong> £ . Then the feedback system <strong>in</strong> Figure 6.1 is exponentiallystable <strong>and</strong> L 2 <strong>in</strong>put-output stable.Proof. The proof may be obta<strong>in</strong>ed by simple modification of the proofs ofTheorem 4 <strong>in</strong> Francis <strong>and</strong> Georgiou (1988), <strong>and</strong> Theorem 6 <strong>in</strong> Chen <strong>and</strong>Francis (1991).□◦3 See Middleton <strong>and</strong> Xie (1995a) for the multivariable case.4 Note that this is necessary <strong>in</strong> the SISO case.


6.2 Sensitivity Functions 141Lemma 6.1.3 establishes the conditions for the nom<strong>in</strong>al stability of theSD system of Figure 6.1, which will be assumed throughout the rest of thechapter.6.2 Sensitivity Functions6.2.1 Frequency ResponseThe steady-state response of a stable SD feedback system to a complexs<strong>in</strong>usoidal <strong>in</strong>put consists of a fundamental component at the frequencyof the <strong>in</strong>put as well as additional harmonics located at <strong>in</strong>teger multiplesof the sampl<strong>in</strong>g frequency away from the fundamental. This well-knownfact is discussed <strong>in</strong> textbooks (cf. Åström <strong>and</strong> Wittenmark, 1990; Frankl<strong>in</strong>et al., 1990), <strong>and</strong> has been emphasized <strong>in</strong> several recent research papers(e.g., Araki et al., 1993; Goodw<strong>in</strong> <strong>and</strong> Salgado, 1994).We next present expressions for the output response y <strong>in</strong> Figure 6.1 todisturbances <strong>and</strong> noise. Analogous expressions may be stated for the responseto the reference <strong>in</strong>put, <strong>and</strong> for the response of the control u to thesesignals.Lemma 6.2.1. Denote the responses of y to each of d <strong>and</strong> w by y d <strong>and</strong> y wrespectively. Then the Laplace transforms of these signals are given by[Y d (s) = I − 1 ]τ G(s) H(s) S d(e sτ ) K d (e sτ ) F(s) D(s)−∞∑k=−∞k≠0[ 1τ G(s) H(s) S d(e sτ ) K d (e sτ ) F k (s)]D k (s),(6.9)<strong>and</strong>[ ]1Y w (s) = −τ G(s) H(s) S d(e sτ ) K d (e sτ ) F(s) W(s)−∞∑k=−∞k≠0[ 1τ G(s) H(s) S d(e sτ ) K d (e sτ ) F k (s)]W k (s).(6.10)Proof. These formulae may be derived us<strong>in</strong>g st<strong>and</strong>ard techniques from SDcontrol theory (e.g., Frankl<strong>in</strong> et al., 1990; Åström <strong>and</strong> Wittenmark, 1990).We present only a derivation of (6.10). Assume that r <strong>and</strong> d are zero. Blockdiagram algebra <strong>in</strong> Figure 6.1 <strong>and</strong> (6.5) yieldY w (s) = G(s) H(s) U d (e sτ ) (6.11)<strong>and</strong>U d (z) = −S d (z) K d (z) V d (z). (6.12)


142 6. Extensions to Sampled-Data SystemsThe sampled output of the antialias<strong>in</strong>g filter can be written asV d (z) = ZS τ L −1 V(s)= ZS τ L −1 F(s)W(s).The assumptions on w, <strong>and</strong> the fact that F is strictly proper guarantee thevalidity of the relation (Braslavsky et al., 1995a)V d (e sτ ) = 1 τ∞∑k=−∞F k (s)W k (s). (6.13)Substitut<strong>in</strong>g (6.12) - (6.13) <strong>in</strong>to (6.11) <strong>and</strong> rearrang<strong>in</strong>g yields the desiredresult.□Under the assumption of closed-loop stability, the preced<strong>in</strong>g formulaemay be used to derive the steady-state response of the system to a periodic<strong>in</strong>put. As noted above, the response will be equal to the sum of<strong>in</strong>f<strong>in</strong>itely many harmonics of the <strong>in</strong>put frequency. The magnitude of eachcomponent is governed by a function analogous to the usual sensitivity orcomplementary sensitivity function for LTI systems.Def<strong>in</strong>ition 6.2.1 (SD Sensitivity Functions). We def<strong>in</strong>e the fundamentalsensitivity <strong>and</strong> complementary sensitivity functions byS 0 (s) I − 1 τ G(s) H(s) S d(e sτ ) K d (e sτ ) F(s) (6.14)<strong>and</strong>T 0 (s) 1 τ G(s) H(s) S d(e sτ ) K d (e sτ ) F(s) (6.15)respectively. For k ≠ 0 def<strong>in</strong>e the k-th harmonic response function byT k (s) 1 τ G k(s) H k (s) S d (e sτ ) K d (e sτ ) F(s). (6.16)These SD response functions are not rational functions, s<strong>in</strong>ce their def<strong>in</strong>ition<strong>in</strong>volves functions of the variable e sτ , like H(s), K d (e sτ ), <strong>and</strong> S d (e sτ ).In addition, note that they are not transfer functions <strong>in</strong> the usual sense,because they do not equal the ratio of the transforms of output to <strong>in</strong>putsignals. However, these functions do govern the steady-state frequencyresponse of the SD system.Lemma 6.2.2 (Steady-State Frequency Response). Let the hypotheses ofLemma 6.1.3 be satisfied <strong>and</strong> assume that d(t) = e jωt , t ≥ 0, <strong>and</strong> w(t) =e jωt , t ≥ 0. Then as t → ∞, we have thaty d (t) → y d ss(t) <strong>and</strong> y w (t) → y w ss (t),◦


6.2 Sensitivity Functions 143where<strong>and</strong>y d ss(t) = S 0 (jω)e jωt −y w ss (t) = −T 0 (jω)e jωt −∞∑k=−∞k≠0∞∑k=−∞k≠0T k (jω)e j(ω+kωs)t , (6.17)T k (jω)e j(ω+kωs)t . (6.18)Proof. See §B.2.1 <strong>in</strong> Appendix B.□Notice from (6.17) <strong>and</strong> (6.18) that the fundamental components of thedisturbance <strong>and</strong> noise responses can potentially be reduced over somefrequency ranges by shap<strong>in</strong>g S 0 <strong>and</strong> T 0 adequately. These facts correspondto the LTI case, as seen <strong>in</strong> Chapter 2.Yet, a feature that is dist<strong>in</strong>ctive of SD systems is also evident from (6.17)-(6.18), <strong>and</strong> this is the presence of harmonics at frequencies other than thatof the <strong>in</strong>put. The existence of these harmonics is due to the use of SD feedback,<strong>and</strong> is reflection of the fact that SD feedback has no counterpart <strong>in</strong>analogue systems.6.2.2 Sensitivity <strong>and</strong> RobustnessThe fundamental sensitivity <strong>and</strong> complementary sensitivity functions, togetherwith the harmonic response functions, may be also used to describedifferential sensitivity <strong>and</strong> robustness properties of a SD feedback system.As seen <strong>in</strong> §2.2.4 of Chapter 2, the sensitivity function of a LTI feedbacksystem governs the relative change <strong>in</strong> the reference response of thesystem with respect to small changes <strong>in</strong> the plant. Derivations similar tothose of Lemma 6.2.2 show that the steady-state response of the system <strong>in</strong>Figure 6.1 to a reference <strong>in</strong>put r(t) = e jωt , t ≥ 0, is given byy r ss(t) = T 0 (jω)e jωt −∞∑k=−∞k≠0T k (jω)e j(ω+kωs)t . (6.19)S<strong>in</strong>ce T 0 (jω) depends upon S d (e jωτ ), it follows from (6.7) that the fundamentalcomponent of the reference response at a particular frequency issensitive to variations <strong>in</strong> the plant response at <strong>in</strong>f<strong>in</strong>itely many frequencies.Lemma 6.2.3 (Differential Sensitivity). At each frequency ω, the relativesensitivity of the steady-state reference response (6.19) to variations<strong>in</strong> G(j(ω + lω s )) is given by(i) For l = 0,G(jω) ∂T 0 (jω)T 0 (jω) ∂G(jω) = S0 (jω). (6.20)


144 6. Extensions to Sampled-Data Systems(ii) For all l ≠ 0,G l (jω) ∂T 0 (jω)T 0 (jω) ∂G l (jω) = −T 0 (j(ω + lω s )). (6.21)Proof. The proof is a straightforward calculation, keep<strong>in</strong>g <strong>in</strong> m<strong>in</strong>d the dependenceof S d (e jωτ ) upon G l (jω).□These results may best be <strong>in</strong>terpreted by consider<strong>in</strong>g frequencies <strong>in</strong> theNyquist range. Fix ω ∈ Ω N . Then (6.20) states that the sensitivity of thefundamental component of the reference response to small variations <strong>in</strong>the plant at that frequency is governed by the fundamental sensitivity function.On the other h<strong>and</strong>, (6.21) states that the sensitivity of the fundamentalcomponent to higher frequency plant variations is governed by the fundamentalcomplementary sensitivity function evaluated at the higher frequency.Sensitivity functions S 0 <strong>and</strong> T 0 also have implications on the stabilityrobustness properties of the SD system. Suppose that the plant is subjectto a multiplicative perturbation, i.e., the perturbed plant, denoted ˜G, canbe written as˜G = G(1 + W 1 ∆) , (6.22)where ∆ is a stable <strong>and</strong> proper perturbation, <strong>and</strong> W 1 is a stable m<strong>in</strong>imumphase weight<strong>in</strong>g function used to represent frequency dependence of themodel<strong>in</strong>g error, <strong>and</strong> such that GW 1 is proper. 5Lemma 6.2.4 (Robust Stability aga<strong>in</strong>st Multiplicative Perturbations). Considerthe system of Figure 6.1 <strong>and</strong> let the assumptions of Lemma 6.1.3hold. A necessary condition for the system to rema<strong>in</strong> stable when the plantis perturbed as <strong>in</strong> (6.22) for all ∆ such that ‖∆‖ ∞ < 1 is that‖T 0 (jω)W 1 (jω)‖ ∞ ≤ 1. (6.23)Proof. See Appendix B.□Unsurpris<strong>in</strong>gly, a similar result holds for S 0 when we consider a divisiveperturbation model, where the perturbed plant is expressed by˜G = (1 + W 2 ∆) −1 G , (6.24)where ∆ is as for the multiplicative perturbation model, <strong>and</strong> W 2 is stable,m<strong>in</strong>imum phase, <strong>and</strong> such that GW 2 is proper.5 See also §2.2.4 <strong>in</strong> Chapter 2, where we gave an equivalent description of multiplicativeuncerta<strong>in</strong>ty.


6.3 Interpolation Constra<strong>in</strong>ts 145Lemma 6.2.5 (Robust Stability aga<strong>in</strong>st Divisive Perturbations). Underthe assumptions of Lemma 6.1.3, a necessary condition for the system ofFigure 6.1 to rema<strong>in</strong> stable when the plant is perturbed as <strong>in</strong> (6.24) for all∆ such that ‖∆‖ ∞ < 1 is that‖S 0 (jω)W 2 (jω)‖ ∞ ≤ 1. (6.25)Proof. See Appendix B.□It follows from Lemmas 6.2.4 <strong>and</strong> 6.2.5 that if |T 0 (jω)| or |S 0 (jω)| arevery large at any frequency, then the SD system will exhibit poor robustnessto unstructured uncerta<strong>in</strong>ty <strong>in</strong> the analogue plant at that frequency.6.3 Interpolation Constra<strong>in</strong>tsIn this section we present a set of <strong>in</strong>terpolation constra<strong>in</strong>ts that must besatisfied by the SD sensitivity functions def<strong>in</strong>ed <strong>in</strong> (6.14)-(6.16). SD sensitivityresponses have fixed values on ¢ + that are determ<strong>in</strong>ed by the openloopzeros <strong>and</strong> poles of the plant, hold response, <strong>and</strong> digital controller.As we will see later, a significant difference between the SD case <strong>and</strong> thecont<strong>in</strong>uous-time only or discrete-time only cases is that the poles <strong>and</strong> zerosof the controller yield different constra<strong>in</strong>ts than do those of the plant.We <strong>in</strong>troduce the follow<strong>in</strong>g notation for the ORHP zeros of S 0 <strong>and</strong> T 0 ,respectively given byZ S {s ∈ ¢ + : S 0 (s) = 0} , (6.26)Z T {s ∈ ¢ + : T 0 (s) = 0} . (6.27)Two def<strong>in</strong>itions concern<strong>in</strong>g the mapp<strong>in</strong>g between the z-plane <strong>and</strong> thes-plane will be h<strong>and</strong>y <strong>in</strong> the sequel.Def<strong>in</strong>ition 6.3.1 (Unfolded Images <strong>and</strong> Periodic Reflections). Given anumber z 0 ¢ ∈ , we say that s k0 = 1 τ log z 0 + jkω s , with k = 0, ±1, ±2, · · · ,is an unfolded image of z 0 .Given a number s 0 ¢ ∈ , we say that s k0 = s 0 +jkω s , with k = ±1, ±2, · · · ,is a periodic reflection of s 0 .◦The follow<strong>in</strong>g theorem describes the <strong>in</strong>terpolation relations for the fundamentalsensitivity <strong>and</strong> complementary sensitivity functions.Theorem 6.3.1 (Interpolation Constra<strong>in</strong>ts for S 0 <strong>and</strong> T 0 ). Consider thesystem of Figure 6.1 <strong>and</strong> assume that the hypotheses of Lemma 6.1.3 aresatisfied. Then,(i) S 0 <strong>and</strong> T 0 have no poles <strong>in</strong> ¢ +.(ii) Z S ⊃ {p ∈ ¢ + : p is a pole of G}.


146 6. Extensions to Sampled-Data Systems(iii) Z T = {q ∈ ¢ + }, where q is either(a) an ORHP zero of G, i.e., G(q) = 0;(b) an ORHP zero of H, i.e., H(q) = 0;(c) an unfolded image of a nonm<strong>in</strong>imum phase zero of K d , i.e.,K d (e qτ ) = 0;(d) a periodic reflection of an ORHP pole p of G, i.e., q = p + jkω sfor some <strong>in</strong>teger k ≠ 0.(iv) If b ∈ £ c is a pole of K d , <strong>and</strong> b k is an unfolded image of b, i.e.,b k = 1 τ log b + jmω s, m = 0, ±1, ±2, · · · , thenProof. Introduce factorizationsS 0 (b m ) = 1 − G(b m) H(b m ) F(b m ),τ(FGH) d (b)(6.28)T 0 (b m ) = G(b m) H(b m ) F(b m ).τ(FGH) d (b)(6.29)−sτ N(s)G(s) F(s) = eD(s) ,where N <strong>and</strong> D are coprime rational functions with no poles <strong>in</strong> ¢ +, <strong>and</strong>(FGH) d (z) = N d(z)D d (z) , (6.30)where N d <strong>and</strong> D d are coprime rational functions with no poles £c<strong>in</strong> . Bythe Youla parametrization, all controllers K d that stabilize (6.30) have theform 6K d = Y d + D d Q dX d − N d Q d, (6.31)where Q d , X d , <strong>and</strong> Y d are stable, <strong>and</strong> X d <strong>and</strong> Y d satisfy the Bezout identityIt follows that S d = D d (X d − N d Q d ) <strong>and</strong>Us<strong>in</strong>g (6.33) <strong>in</strong> (6.14) <strong>and</strong> (6.15) yieldsS 0 (s) = 1 − 1 τD d X d + N d Y d = 1. (6.32)K d S d = D d (Y d + D d Q d ). (6.33)N(s)H(s)e−sτ D d (e sτ ) [Y d (e sτ ) + D d (e sτ )Q d (e sτ )] (6.34)D(s)6 We suppress dependence on the transform variable when convenient; the mean<strong>in</strong>g willbe clear from the context.


6.3 Interpolation Constra<strong>in</strong>ts 147<strong>and</strong>T 0 (s) = 1 τThen:N(s)H(s)e−sτ D d (e sτ ) [Y d (e sτ ) + D d (e sτ )Q d (e sτ )] . (6.35)D(s)(i): T 0 is stable because each factor <strong>in</strong> the numerator of (6.35) is stable,<strong>and</strong> because the assumption of nonpathological sampl<strong>in</strong>g guaranteesthat any unstable pole of 1/D must be canceled by a correspond<strong>in</strong>gzero of D d (e sτ ).(ii): It follows from (6.32) that Y d (e pτ ) = 1/N d (e pτ ). Us<strong>in</strong>g this fact, <strong>and</strong>evaluat<strong>in</strong>g (6.34) <strong>in</strong> the limit as s → p yieldsReplace (FGH) d (e sτ ) us<strong>in</strong>g (6.7):S 0 (s) −→ 1 − lims→pF(s)G(s)H(s)τ(FGH) d (e sτ ) .S 0 H(s)G(s)F(s)(s) −→ 1 − lim ∑ ∞ s→pk=−∞ F k(s) G k (s) H k (s) . (6.36)By the assumptions that F is stable <strong>and</strong> that sampl<strong>in</strong>g is nonpathological,G <strong>and</strong> F have no poles at p + jkω s , k ≠ 0. Us<strong>in</strong>g this fact,<strong>and</strong> the fact that H has no f<strong>in</strong>ite poles, yields that each term <strong>in</strong> thedenom<strong>in</strong>ator of (6.36) rema<strong>in</strong>s f<strong>in</strong>ite as s → p except the term k = 0.Then (ii) follows.(iii): Observe first that (6.35) implies that T 0 has ORHP zeros only at theORHP zeros of N, H, D d (e sτ ), or [Y d (e sτ ) + D d (e sτ )Q d (e sτ )].From the assumptions <strong>in</strong> §6.1.1, G, F, <strong>and</strong> GF are each free of unstablehidden modes. Hence N <strong>and</strong> D can have no common ORHP zeros<strong>and</strong> (iii)a follows. By the assumption of nonpathological sampl<strong>in</strong>g,neither can H <strong>and</strong> D. Hence (iii)b follows. Note next that the zerosof D d (e sτ ) lie at p + jkω s , k = 0, ±1, ±2, · · · , where p is any ORHPpole of G <strong>and</strong> hence a zero of D. It follows from this fact that theratio D d (e sτ )/D(s) can have zeros only for k = ±1, ±2, · · · . By theassumption of nonpathological sampl<strong>in</strong>g, no other cancelations occur,<strong>and</strong> all these zeros are <strong>in</strong>deed zeros of T 0 . This proves (iii)d. By(6.31), the ORHP zeros of [Y d (e sτ ) + D d (e sτ )Q d (e sτ )] are identicalto the ORHP zeros of K d (e sτ ). By the hypotheses of Lemma 6.1.3,none of these zeros can co<strong>in</strong>cide with those of D d (e sτ ), <strong>and</strong> thuswith those of D. This proves (iii)c.(iv): It follows from (6.33) that K d (b)S d (b) = 1/(FGH) d (b). Substitutionof this identity <strong>in</strong>to (6.14)-(6.15) yields (6.28)-(6.29).


148 6. Extensions to Sampled-Data SystemsTheorem 6.3.1 establishes ORHP values of S 0 <strong>and</strong> T 0 that are fixed byopen-loop ORHP poles <strong>and</strong> zeros of the plant, which hold irrespective ofthe controller.There are a number of differences between the <strong>in</strong>terpolation constra<strong>in</strong>tsfor the SD <strong>and</strong> the cont<strong>in</strong>uous-time cases; we describe these <strong>in</strong> the follow<strong>in</strong>gremarks.Remark 6.3.1 (Unstable Plant Poles). Each ORHP plant pole yields constra<strong>in</strong>tsdirectly analogous to the cont<strong>in</strong>uous-time case; i.e., at each ORHP pole p,S 0 (p) = 0 <strong>and</strong> T 0 (p) = 1. Furthermore, each of these poles yields the additionalconstra<strong>in</strong>ts (iii)d, i.e., S 0 (p + jkω s ) = 1 <strong>and</strong> T 0 (p + jkω s ) = 0, k ≠ 0,which arise from the periodically spaced zeros of S d (e sτ ) <strong>and</strong> the fact thatnonpathological sampl<strong>in</strong>g precludes all but one of these zeros from be<strong>in</strong>gcanceled by a pole of G.◦Remark 6.3.2 (Nonm<strong>in</strong>imum Phase Plant Zeros). Each ORHP zero of theplant, q, yields constra<strong>in</strong>ts directly analogous to the cont<strong>in</strong>uous-time case;i.e., S 0 (q) = 1 <strong>and</strong> T 0 (q) = 0. Note <strong>in</strong> particular that these constra<strong>in</strong>ts arepresent <strong>in</strong>dependent of the choice of the hold function. The zeros of the discretizedplant ly<strong>in</strong>g <strong>in</strong> £ c , on the other h<strong>and</strong>, do not impose any <strong>in</strong>herentconstra<strong>in</strong>ts on S 0 . Indeed, suppose that ν ∈ £ c is a zero of (FGH) d . Thenfor each ν k 1 τ log(ν) + jkω s, k = 0, ±1, ±2, · · · , it follows thatS 0 (ν k ) = 1 − 1 τ G(ν k)H(ν k )K d (ν)F(ν k ), (6.37)<strong>and</strong> thus the size of S 0 (ν k ) is not <strong>in</strong>dependent of the choice of controller.◦Remark 6.3.3 (Unstable <strong>Control</strong>ler Poles). In the analogue case, unstable plant<strong>and</strong> controller poles yield identical constra<strong>in</strong>ts on the sensitivity <strong>and</strong> complementarysensitivity functions; namely, when evaluated at such a pole,the sensitivity must equal zero <strong>and</strong> the complementary sensitivity mustequal one. Compar<strong>in</strong>g (ii) <strong>and</strong> (iv) <strong>in</strong> Theorem 6.3.1, we see that <strong>in</strong> a SDsystem unstable plant <strong>and</strong> controller poles will generally yield differentconstra<strong>in</strong>ts on sensitivity <strong>and</strong> complementary sensitivity. In particular, unstablecontroller poles will yield correspond<strong>in</strong>g zeros of S 0 only <strong>in</strong> specialcases.◦Remark 6.3.4 (Zeros of K d ). Each zero of the controller ly<strong>in</strong>g £ <strong>in</strong> c imposes<strong>in</strong>f<strong>in</strong>itely many <strong>in</strong>terpolation constra<strong>in</strong>ts upon the cont<strong>in</strong>uous-time systembecause there are <strong>in</strong>f<strong>in</strong>itely many po<strong>in</strong>ts <strong>in</strong> the s-plane that map tothe location of the zero <strong>in</strong> the z-plane. These constra<strong>in</strong>ts are due to the factthat a pole at any of these po<strong>in</strong>ts will lead to an unstable discrete pole-zerocancelation.◦Remark 6.3.5 (Zeros of Hold Response). By (iii)b, zeros of H ly<strong>in</strong>g <strong>in</strong> theORHP impose constra<strong>in</strong>ts on the sensitivity function identical to those im-□


6.3 Interpolation Constra<strong>in</strong>ts 149posed by ORHP zeros of the plant. A ZOH has zeros only on the jω-axis,but GSHF response functions may have zeros <strong>in</strong> the open right half plane(Braslavsky et al., 1995b).◦Remark 6.3.6 (Zeros of S 0 ). The list of ORHP zeros for T 0 is exhaustive;however, that for S 0 is not. It is <strong>in</strong>terest<strong>in</strong>g to contrast this situation withthe analogue case. For analogue systems, the ORHP zeros of the sensitivityfunction consist precisely of the union of the ORHP poles of the plant <strong>and</strong>controller. On the other h<strong>and</strong>, by Theorem 6.3.1 (ii) <strong>and</strong> (iv), unstable plantpoles yield zeros of S 0 while unstable controller poles generally do not.Furthermore, as the follow<strong>in</strong>g example shows, S 0 may have ORHP zeroseven if both plant <strong>and</strong> controller are stable.Example 6.3.1. Consider the plant G(s) = 1/(s + 1). Discretize with aZOH, sample period τ = 1, <strong>and</strong> no anti-alias<strong>in</strong>g filter (i.e., F(s) = 1). Thisyields (FGH) d (z) = .6321/(z − .3670). A stabiliz<strong>in</strong>g discrete controller forthis plant isK d (z) = (4.8158)(z2 + .1z + 0.3988)(z 2 − 1.02657z + 0.9025) .Both plant <strong>and</strong> controller are stable; yet it may be verified that S 0 has zerosat s = 0.2 ± j (see Figure 6.2).log | So(s) |−1−2−3−4−5−6−7−800.10.20.3σ0.40.50.6 00.5j ω / ω N1FIGURE 6.2. <strong>Fundamental</strong> sensitivity for Example 6.3.1◦


150 6. Extensions to Sampled-Data Systems6.4 Poisson Integral formulaeThe <strong>in</strong>terpolation constra<strong>in</strong>ts derived <strong>in</strong> the preced<strong>in</strong>g section fix the valuesof the SD response functions at po<strong>in</strong>ts of the ORHP. In this section,we translate these constra<strong>in</strong>ts <strong>in</strong>to Poisson <strong>in</strong>tegral relations that must besatisfied by the SD sensitivity functions. These relations are equivalent tothose presented <strong>in</strong> Chapter 3.We <strong>in</strong>troduce some Blaschke products first. For convenience, we denotethe ORHP zeros of S 0 <strong>and</strong> T 0 respectively asZ S = {p i : i = 1, 2, . . . , n p } , <strong>and</strong> Z T = {q i : i = 1, 2, . . . , n q } ,where n S <strong>and</strong> n T may be <strong>in</strong>f<strong>in</strong>ite. Correspond<strong>in</strong>gly, let their Blaschke productsben pn∏ s − p i∏ qs − q iB S (s) , <strong>and</strong> B T (s) . (6.38)s + ¯p i s + ¯q iWe also denote by 7i=1B G : the Blaschke product of the ORHP poles of G,B G : the Blaschke product of the ORHP zeros of G,B H : the Blaschke product of the ORHP zeros of H,B Kd : the Blaschke product of the unfolded images of the ORHP zeros ofK d , <strong>and</strong>B G⋔ : the Blaschke product of the periodic reflections of the ORHP polesof G.i=16.4.1 Poisson Integral for S 0The follow<strong>in</strong>g theorem presents a Poisson <strong>in</strong>tegral <strong>in</strong>equality that must besatisfied by log |S 0 (jω)|.Theorem 6.4.1 (Poisson <strong>in</strong>tegral for S 0 ). Assume that the hypotheses ofLemma 6.1.3 are satisfied. Let q = σ q + jω q , with σ q > 0, be an ORHPzero of T 0 , as described <strong>in</strong> Theorem 6.3.1. Then,∫ ∞log |S 0 σ q(jω)|−∞σ 2 dω ≥ π log|B−1q + (ω − ω q )2 G(q)| . (6.39)7 S<strong>in</strong>ce there are three Blaschke products emanat<strong>in</strong>g from G, we use <strong>in</strong> the notation somek<strong>in</strong>d of graphical <strong>in</strong>dication, i.e., to <strong>in</strong>dicate a zero, to <strong>in</strong>dicate a pole, <strong>and</strong> ⋔ to suggestperiodic reflection.


6.4 Poisson Integral formulae 151Proof. Factorize S 0 as S 0 = ˜SB S , where B S is given <strong>in</strong> (6.38), <strong>and</strong> where ˜Shas no poles or zeros <strong>in</strong> the ORHP. Then apply<strong>in</strong>g Corollary A.6.3 <strong>in</strong> AppendixA to ˜S implies that (6.39) holds with equality if B G (q) is replacedby B S (q). S<strong>in</strong>ce the set of ORHP zeros of S 0 due to the ORHP poles of Gis generally a proper subset of all such zeros (cf. Remark 6.3.6) <strong>in</strong>equality(6.39) follows. □Theorem 6.4.1 has several design implications, which we describe <strong>in</strong> aseries of remarks.Remark 6.4.1 (Nonm<strong>in</strong>imum Phase Plant Zeros). As <strong>in</strong> the cont<strong>in</strong>uous timecase, if the plant is nonm<strong>in</strong>imum phase, then requir<strong>in</strong>g that |S 0 (jω)| < 1over a frequency range Ω implies that, necessarily, |S 0 (jω)| > 1 at otherfrequencies. The severity of this trade-off depends upon the relative locationof the ORHP zero <strong>and</strong> the frequency range Ω. We now discuss this <strong>in</strong>more detail.We recall the def<strong>in</strong>ition of the weighted length of an <strong>in</strong>terval by thePoisson kernel for the half plane, <strong>in</strong>troduced <strong>in</strong> Chapter 3, (3.33). Let s 0 =σ 0 + jω 0 be a po<strong>in</strong>t ly<strong>in</strong>g <strong>in</strong> ¢ + , <strong>and</strong> consider the frequency <strong>in</strong>terval Ω 1 =[−ω 1 , ω 1 ). Then, we had thatΘ s0 (ω 1 ) =∫ ω1σ 0−ω 1σ 2 0 + (ω − ω 0) dω. 2We have also seen <strong>in</strong> §3.3.2 that Θ s0 (ω 1 ) equals the negative of the phaselag contributed by a Blaschke product of s 0 at the upper end po<strong>in</strong>t of the<strong>in</strong>terval Ω 1 . With this notation, the follow<strong>in</strong>g result is an immediate consequenceof (6.39).Corollary 6.4.2 (Water-Bed Effect). Suppose that q is an ORHP zero of theplant, <strong>and</strong> suppose thatThen|S 0 (jω)| ≤ α, for all ω <strong>in</strong> Ω 1 .Θ q(ω 1 )sup |S 0 (jω)| ≥ (1/α)∣ ππ−Θ q(ω 1 ) B−1G (q)∣ ∣ π−Θ q(ω 1 )(6.40)ω>ω 1◦The bound (6.40) shows that if disturbance attenuation is required throughouta frequency <strong>in</strong>terval <strong>in</strong> which the ORHP zero contributes significantphase lag, then disturbances will be greatly amplified at some higher frequency.The factor due to the Blaschke product <strong>in</strong> (6.40) shows that plantswith approximate ORHP pole-zero cancelations yield particularly sensitivefeedback systems.◦


152 6. Extensions to Sampled-Data SystemsRemark 6.4.2 (Nonm<strong>in</strong>imum Phase Hold Zeros). An ORHP zero of the holdresponse imposes precisely the same trade-off as does a zero of the plant<strong>in</strong> the same location. This trade-off is exacerbated if the ORHP hold zero isnear an unstable plant pole. Poor sensitivity <strong>in</strong> this case is to be expected,as an exact pole-zero cancelation yields an unstable hidden mode <strong>in</strong> thediscretized plant 8 .◦Remark 6.4.3 (Unstable Plant Poles). Us<strong>in</strong>g analogue control, the sensitivityfunction of a system with an unstable, but m<strong>in</strong>imum phase, plant can bemade arbitrarily small over an arbitrarily wide frequency range (Zames<strong>and</strong> Bensoussan, 1983) while keep<strong>in</strong>g sensitivity bounded outside thisrange. This is no longer true for digital controllers <strong>and</strong> the fundamentalsensitivity function. The follow<strong>in</strong>g result is an immediate consequence of(6.39). ◦Corollary 6.4.3.(i) Assume that the plant has a real ORHP pole, p = σ p . Then√( ) 2‖S 0 σp‖ ∞ ≥ 1 + . (6.41)ω N(ii) Assume that the plant has an ORHP complex conjugate pole pair,p = σ p + jω p , ¯p = σ p − jω p . Then for k = ±1, ±2 . . .√√( ) 2 () 2‖S 0 σpσ p‖ ∞ ≥ 1 +1 +. (6.42)kω N ω p − kω NProof. We show only (i); (ii) is similar. Evaluate (6.39) with q equal to oneof the periodic reflections of p, i.e., q = σ p + jkω s , with k = ±1, ±2, . . .Then◦π log ‖S 0 ‖ ∞ ≥From (6.43) follows∫ ∞log |S 0 (jω)|−∞∣≥ π log∣2σ p + jkω s−jkω s∣ ∣∣∣.σ pσ 2 p + (ω − kω s ) 2 dω√‖S 0 ‖ ∞ ≥ 1 + [σ p /(kω N )] 2√≥ 1 + (σ p /ω N ) 2 .(6.43)□8 See the conditions for nonpathological sampl<strong>in</strong>g <strong>in</strong> Lemma 6.1.2.


6.4 Poisson Integral formulae 153In either case of Corollary 6.4.3, the fundamental sensitivity functionnecessarily has a peak strictly greater than one.For a real pole, achiev<strong>in</strong>g good sensitivity requires that the sampl<strong>in</strong>grate be sufficiently fast with respect to the time constant of the pole; e.g.,achiev<strong>in</strong>g ‖S 0 ‖ ∞ < 2 requires that ω N > σ p / √ 3. This condition is alsonecessary for a complex pole pair. Furthermore, sensitivity will be poor ifω p ≈ kω N for some k ≠ 0. The reason for poor sensitivity <strong>in</strong> this caseis clear; if ω p = kω N , then the complex pole pair violates the nonpathologicalsampl<strong>in</strong>g condition (6.8), <strong>and</strong> the discretized plant will have anunstable hidden mode.More generally, we haveCorollary 6.4.4. Assume that the plant has unstable poles p i <strong>and</strong> p k withp i ≠ ¯p k . Then∣ ‖S 0 ‖ ∞ ≥ max¯p i + p k + jkω s ∣∣∣k≠0∣(6.44)p i − p k − jkω s<strong>and</strong>‖S 0 ‖ ∞ ≥ maxk≠0∣ p i + p k + jkω s ∣∣∣∣(6.45)¯p i − p k − jkω sIt follows that if sampl<strong>in</strong>g is “almost pathological”, <strong>in</strong> that p i − p k ≈jkω s , or ¯p i − p k ≈ jkω s , then sensitivity will be large.Remark 6.4.4 (Approximate Discrete Pole Zero Cancelations). Let the discretecontroller have an ORHP zero a. Then (6.39) holds with q equal to one ofthe unfolded images of a <strong>in</strong> the s-plane. If the plant has an unstable polenear one of these po<strong>in</strong>ts, then the right h<strong>and</strong> side of (6.39) will be large,<strong>and</strong> S 0 will have a large peak. Poor sensitivity is plausible, because thissituation corresponds to an approximate pole-zero cancelation betweenan ORHP zero of the controller <strong>and</strong> a pole of the discretized plant. ◦◦6.4.2 Poisson Integral for T 0We now derive a result for T 0 dual to that for S 0 obta<strong>in</strong>ed <strong>in</strong> the previoussection. An important difference is that we can characterize all ORHPzeros of T 0 , <strong>and</strong> thus obta<strong>in</strong> an <strong>in</strong>tegral equality.First, we note an additional property of the hold response function.Lemma 6.4.5. The hold response function may be factored asH(s) = ˜H(s)e −sτ HB H (s), (6.46)where τ H ≥ 0, B H is as def<strong>in</strong>ed on page 150, <strong>and</strong> log | ˜H| satisfies the Poisson<strong>in</strong>tegral relation.


154 6. Extensions to Sampled-Data SystemsProof. Follows directly from the def<strong>in</strong>ition of H, which implies that H is afunction <strong>in</strong> H 2 , <strong>and</strong> thus the factorization of Hoffman (1962, pp. 132-133)applies.□Theorem 6.4.6 (Poisson <strong>in</strong>tegral for T 0 ). Suppose that the conditions ofLemma 6.1.3 are satisfied. Let p = σ p + jω p be an ORHP pole of G. Then∫ ∞log |T 0 σ p(jω)|−∞σ 2 p + (ω − ω p ) 2 dω = πσ p(τ G + τ H + RD K d τ)+ π log |B −1G (p)|+ π log |B −1H (p)|+ π log |B −1G⋔ (p)|+ π log |B −1K d(p)| ,(6.47)where RD K d denotes the relative degree of K d .Proof. Note that T 0 has an <strong>in</strong>ner-outer factorizationT 0 (s) = ˜T(s) e −sτ Ge −sτ He −sτ RD K dB G (s) B H (s) B G⋔ (s) B Kd (s)where ˜T satisfies Corollary A.6.3 <strong>in</strong> Appendix A. S<strong>in</strong>ce log | ˜T(jω)| = log |T 0 (jω)|,the result follows.□We comment on the design implications of Theorem 6.4.6 <strong>in</strong> a series ofremarks.Remark 6.4.5. The first term on the right h<strong>and</strong> side of (6.47) shows that|T 0 (jω)| will display a large peak if there is a long time delay <strong>in</strong> the plant,digital controller, or hold function.◦Remark 6.4.6. The third <strong>and</strong> fourth terms on the right h<strong>and</strong> side of (6.47)show that |T 0 (jω)| will display a large peak if there is an approximate unstablepole-zero cancelation <strong>in</strong> the plant, or between the plant <strong>and</strong> the holdfunction. By the nonpathological sampl<strong>in</strong>g condition (ii) <strong>in</strong> Lemma 6.1.2,the latter peak corresponds to an approximate unstable pole-zero cancelation<strong>in</strong> the discretized plant.◦The follow<strong>in</strong>g result is analogous to Corollary 6.4.4 for T 0 .Corollary 6.4.7.(i) Assume that p = σ p , a real pole. Then‖T 0 ‖ ∞ ≥s<strong>in</strong>h( πσpω N)( πσpω N) . (6.48)


6.4 Poisson Integral formulae 155(ii) Assume that p = σ p + jω p , a complex pole. Then‖T 0 ‖ ∞ ≥( ) πσps<strong>in</strong>hω( Nπσpω N)∣( πps<strong>in</strong>h)( )πωpω N ω N ( )πωp∣ ∣s<strong>in</strong>ω∣N( πpω N). (6.49)Proof. Consider the Blaschke product correspond<strong>in</strong>g to the periodic reflectionsof p, which can be also written asB p (s) =∞∏k=1( ) 2 s − p1 −jkω s( ) 2 s + ¯p1 −jkω sUs<strong>in</strong>g the identities (Lev<strong>in</strong>son <strong>and</strong> Redheffer, 1970, p. 387),<strong>and</strong> s<strong>in</strong> jα = j s<strong>in</strong>h α yieldsB p (s) =s<strong>in</strong> παπα∞ = ∏) (1 − α2k 2k=1( ) s − ps<strong>in</strong>h πω( s) s − pπω s( ) ¯p + sπω( s¯p + s) (6.50)ω ss<strong>in</strong>h πNote that the first factor on the right h<strong>and</strong> side of (6.50) converges toone as s → p. It follows thatInvert<strong>in</strong>g yields (6.48). FurthermoreB ¯p (p) =( ) πσpω NB p (p) = ( ) (6.51)πσps<strong>in</strong>hω N( ) πωps<strong>in</strong>ω( Nπωpω N)( ) πpω( N) (6.52)πps<strong>in</strong>hω NTogether (6.51)-(6.52) yield (6.49).□


156 6. Extensions to Sampled-Data SystemsFigure 6.3(a) gives plots of the bound (6.49) versus σ p = Re p for variousvalues of ω p = Im p, <strong>and</strong> Figure 6.3(b) gives plots of the bound (6.49) versusω p for various values of σ p . The pole location has been normalizedby the Nyquist frequency. Note <strong>in</strong> Figure 6.3(b) that for a complex pole‖T 0 ‖ ∞ will become arbitrarily large as ω p → kω N , k = ±1, ±2 . . . , becausesampl<strong>in</strong>g becomes pathological at such frequencies. It follows fromthese plots that to achieve good robustness the Nyquist frequency shouldbe chosen several times larger than the radius of any unstable pole.1010 5 (b)8610 410 3σp /ω N0.30.60.94σp /ω Nωp/ω N0.020.70.90.9500 0.2 0.4 0.6 0.8 1(a)10 210 1FIGURE 6.3. Lower bounds on ‖T 0 ‖ ∞.10 00 1 2 3 4 5ωp /ω N6.5 Example: Robustness of Discrete ZeroShift<strong>in</strong>gWe illustrate the use of the tools presented <strong>in</strong> this chapter by analyz<strong>in</strong>gan example of design orig<strong>in</strong>ally presented <strong>in</strong> Er <strong>and</strong> Anderson (1994). Theplant is given bys − 5P(s) =(s + 1)(s + 3) .It is required that the closed-loop b<strong>and</strong>width, ω b , satisfy the lower boundω b ≥ 15.3 rad/s. Notice that the nonm<strong>in</strong>imum phase zero of the analogueplant lies well with<strong>in</strong> the desired closed-loop b<strong>and</strong>width. Hence, bythe results of Chapter 3, if this b<strong>and</strong>width is achieved with an analoguecontroller, then the result<strong>in</strong>g closed-loop system will have very poor sensitivity<strong>and</strong> robustness properties.Briefly, the design suggested <strong>in</strong> Er <strong>and</strong> Anderson (1994) consists of firstus<strong>in</strong>g a GSHF to relocate the discrete zeros so that the discretized plantis m<strong>in</strong>imum phase, <strong>and</strong> then apply<strong>in</strong>g a st<strong>and</strong>ard discrete-time LQG/LTRprocedure (e.g., Zhang <strong>and</strong> Freudenberg, 1993). It follows that perfect LTR


6.5 Example: Robustness of Discrete Zero Shift<strong>in</strong>g 157r +❜ ✲ ✐ ✲ Plant− ✻y✲ ❛ ✟ ❄ ❛Ty✲ k❜GSHF✛LQR/LQGCompensator✛FIGURE 6.4. Structure for the GSHF-based LTR of Er <strong>and</strong> Anderson (1994).at the sampl<strong>in</strong>g times is feasible <strong>in</strong>dependent of the zero distribution of theanalogue plant. The proposed scheme is shown <strong>in</strong> Figure 6.4. The sampl<strong>in</strong>gperiod is chosen as τ = 0.04s, <strong>and</strong> an appropriate GSHF is givenby{−1957 for 0 ≤ t < 0.02 ,h(t) =(6.53)1707 for 0.02 ≤ t < 0.04 .This yields a m<strong>in</strong>imum phase discretized plant. In Figure 6.5 we plot |S d (e jωτ )|<strong>and</strong> |T d (e jωτ )| obta<strong>in</strong>ed with the compensator suggested <strong>in</strong> Er <strong>and</strong> Anderson(with their parameter q = 3). The closed-loop specification is achieved<strong>and</strong> both |S d (e jωτ )| <strong>and</strong> |T d (e jωτ )| are well behaved.1.4| S(e jωτ ) |1.210.80.60.4| T(e jωτ ) |0.200 0.2 0.4 0.6 0.8 1FIGURE 6.5. Discrete sensitivity<strong>and</strong> complementary sensitivityfunctions.654321|T 0 (jω)||W(jω)| −100 0.5 1 1.5 2 2.5 3ω / ω NFIGURE 6.6. <strong>Fundamental</strong> complementarysensitivity <strong>and</strong> weight W 1 .However, as the results <strong>in</strong> this chapter predict, the closed-loop systemwill still be prone to sensitivity <strong>and</strong> robustness problems. Figure 6.6 showsa plot of |T 0 (jω)|, which exhibits large values with<strong>in</strong> <strong>and</strong> outside the Nyquist<strong>in</strong>terval. Follow<strong>in</strong>g Lemma 6.2.4, we should expect poor sensitivity to perturbations<strong>in</strong> the analogue plant for which W 1 <strong>in</strong> (6.22) satisfies|T 0 (jω)| >1|W 1 (jω)| ,at some ω. Indeed, suppose that the plant has an unmodeled time delayτ G . This may be represented by a multiplicative perturbation model as <strong>in</strong>


158 6. Extensions to Sampled-Data Systems(6.22), with∆(s) = 1 − e−sτ Gsτ G (1 + δ) , <strong>and</strong> W 1(s) = −τ G s(1 + δ),where 0 < δ ≪ 1. In Figure 6.6 we have also plotted 1/W 1 (jω)| for τ G =0.0024s, <strong>and</strong> it can be seen that there is a frequency for which |T 0 (jω)| ≈1/|W 1 (jω)|. In fact, as can be checked by simulation, the system becomesunstable even for this small time delay. Note that this extreme sensitivityto small errors <strong>in</strong> the analogue plant model is not apparent from the Bodeplots of the discrete sensitivities.6.6 SummaryThis chapter has developed performance limitations for SISO SD systems.The approach relies on the analysis of the SD sensitivity functions S 0 <strong>and</strong>T 0 , which serve to quantify the sensitivity <strong>and</strong> robustness properties ofthe system tak<strong>in</strong>g <strong>in</strong>tersample behavior <strong>in</strong>to account.It follows from the results <strong>in</strong> this chapter that SD systems do not escapethe difficulty imposed upon analogue feedback design by open-loop nonm<strong>in</strong>imumphase zeros, unstable poles, <strong>and</strong> time delays. This difficulty,then, persists when the controller is implemented digitally <strong>and</strong>, furthermore,<strong>in</strong>dependently of the type of hold function used.In addition, a number of limitations are unique to digital controller implementations.First, there are design limitations due to potential nonm<strong>in</strong>imumphase zeros of the hold function. Second, <strong>and</strong> perhaps most <strong>in</strong>terest<strong>in</strong>g,are the design limitations due to unstable plant poles. If the samplerate is “almost pathological” <strong>and</strong>/or is slow with respect to the time constantof the pole, then sensitivity, robustness, <strong>and</strong> response to exogenous<strong>in</strong>puts will all be poor.Notes <strong>and</strong> ReferencesThe contents of this chapter are largely based on Freudenberg et al. (1995). In thispaper, <strong>in</strong>tegral constra<strong>in</strong>ts on S 0 <strong>and</strong> T 0 of the Bode-type have also been derived.Extensions of these results to multirate SD systems have been reported <strong>in</strong> Middleton<strong>and</strong> Xie (1995b), while applications to the analysis of robustness propertiesof zero shift<strong>in</strong>g control schemes have been presented <strong>in</strong> Freudenberg et al. (1994),from which the example <strong>in</strong> §6.5 was taken; see also Braslavsky (1995) for a detaileddiscussion. More on performance limitations for SD systems may be found<strong>in</strong> Araki (1993), Feuer <strong>and</strong> Goodw<strong>in</strong> (1994) <strong>and</strong> Zhang <strong>and</strong> Zhang (1994).<strong>Fundamental</strong> sensitivity functions were first <strong>in</strong>troduced <strong>in</strong> Goodw<strong>in</strong> <strong>and</strong> Salgado(1994) to study SD systems consider<strong>in</strong>g full <strong>in</strong>tersample behavior. A frequencydoma<strong>in</strong> framework embody<strong>in</strong>g all harmonics has been presented <strong>in</strong> Araki


6.6 Summary 159et al. (1993). Recent references on the frequency response of SD systems <strong>in</strong>cludeYamamoto <strong>and</strong> Araki (1994), Hagiwara et al. (1995), Rosenvasser (1995), <strong>and</strong> Yamamoto<strong>and</strong> Khargonekar (1996).A rigorous <strong>and</strong> self-conta<strong>in</strong>ed derivation of the key sampl<strong>in</strong>g formula that yields(6.7) may be found <strong>in</strong> Braslavsky et al. (1995a).For further read<strong>in</strong>g on SD systems see the recent books by Chen <strong>and</strong> Francis(1995) <strong>and</strong> Feuer <strong>and</strong> Goodw<strong>in</strong> (1996).


Part III<strong>Limitations</strong> <strong>in</strong> L<strong>in</strong>earFilter<strong>in</strong>g


7General ConceptsThis chapter sets up the general framework for the discussion <strong>in</strong> Part IIIregard<strong>in</strong>g sensitivity limitations <strong>in</strong> filter design. In particular, two ma<strong>in</strong>concepts are <strong>in</strong>troduced here: filter<strong>in</strong>g sensitivity functions <strong>and</strong> bounded errorestimators. These concepts are the start<strong>in</strong>g po<strong>in</strong>t of a theory of designlimitations for a broad class of l<strong>in</strong>ear filter<strong>in</strong>g problems, as we will see <strong>in</strong>the follow<strong>in</strong>g chapters.7.1 General Filter<strong>in</strong>g ProblemThis section presents the filter<strong>in</strong>g problem that we will be primarily concernedwith the next few chapters.Consider the general filter<strong>in</strong>g configuration of Figure 7.1, where the signalsare as follows,¡ ¡¡ ¡¡ ¡v : process <strong>in</strong>put, <strong>in</strong> m , y : measured output, <strong>in</strong> l ,w : measurement noise, <strong>in</strong> l , ^z : estimate, <strong>in</strong> n ,z : signal to be estimated, <strong>in</strong> n , ˜z : estimation error, <strong>in</strong> n .The plant, G, is a LTI system given by the follow<strong>in</strong>g state-space representation(cf. the notation <strong>in</strong> (2.4)):⎡A B 0⎤G =s ⎣ C 1 D 1 0 ⎦ , (7.1)C 2 D 2 I


164 7. General Concepts❜ v ✲❜ w ✲Gzy✲F+✲ ❥✲˜z−✻^zFIGURE 7.1. General filter<strong>in</strong>g configuration.where the pair (A, C 2 ) is assumed to be detectable. For simplicity, we willalso assume that the pair (A, B) is stabilizable, <strong>and</strong> <strong>in</strong>dicate the ma<strong>in</strong> differenceswith the nonstabilizable case when relevant. 1 The process <strong>in</strong>putv <strong>and</strong> the measurement noise w are nonmeasured external <strong>in</strong>puts to theplant.The filter is represented by a proper transfer function F. The generalproblem of filter<strong>in</strong>g then focuses on the design of a suitable F such thatthe target signal z can be estimated from the noisy measurements y. Inoptimal filter<strong>in</strong>g, for example, F is selected to m<strong>in</strong>imize a certa<strong>in</strong> performancecriteria, typically <strong>in</strong>volv<strong>in</strong>g the “smallness” of the estimation error<strong>in</strong> some sense based on particular assumptions about v <strong>and</strong> w.Sometimes the process has, <strong>in</strong> addition to v, a control or measured <strong>in</strong>putu. In this case the filter will generally use both measured <strong>in</strong>put <strong>and</strong>output <strong>in</strong> the construction of the estimate. An important class of filtersthat use both u <strong>and</strong> y <strong>in</strong> the estimate is the class of unbiased filters, whichwill be briefly considered <strong>in</strong> §7.3.1. The follow<strong>in</strong>g example illustrates theconstruction of a filter when a measured <strong>in</strong>put is available.Example 7.1.1. Suppose that the plant G <strong>in</strong> Figure 7.1 has an additionalmeasured <strong>in</strong>put u, i.e., G is given by the state-space descriptionẋ = Ax + B u u + Bv, x(0) = x 0 ,z = C 1 x + D 1 v,y = C 2 x + D 2 v + w.(7.2)One way to estimate the signal z us<strong>in</strong>g both u <strong>and</strong> y is to build a full-stateobserver for the state x. Then, the filter F obta<strong>in</strong>ed from this observer willbe given by a realization of the follow<strong>in</strong>g general form:˙ξ = ^Aξ + K u u + K y y, ξ(0) = ξ 0 ,^x = ^Cξ,^z = C 1^x,(7.3)where ξ is the state of the filter <strong>and</strong> ^x is an estimate of the plant’s state x.Different choices of the matrices <strong>in</strong> (7.3) will lead to filters hav<strong>in</strong>g different1 See Seron (1995) for the complete analysis of this latter case.


7.2 Sensitivity Functions 165characteristics, such as boundedness of the estimation error, unbiasedness,etc. More on this <strong>in</strong> §7.3.◦For convenience, we partition the plant as shown <strong>in</strong> Figure 7.2, with theobvious def<strong>in</strong>itions for the transfer functions G z <strong>and</strong> G y appear<strong>in</strong>g there.Note that, by the assumption of detectability from y <strong>and</strong> stabilizabilityfrom v, the function G y conta<strong>in</strong>s all the unstable modes of G, some or allof which may not appear <strong>in</strong> G z .❜ v ✲❜ w ✲✲✲G zG y✲+❣+✻✲zy✲FIGURE 7.2. Structure of the plant.The follow<strong>in</strong>g st<strong>and</strong><strong>in</strong>g assumptions about the transfer functions F <strong>and</strong>G z are needed <strong>in</strong> the sequel.Assumption 7.1.(i) F is stable.(ii) G z is right <strong>in</strong>vertible.Condition (i) <strong>in</strong> Assumption 7.1 is clear, <strong>and</strong> the least requirement thatwe can ask for the filter; namely, that it should be a stable operation on theavailable signal, y. Condition (ii) is necessary for our def<strong>in</strong>ition of filter<strong>in</strong>gsensitivity functions, <strong>in</strong> the section com<strong>in</strong>g. In relation to this condition,recall that G z (s) is right <strong>in</strong>vertible if it has full row rank at almost all s ∈ ¢ ,which requires that there be at least as many process <strong>in</strong>puts as signals tobe estimated, i.e., m ≥ n.For future use, we <strong>in</strong>troduce a notation for generic <strong>in</strong>put-output operators<strong>in</strong> the sett<strong>in</strong>g of Figure 7.1. If a <strong>and</strong> b are two generic signals (i.e.,either of v, w, z, etc.), we denote by the symbol H ba the direct – i.e., generallyphysical <strong>and</strong> open-loop – mapp<strong>in</strong>g from signal a to signal b. Forexample, H zv denotes the map from process <strong>in</strong>put v to signal z, i.e., theoperator given by the function G z .◦7.2 Sensitivity FunctionsThe pr<strong>in</strong>cipal focus <strong>in</strong> the filter<strong>in</strong>g literature has been on various optimizationprocedures. Until recently, by far the greatest emphasis was onquadratic performance criteria. These criteria measure performance <strong>in</strong> terms


166 7. General Conceptsof variance of the estimation error (Anderson <strong>and</strong> Moore, 1979; Kailath,1974). However, <strong>in</strong> later years, attention has focused on other optimizationcriteria. For example, the recently <strong>in</strong>troduced robust design methods employthe <strong>in</strong>f<strong>in</strong>ity norm (Yaesh <strong>and</strong> Shaked, 1989; Nagpal <strong>and</strong> Kharghonekar,1991; Bernste<strong>in</strong> <strong>and</strong> Haddad, 1989; de Souza et al., 1995). Whilst optimizationmethods play an <strong>in</strong>valuable role <strong>in</strong> determ<strong>in</strong><strong>in</strong>g the best solution undera given set of conditions, they fail to p<strong>in</strong>-po<strong>in</strong>t why a desired performancelevel is not achievable. They also do not allow one to evaluate thebenefits of chang<strong>in</strong>g the measurement system, for example, by relocat<strong>in</strong>ga sensor or by add<strong>in</strong>g additional sensors.One way to address these issues is to identify complementary operators— such as S <strong>and</strong> T <strong>in</strong> the control case — that represent relevant propertiesof the filter<strong>in</strong>g system, <strong>and</strong> then use <strong>in</strong>tegral constra<strong>in</strong>ts to quantify the <strong>in</strong>herentsensitivity trade-offs associated with filter<strong>in</strong>g problems. This is theapproach taken <strong>in</strong> Goodw<strong>in</strong> et al. (1995) <strong>and</strong> Seron <strong>and</strong> Goodw<strong>in</strong> (1995),<strong>and</strong> is the one that we will develop <strong>in</strong> this chapter.Consider the operators H ˜zv , H^zv , <strong>and</strong> H zv mapp<strong>in</strong>g process <strong>in</strong>put to estimationerror, estimate, <strong>and</strong> estimated signal, respectively, <strong>in</strong> the diagramof Figure 7.1. Then, we have the follow<strong>in</strong>g def<strong>in</strong>ition.Def<strong>in</strong>ition 7.2.1 (Filter<strong>in</strong>g Sensitivity Functions). Let the conditions <strong>in</strong>Assumption 7.1 hold. Then, the filter<strong>in</strong>g sensitivity <strong>and</strong> complementary sensitivityfunctions, denoted by P <strong>and</strong> M, respectively, are def<strong>in</strong>ed asP(s) H ˜zv (s)H −1zv (s), <strong>and</strong> M(s) H^zv (s)H −1zv (s). (7.4)The filter<strong>in</strong>g sensitivity P represents the relative effect of the process <strong>in</strong>puton the estimation error, while the filter<strong>in</strong>g complementary sensitivityM represents the relative effect of the process <strong>in</strong>put on the estimate. Interms of the transfer functions def<strong>in</strong>ed <strong>in</strong> Figure 7.2, we can express P <strong>and</strong>M as follows:P = (G z − FG y )G −1z , <strong>and</strong> M = FG y G −1z . (7.5)The follow<strong>in</strong>g complementarity constra<strong>in</strong>t holds for the filter<strong>in</strong>g sensitivity<strong>and</strong> complementary sensitivity functions.Theorem 7.2.1. The filter<strong>in</strong>g sensitivity <strong>and</strong> complementary sensitivitydef<strong>in</strong>ed <strong>in</strong> (7.4) satisfy the relationP(s) + M(s) = I, (7.6)at any complex frequency s that is not a pole of P <strong>and</strong> M.Proof. It follows from Figure 7.1 thatH ˜zv + FH yv = H zv . (7.7)◦


7.2 Sensitivity Functions 167Then (7.6) follows immediately us<strong>in</strong>g the def<strong>in</strong>itions (7.4) <strong>and</strong> not<strong>in</strong>g thatH^zv = FH yv .□Equation (7.6) is rem<strong>in</strong>iscent of the complementary result of feedbackcontrol, i.e., (3.7), that the sum of sensitivity, S, <strong>and</strong> complementary sensitivity,T, is the identity operator. However, P <strong>and</strong> M are not the direct ‘filter<strong>in</strong>g’versions of S <strong>and</strong> T; this will be discussed <strong>in</strong> more detail <strong>in</strong> §7.2.2.The follow<strong>in</strong>g subsection studies the use of the filter<strong>in</strong>g sensitivities <strong>in</strong>measur<strong>in</strong>g important properties related to the filter<strong>in</strong>g problem.7.2.1 Interpretation of the SensitivitiesConsider, for simplicity, that P <strong>and</strong> M <strong>in</strong> (7.4) are scalar, <strong>and</strong> assume furtherthat the ratio of system transfer functions H yv /H zv is proper.From the mere def<strong>in</strong>ition of P <strong>in</strong> (7.4), it is evident that |P(jω)| measuresthe magnitude of the frequency response of the estimation error relative tothe magnitude of the frequency response of the signal to be estimated. Thissuggests a criterion to specify an appropriate shape for |P(jω)| <strong>in</strong> design.Indeed, suppose for example that the frequency distribution of the process<strong>in</strong>put v is known. Then, achiev<strong>in</strong>g |P(jω)| small at those frequencies wherev has a significant magnitude will be desirable, s<strong>in</strong>ce it will reduce therelative effect of v on the estimation error.On the other h<strong>and</strong>, the filter<strong>in</strong>g complementary sensitivity M representsthe ratio between the transfer function from the process <strong>in</strong>put to the estimatedsignal <strong>and</strong> the transfer function from the process <strong>in</strong>put to the actualsignal. Thus, it is desirable to achieve |M(jω)| close to one over thefrequency range where the spectrum of v is concentrated. This will result<strong>in</strong> an impact (<strong>in</strong> magnitude) of the process <strong>in</strong>put on the estimate similarto the impact that it has on the signal to be estimated.Typically, the spectrum of v is concentrated at low frequencies, so appropriatedshapes for P <strong>and</strong> M <strong>in</strong> this case will be as shown <strong>in</strong> Figure 7.3.1.21.211|P(jω)|0.80.6|M(jω)|0.80.60.40.40.20.2010 −2 10 −1 10 0ω10 1 10 2010 −2 10 −1 10 0ω10 1 10 2FIGURE 7.3. Typical shapes for |P(jω)| <strong>and</strong> |M(jω)|.In addition, note from (7.5) that M it is the product of a proper transferfunction <strong>and</strong> the filter transfer function F. Clearly, the effect of the output


168 7. General Conceptsdisturbance, w, on the estimation error (<strong>and</strong> on the estimate itself) will bereduced if |F(jω)| is small over the range of frequencies where w is strong.This desired reduction of |F| can be readily translated <strong>in</strong>to the shape of|M(jω)|, after normaliz<strong>in</strong>g through the factor |H yv (jω)/H zv (jω)|. Notethat, s<strong>in</strong>ce F is a strictly proper transfer function, its output disturbancerejection properties are typically aimed at high-frequency disturbances,such as measurement noise.The forego<strong>in</strong>g discussion analyzed the sensitivities as relative ga<strong>in</strong>s,<strong>and</strong> gave some <strong>in</strong>sights <strong>in</strong>to desirable shapes for their frequency responses.However, dynamic <strong>in</strong>terpretations of the filter<strong>in</strong>g sensitivities can also beobta<strong>in</strong>ed for particular cases.Indeed, suppose that we express the filter as driven by the <strong>in</strong>novationsprocess (Kailath, 1968), denoted ι, <strong>and</strong> def<strong>in</strong>ed asι y − ^z, (7.8)where ^z is, <strong>in</strong> this case, an estimate of the noise-free output, i.e., z = y − w(notice that here H zv = H yv ). Then it is possible to express the filter<strong>in</strong>gsystem as a feedback loop, as shown <strong>in</strong> Figure 7.4.❜v✲H yv❜w+✲z❥❄ y ι✲ ❥✲+ +− ✻F ι^z✲FIGURE 7.4. Filter<strong>in</strong>g Feedback Loop.Loosely speak<strong>in</strong>g, then, the sensitivities will represent dynamic responsesto system disturbances <strong>in</strong> those filter<strong>in</strong>g problems where the estimate, ^z,is fed-back <strong>in</strong>to this feedback loop.The follow<strong>in</strong>g examples analyze two particular cases where the filter<strong>in</strong>gsensitivities have dynamic mean<strong>in</strong>g.Example 7.2.1 (‘Output-Filter<strong>in</strong>g’ Sensitivities). Assume that the variableto be estimated is the disturbance-free output, i.e., z <strong>in</strong> Figure 7.1 isgiven byz = y − w.In this case, we have that G y = G z <strong>in</strong> Figure 7.2, <strong>and</strong> hence the sensitivities(7.5) becomeP y = 1 − F,M y = F.(7.9)We call these functions the output-filter<strong>in</strong>g sensitivities. It is <strong>in</strong>structive toexpress these sensitivities <strong>in</strong> terms of the filter function F ι , shown <strong>in</strong> Fig-


7.2 Sensitivity Functions 169ure 7.4, which maps <strong>in</strong>novations <strong>in</strong>to estimate. We have,P y = (1 + F ι ) −1 ,M y = F ι (1 + F ι ) −1 ,(7.10)which are similar to the control sensitivities S <strong>and</strong> T. It is then evidentthat P y is the dynamic transfer between the measurement noise, w, <strong>and</strong>the <strong>in</strong>novations, ι, whilst M y is the dynamic transfer between w <strong>and</strong> theestimated variable, ^z.◦Example 7.2.2. Alternatively assume that the transfer function H yv can beexpressed asH yv = H yz H zv ,such that z = H zv v. Assume further that H yz is proper. Then, it is naturalto factor the filter function F ι <strong>in</strong> Figure 7.4 asF ι = F y F z ,for appropriate transfer functions F y <strong>and</strong> F z , <strong>and</strong> take ^z = F z ι as an estimatefor z. The correspond<strong>in</strong>g filter<strong>in</strong>g loop is shown <strong>in</strong> Figure 7.5.❜ v ✲❜ ❜w zwz ❄+ + ❄y✲ ❥✲✲ ❥ ✲ ❥ ✲ ι ✲ ^zH zv H yz F z F y ✲++ +− ✻FIGURE 7.5. Particular Filter<strong>in</strong>g Loop.It is easy to see that, <strong>in</strong> this case, the filter<strong>in</strong>g sensitivities are given byP = [1 + F z (F y − H yz )]P y ,M = F z P y H yz ,(7.11)where P y is the output-filter<strong>in</strong>g sensitivity given by (7.10). The sensitivities<strong>in</strong> (7.11) are the responses of ι <strong>and</strong> ^z to the <strong>in</strong>ternal disturbance w z ,respectively. Note that they collapse <strong>in</strong>to P y <strong>and</strong> M y if F y = H yz . The lattercondition would be a natural choice for F y .◦7.2.2 Filter<strong>in</strong>g <strong>and</strong> <strong>Control</strong> ComplementarityThe filter<strong>in</strong>g sensitivities P <strong>and</strong> M are not the direct counterpart of the controlsensitivities S <strong>and</strong> T. This is because the estimation problem depicted<strong>in</strong> Figure 7.1, when set <strong>in</strong> a control framework, is more general than theone-degree of freedom control loop where S <strong>and</strong> T are def<strong>in</strong>ed. Indeed, the“filter<strong>in</strong>g” versions of S <strong>and</strong> T would correspond to the output-filter<strong>in</strong>g


170 7. General Conceptssensitivities, i.e., the sensitivities aris<strong>in</strong>g <strong>in</strong> the problem of estimat<strong>in</strong>g thesystem noise-free output (z = y − w), as discussed <strong>in</strong> Example 7.2.1.We will next derive a complementarity constra<strong>in</strong>t for l<strong>in</strong>ear feedbackcontrol that parallels the filter<strong>in</strong>g constra<strong>in</strong>t given <strong>in</strong> (7.6). Consider thecontrol loop shown <strong>in</strong> Figure 7.6, where G is the plant transfer functiongiven by[ ]Gzv GG zu, (7.12)G yv G yu<strong>and</strong> K is a controller that stabilizes the feedback loop.❜ v ✲✲z❜dGu✲y ✲ ❥❄+0−+y❄K ✛ ❥✛+n❜FIGURE 7.6. <strong>Control</strong> Loop.This configuration is st<strong>and</strong>ard, e.g., <strong>in</strong> the H ∞ control literature. 2 Theplant G has two sets of <strong>in</strong>puts: the external <strong>in</strong>puts, v, <strong>and</strong> the control <strong>in</strong>puts,u. Also, it has a set outputs, y 0 , that are available to the controllerK after be<strong>in</strong>g corrupted with output disturbances, d, <strong>and</strong> sensor noise, n,<strong>and</strong> a set of signals of <strong>in</strong>terest, z, which are generally the signals to becontrolled or regulated.The “classical” sensitivity <strong>and</strong> complementary sensitivity, S <strong>and</strong> T, aredef<strong>in</strong>ed for a simpler unity feedback control loop (see Figure 3.2 <strong>in</strong> Chapter3), where they are directly connected with performance, disturbancerejection <strong>and</strong> robustness properties (cf. §2.2.2 <strong>in</strong> Chapter 2). However, aswe see next, they can also be def<strong>in</strong>ed for the feedback loop <strong>in</strong> Figure 7.6.Introduce first the notation H ba to <strong>in</strong>dicate the total mapp<strong>in</strong>g – possiblynonphysical – from signal a to signal b, i.e., after comb<strong>in</strong><strong>in</strong>g (add<strong>in</strong>g,compos<strong>in</strong>g, etc.) all the ways <strong>in</strong> which a is a function of b <strong>and</strong> solv<strong>in</strong>g anyfeedback loops that may exist. 3 Then, we have thatS = H yd = (I + G yu K) −1 ,T = −H yn = G yu K(I + G yu K) −1 .As seen <strong>in</strong> Chapter 3, S <strong>and</strong> T satisfy the complementarity constra<strong>in</strong>tS + T = I.(7.13)2 In the st<strong>and</strong>ard configuration, <strong>in</strong> fact, d <strong>and</strong> n are absorbed <strong>in</strong>to v. Here we choose todraw them explicitly to emphasize the way <strong>in</strong> which they affect the output y.3 Cf. the notation H ba <strong>in</strong>troduced at the end of §7.1.


7.2 Sensitivity Functions 171This relation <strong>in</strong>volves mapp<strong>in</strong>gs from external <strong>in</strong>puts that are directly <strong>in</strong>jected<strong>in</strong>to the loop (i.e., without <strong>in</strong>termediate dynamics), to system signalsthat are fed-back <strong>in</strong> the same loop (see Figure 7.6). A more generalresult, comparable to the filter<strong>in</strong>g relation given <strong>in</strong> (7.7), can be developedfor the configuration of Figure 7.6. This structural constra<strong>in</strong>t <strong>in</strong>volvesmapp<strong>in</strong>gs from external <strong>in</strong>puts that are <strong>in</strong>jected dynamically <strong>in</strong> the loop,to some <strong>in</strong>ternal variables, e.g., the system (comb<strong>in</strong>ation of) states, z, whichare not directly fed-back. This result is stated below.Theorem 7.2.2. Consider the general control loop of Figure 7.6, where theplant G is given by (7.12). The total mapp<strong>in</strong>gs, H zv <strong>and</strong> H zd , from theexternal <strong>in</strong>puts v <strong>and</strong> d to the <strong>in</strong>ternal variables z, satisfy the follow<strong>in</strong>gstructural relation:H zv + (−H zd )G yv = G zv . (7.14)Proof. From Figure 7.6, <strong>and</strong> the def<strong>in</strong>itions <strong>in</strong> (7.12) <strong>and</strong> (7.13), we have, <strong>in</strong>the Laplace transform doma<strong>in</strong><strong>and</strong>Y 0 = SG yv V + TD,Z = G zv V + G zu K(D − Y 0 )= (G zv − G zu KSG yv )V + G zu KSD.(7.15)ThenH zv = G zv − G zu KSG yv ,H zd = G zu KS,from which (7.14) follows.□Notice that, when consider<strong>in</strong>g mapp<strong>in</strong>gs to z, the sensor noise, n, <strong>in</strong>Figure 7.6 can be set to zero s<strong>in</strong>ce its <strong>in</strong>fluence on z is the same as that of d.A complementarity constra<strong>in</strong>t that parallels (7.6) can be readily obta<strong>in</strong>edby multiply<strong>in</strong>g through both sides of (7.14) by G −1zv <strong>and</strong> mak<strong>in</strong>g similardef<strong>in</strong>itions to those <strong>in</strong> (7.4).The relation given <strong>in</strong> (7.14) is clearly analogous to that of (7.7). This <strong>in</strong>dicatesthat the filter<strong>in</strong>g problem considered <strong>in</strong> §7.2 fits <strong>in</strong>to the structureof the control configuration of Figure 7.6. In the search for generality, however,we have gone too far: we now need to select the particular choices ofG <strong>and</strong> K that will give the exact control counterpart of (7.7).Mak<strong>in</strong>g the choices of <strong>in</strong>puts <strong>and</strong> outputs as shown <strong>in</strong> Figure 7.7, weconclude that the controller K corresponds to the filter F, whilst the plantG corresponds to the matrix[ ]Gzv −IG f .G yv 0


172 7. General Concepts❜ v ✲✲˜z❜w^z✲G fFy ✲ ❥❄+0−y✛FIGURE 7.7. Equivalent Filter<strong>in</strong>g Loop.Observe that sett<strong>in</strong>g the entry (2,2) of G f equal to zero amounts to open<strong>in</strong>gthe loop (S = I, T = 0 <strong>in</strong> (7.13)), but this is the way <strong>in</strong> which we haveset up the filter<strong>in</strong>g problem <strong>in</strong> §7.2. In the context of bounded error filter<strong>in</strong>g,however, we will soon see that F satisfies conditions that ensureboundedness of ˜z. There is then, ideally, no need for feedback if these conditionshold.Although a feedback loop can be made explicit by consider<strong>in</strong>g the filterdriven by the <strong>in</strong>novations, as we have seen <strong>in</strong> §7.2.1, note that the merepresence of feedback does not make the observer a closed loop <strong>in</strong> general(i.e., when the signal to be estimated is not just the disturbance-free output,i.e., z ≠ y − w). Indeed, it was shown by Bhattacharyya (1976) that anobserver is a closed-loop system if <strong>and</strong> only if it can be expressed as a systemdriven by the estimation error ˜z = z − ^z. Moreover, Bhattacharyya (1976)proved that an observer must be a closed-loop system if it is to provideobserver action despite arbitrarily small perturbations of the system statespace matrices.7.3 Bounded Error EstimatorsIt is natural to require that a state estimator, <strong>in</strong> addition to be<strong>in</strong>g stable,should produce a bounded estimation error when all the signals enter<strong>in</strong>gthe system are bounded. Thus, from the general class of stable estimators,we will make a mild restriction to bounded error estimators (BEEs) accord<strong>in</strong>gto the follow<strong>in</strong>g def<strong>in</strong>ition.Def<strong>in</strong>ition 7.3.1 (BEE). We say that a stable filter is a BEE if, for all f<strong>in</strong>ite<strong>in</strong>itial states of the plant <strong>and</strong> the filter, the estimation error ˜z is boundedwhenever the system <strong>in</strong>puts v <strong>and</strong> w are bounded.◦Let x 0 be the <strong>in</strong>itial state of the plant, <strong>and</strong> let V <strong>and</strong> W be the Laplacetransforms of the process <strong>in</strong>put <strong>and</strong> measurement noise, respectively. Itthen follows from Figures 7.1 <strong>and</strong> 7.2, <strong>and</strong> (7.1) that — modulo exponentiallydecay<strong>in</strong>g terms due to the filter <strong>in</strong>itial state — the Laplace transform


7.3 Bounded Error Estimators 173of the estimation error, denoted by ˜Z, is given by˜Z = ˜G 0 x 0 + (G z − FG y )V − FW, (7.16)where˜G 0 (s) [C 1 − F(s)C 2 ](sI − A) −1 (7.17)leads the estimation error component due to the plant <strong>in</strong>itial state.Thus, s<strong>in</strong>ce the filter F is stable <strong>and</strong> the plant G — i.e., the pair (A, B) —is stabilizable, a necessary <strong>and</strong> sufficient condition for F to be BEE is thatthe transfer function G z − FG y be stable. Note that this means that• unstable poles shared by G z <strong>and</strong> G y must be canceled 4 by a nonm<strong>in</strong>imumphase zero of the difference G z − FG y , while• unstable poles of G y that do not appear <strong>in</strong> G z must be canceled bynonm<strong>in</strong>imum phase zeros of F.If (A, B) is not stabilizable then a necessary <strong>and</strong> sufficient condition for Fto be BEE is that the transfer function ˜G 0 <strong>in</strong> (7.17) be stable. In Chapters 10<strong>and</strong> 11, we will consider predictors <strong>and</strong> smoothers based on BEEs thatperform full-state filter<strong>in</strong>g. For future reference then, we note that, if F is<strong>in</strong>tended to estimate the full state, F is a BEE if <strong>and</strong> only if the transferfunction˜G 0 (s) = [I − F(s)C 2 ](sI − A) −1 (7.18)is stable.The BEE assumption is the m<strong>in</strong>imum that can be required from a filter<strong>and</strong>, therefore, it underlies numerous state estimation problems <strong>in</strong> theliterature. The class of BEEs <strong>in</strong>cludes, for example, observers of the form(7.3) that assign pre-specified stable dynamics to the estimation error. Indeed,assume that the external <strong>in</strong>puts to the plant <strong>in</strong> (7.2) are zero <strong>and</strong>write the correspond<strong>in</strong>g dynamics of the state estimation error, ˜x x− ^Cξ,as˙˜x = (A − ^CK y C 2 )x − ^C ^Aξ .Assume next that we desire that ˜x evolve as˙˜x = Øx, (7.19)where à is a stability matrix of the same dimensions as A. This will occurif <strong>and</strong> only if the matrices ^A, K y <strong>and</strong> ^C satisfyA − ^CK y C 2 = à <strong>and</strong> ^C ^A = à ^C. (7.20)It is clear that a filter satisfy<strong>in</strong>g (7.19) <strong>and</strong> (7.20) is a BEE.4 In frequency <strong>and</strong> direction <strong>in</strong> the multivariable case.


174 7. General ConceptsBEEs do not, <strong>in</strong> general, assign the error dynamics, s<strong>in</strong>ce the estimationerror component due to the <strong>in</strong>itial state of the observer may decay withdifferent dynamics than the component due to the plant <strong>in</strong>itial state. Tofurther see the difference, note that the estimation error correspond<strong>in</strong>g toa general BEE for full-state estimation satisfies the follow<strong>in</strong>g equations[ẋ ] [ [ ]A 0 x= ˙ξ K y C 2^A],ξ˜x = [ I− ^C ] [ xξ],(7.21)while the estimation error correspond<strong>in</strong>g to an observer with pre-specifiederror dynamics à satisfies (7.19). We can then argue that the concept ofbounded error estimation corresponds to mak<strong>in</strong>g the unstable modes ofthe plant unobservable from the estimation error, whilst the subclass thatachieves pre-specified error dynamics actually assigns the eigenvalues ofthe estimation error. The follow<strong>in</strong>g examples illustrate these po<strong>in</strong>ts.Example 7.3.1. We want to estimate the state of the scalar plant ẋ = x, y =x. A filter F 1 , of the form (7.3), given by[ ] [ ]0 1 −2˙ξ =ξ + y,−1 −2 2^x = [ 0 1 ] ξ,is a BEE for this plant, s<strong>in</strong>ce ˜G 0 <strong>in</strong> (7.16) becomes, us<strong>in</strong>g the above data,˜G 0 (s) = (s2 − 1) 1(s 2 + 2s + 1) (s − 1)= 1s + 1 ,which is stable. However, this filter does not assign error dynamics, s<strong>in</strong>ce(7.20) is not satisfied, i.e.,^C ^A − A ^C = [−1 − 3] ≠ − ^CK y C 2 ^C = [0 2].On the other h<strong>and</strong>, the filter F 2 given by[ ]0 1˙ξ =ξ +−1 −2^x = [ 1 1 ] ξ.[02]y,is also a BEE <strong>and</strong>, moreover, achieves an estimation error with dynamics˙˜x = −˜x, <strong>and</strong> thus satisfies (7.20) with à = −1. Figure 7.8 shows the timeresponse of both filters to an <strong>in</strong>itial state error.◦


7.3 Bounded Error Estimators 1754stateestimate4stateestimate332211000 2 4 6t0 2 4 6tFIGURE 7.8. Estimates of ẋ = x: general BEE (left) <strong>and</strong> observer that assigns theerror dynamics (right).Example 7.3.2. Consider the special case of an error driven stable observerfor the system (7.2), hav<strong>in</strong>g the same order as the system. Such an observermay be of the form (7.3) where K y is such that ^A = A − K y C 2 is a stabilitymatrix <strong>and</strong> ^C = I, i.e.,˙^x = (A − K y C 2 )^x + K u u + K y y. (7.22)The observer (7.22) is easily seen to be a BEE for full-state estimate s<strong>in</strong>ce˜G 0 = [I − (sI − A + K y C 2 ) −1 K y C 2 ](sI − A) −1= [I + (sI − A) −1 K y C 2 ] −1 (sI − A) −1= (sI − A + K y C 2 ) −1 ,(7.23)which is stable. Moreover, the error dynamics are given by à = ^A.General BEEs do not specify the way <strong>in</strong> which the filter is to treat themeasured <strong>in</strong>put, if available. Thus, they may be biased, i.e., the estimationerror has a component driven by the measured <strong>in</strong>put. On the other h<strong>and</strong>,unbiased filters are designed to process the measured <strong>in</strong>put <strong>in</strong> the sameway as the plant does, as seen <strong>in</strong> the follow<strong>in</strong>g subsection.7.3.1 Unbiased EstimatorsIn this section we discuss a special subclass of BEEs, namely, that of unbiasedestimators. The dist<strong>in</strong>guish<strong>in</strong>g feature of this type of BEE is that theyachieve a total decoupl<strong>in</strong>g of the estimation error from the measured <strong>in</strong>put.The formal def<strong>in</strong>ition of this subclass is given below (Middleton <strong>and</strong>Goodw<strong>in</strong>, 1990).◦


176 7. General ConceptsDef<strong>in</strong>ition 7.3.2 (Unbiased Estimator). We say that a stable filter for theplant (7.2) is unbiased if, whenever v ≡ 0 <strong>and</strong> w ≡ 0, for all measured<strong>in</strong>puts u, <strong>and</strong> for all f<strong>in</strong>ite <strong>in</strong>itial states of the plant <strong>and</strong> the filter, the estimationerror ˜z decays asymptotically to zero.◦We then have the follow<strong>in</strong>g result.Lemma 7.3.1. A stable state estimator for the system (7.2) is unbiased if<strong>and</strong> only if(i) it is a BEE <strong>and</strong>(ii) the transfer function from the measured <strong>in</strong>put u to the estimationerror, H ˜xu , is identically zero.Proof. Immediate from Def<strong>in</strong>ition 7.3.2 on writ<strong>in</strong>g the expression for theestimation error.□Example 7.3.3. Consider aga<strong>in</strong> Example 7.3.2. Note that the special observer(7.22) is an unbiased estimator if the particular choice K u = B u ismade. This can be seen on not<strong>in</strong>g thatH ˜xu (s) = (sI − A + K y C 2 ) −1 B u − (sI − ^A) −1 K u ≡ 0us<strong>in</strong>g (7.23). This particular observer is generally known as an identity observer.◦As the above example suggests, unbiased estimators of the form (7.3)for (7.2) are observers that assign the error dynamics which, <strong>in</strong> addition to(7.20), satisfy the constra<strong>in</strong>t^CK u = B u . (7.24)A nice feature of unbiased filters is that separation applies, that is, if usedto control the plant, then the result<strong>in</strong>g closed loop is automatically stableprovided the state estimates are fed-back with a ga<strong>in</strong> such that, if the samega<strong>in</strong> were to be used on the true states, then stability would result. This iswhy the unbiasedness requirement is frequently presupposed <strong>in</strong> the def<strong>in</strong>itionsof observers appear<strong>in</strong>g <strong>in</strong> the control literature. However, we willshow <strong>in</strong> subsequent chapters that there are <strong>in</strong>herent trade-offs <strong>in</strong> filter designthat affect all BEEs, whether unbiased or not.7.4 SummaryThis chapter has <strong>in</strong>troduced the notation <strong>and</strong> def<strong>in</strong>itions necessary for thedevelopment of the rema<strong>in</strong><strong>in</strong>g chapters <strong>in</strong> this part. The ma<strong>in</strong> conceptshere are the filter<strong>in</strong>g sensitivity functions <strong>and</strong> the class of BEEs. The filter<strong>in</strong>gsensitivity functions play a role similar to their control counterparts;


7.4 Summary 177we have discussed <strong>in</strong> this chapter some important <strong>in</strong>terpretations <strong>in</strong> thecontext of l<strong>in</strong>ear filter<strong>in</strong>g design, <strong>and</strong> their connections to a general controlparadigm.The filter<strong>in</strong>g sensitivities are the basis for the theory of design limitationsdeveloped <strong>in</strong> the next chapters. This theory of limitations holds fora wide class of l<strong>in</strong>ear filter<strong>in</strong>g problems, namely, those based on BEEs,which are estimators that produce a bounded estimation error under anypossible <strong>in</strong>put driv<strong>in</strong>g the system.Notes <strong>and</strong> ReferencesThe def<strong>in</strong>ition of filter<strong>in</strong>g sensitivity functions is taken from Seron <strong>and</strong> Goodw<strong>in</strong>(1995). Goodw<strong>in</strong> et al. (1995) <strong>in</strong>troduced similar functions for the case of unbiasedestimators. BEEs were also <strong>in</strong>troduced <strong>in</strong> Seron <strong>and</strong> Goodw<strong>in</strong> (1995); see Seron(1995) for a more detailed discussion.In view of its generality, the class of l<strong>in</strong>ear BEEs encompasses various specificstate estimators that have been proposed <strong>in</strong> contemporary literature. For example,robust H ∞ optimization designs, <strong>in</strong> general lead to BEEs that are not necessarilyunbiased (see e.g., de Souza et al., 1995). The bias results from the fact that, <strong>in</strong> thepresence of plant parameter uncerta<strong>in</strong>ty, the component of the estimation errordue to the known <strong>in</strong>put signal u cannot, <strong>in</strong> general, be completely canceled. Infact, <strong>in</strong> de Souza et al. (1995) the authors have shown, through comparison withKalman filter <strong>and</strong> nom<strong>in</strong>al H ∞ filter design, that the biased component of theestimation error may be m<strong>in</strong>imized by allow<strong>in</strong>g the filter to be biased from theoutset.For further read<strong>in</strong>g on filters <strong>and</strong> observers see e.g., the textbooks Anderson<strong>and</strong> Moore (1979) <strong>and</strong> O’Reilly (1983).


8SISO Filter<strong>in</strong>gIn this chapter we exam<strong>in</strong>e the fundamental design trade-offs that applyto l<strong>in</strong>ear scalar filter<strong>in</strong>g problems based on bounded error estimators. Wewill see that, due to the condition of bounded error estimation, the filter<strong>in</strong>gsensitivity functions <strong>in</strong>troduced <strong>in</strong> the last chapter are necessarilyconstra<strong>in</strong>ed at po<strong>in</strong>ts <strong>in</strong> the complex plane determ<strong>in</strong>ed by ORHP poles<strong>and</strong> zeros of the plant. These <strong>in</strong>terpolation constra<strong>in</strong>ts, <strong>in</strong> turn, translate<strong>in</strong>to Poisson <strong>and</strong> Bode-type <strong>in</strong>tegral relations, which show essential limitations<strong>in</strong> the achievable performance, <strong>and</strong> <strong>in</strong>duce clear trade-offs <strong>in</strong> filterdesign.8.1 Interpolation Constra<strong>in</strong>tsWe have seen <strong>in</strong> Chapter 3 that, for the case of feedback control <strong>and</strong> underm<strong>in</strong>imality assumption, S has zeros at the unstable open-loop plantpoles <strong>and</strong> T has zeros at the nonm<strong>in</strong>imum phase open-loop plant zeros. Inthis section we consider the filter<strong>in</strong>g sensitivities result<strong>in</strong>g from any scalarBEE, <strong>and</strong> show that right half plane poles <strong>and</strong> zeros of the plant transfermatrices G y <strong>and</strong> G z impose constra<strong>in</strong>ts on particular complex values of P<strong>and</strong> M given <strong>in</strong> (7.5) of Chapter 7.First, it is <strong>in</strong>terest<strong>in</strong>g to observe that P <strong>and</strong> M <strong>in</strong> (7.5) are not necessarilyanalytic <strong>in</strong> the ORHP s<strong>in</strong>ce G z may have ORHP zeros (that are not canceled<strong>in</strong> the division). This fact differs from the common assumption <strong>in</strong>control theory that the sensitivity <strong>and</strong> complementary sensitivity are ana-


180 8. SISO Filter<strong>in</strong>glytic <strong>in</strong> the ORHP. Note, however, that the more general control sensitivitiesthat can be def<strong>in</strong>ed from (7.14) <strong>in</strong> Chapter 7 are potentially nonanalytic<strong>in</strong> the ORHP if H zv is nonm<strong>in</strong>imum phase.For the problem of filter<strong>in</strong>g with BEEs, we have the follow<strong>in</strong>g result onthe analyticity of P <strong>and</strong> M <strong>in</strong> the ORHP.Lemma 8.1.1. Consider the filter<strong>in</strong>g sensitivities given <strong>in</strong> (7.5) <strong>and</strong> assumethat F is a BEE. Then P <strong>and</strong> M are analytic <strong>in</strong> the ORHP if <strong>and</strong> only if eitherof the follow<strong>in</strong>g conditions hold.(i) G z is m<strong>in</strong>imum phase.(ii) Every ORHP zero of G z is also a zero of the product FG y .Proof. Immediate from (7.5).□The follow<strong>in</strong>g lemma formalizes the <strong>in</strong>terpolation constra<strong>in</strong>ts that P <strong>and</strong>M must satisfy at the ORHP poles <strong>and</strong> zeros of G y <strong>and</strong> G z .Lemma 8.1.2 (Interpolation Constra<strong>in</strong>ts). Assume that F <strong>in</strong> (7.5) is a BEE.Then P <strong>and</strong> M must satisfy the follow<strong>in</strong>g conditions.(i) If p ∈ ¢ + is a pole of G z , thenP(p) = 0, <strong>and</strong> M(p) = 1.(ii) If q ∈ ¢ + is a zero of G y that is not also zero of G z , thenP(q) = 1, <strong>and</strong> M(q) = 0.(iii) If q ∈ ¢ + is a zero of G z that is not also zero of FG y , then P(s) <strong>and</strong>M(s) have a pole at s = q.Proof. Case (i) readily follows from (7.5) by not<strong>in</strong>g that the term G z − FG yis stable if F is a BEE. Case (ii) is immediate from (7.5) s<strong>in</strong>ce F is stable.F<strong>in</strong>ally, case (iii) follows from Lemma 8.1.1.□We <strong>in</strong>troduce for convenience the follow<strong>in</strong>g notation for the ORHP zerosof P <strong>and</strong> M: 1 Z P {s ∈ ¢ + : P(s) = 0} ,Z M {s ∈ ¢ + : M(s) = 0} .(8.1)Then, Lemma 8.1.2 establishes that Z P <strong>in</strong>cludes the set of ORHP poles ofG z <strong>and</strong> Z M conta<strong>in</strong>s ORHP zeros of G y . As <strong>in</strong> the control case, we have1 Recall that, when def<strong>in</strong><strong>in</strong>g a set of zeros of a transfer function, the zeros are repeatedaccord<strong>in</strong>g to their multiplicities.


8.2 Integral Constra<strong>in</strong>ts 181then translated the plant characteristics of <strong>in</strong>stability <strong>and</strong> “nonm<strong>in</strong>imumphaseness” <strong>in</strong>to properties that the functions P <strong>and</strong> M must satisfy <strong>in</strong> theORHP. Note that Z P also conta<strong>in</strong>s those ORHP zeros of (G z − FG y ) thatare not also zeros of G z <strong>and</strong> Z M also conta<strong>in</strong>s those ORHP zeros of F thatare not also zeros of the (possibly nonproper) transfer function G z /G y .Notice that s<strong>in</strong>ce the plant is detectable from y, the transfer function G yhas all the unstable poles of G z . In consequence, G y /G z — <strong>and</strong> hence M— has no zeros at any unstable pole of G.The <strong>in</strong>terpolation constra<strong>in</strong>ts given <strong>in</strong> Lemma 8.1.2 will be translated<strong>in</strong>to constra<strong>in</strong>ts on the frequency response of the sensitivities <strong>in</strong> the follow<strong>in</strong>gsection.8.2 Integral Constra<strong>in</strong>tsIn this section we use the <strong>in</strong>terpolation constra<strong>in</strong>ts of Lemma 8.1.2 to derivePoisson <strong>and</strong> Bode-type <strong>in</strong>tegral relations satisfied by P <strong>and</strong> M. Theserelations constra<strong>in</strong> the values of the filter<strong>in</strong>g sensitivities on the imag<strong>in</strong>aryaxis. The Poisson <strong>in</strong>tegral constra<strong>in</strong>ts on P <strong>and</strong> M are derived us<strong>in</strong>gthe Poisson <strong>in</strong>tegral formulae for the recovery of a function, analytic <strong>in</strong> theORHP, from its values on the imag<strong>in</strong>ary axis (see §A.6.1 <strong>in</strong> Appendix A).Let the sets (8.1) be given byZ P = {p i : i = 1, . . . , n p } ,Z M = {q i : i = 1, . . . , n q } ,<strong>and</strong> let the correspond<strong>in</strong>g Blaschke products be 2B P (s) =n p∏i=1ns − p i∏ q, <strong>and</strong> B M (s) =s + p iWe also <strong>in</strong>troduce the follow<strong>in</strong>g sets of zeros.i=1s − q is + q i. (8.2)Z z {s ∈ ¢ + : G z (s) = 0 <strong>and</strong> F(s)G y (s) ≠ 0} ,Z y {s ∈ ¢ + : G y (s) = 0 <strong>and</strong> G z (s) ≠ 0} ,Z 1/z {s ∈ ¢ + : G −1z (s) = 0} .(8.3)Note that the set of zeros of P <strong>in</strong>cludes Z 1/z , i.e.,Z P ⊃ Z 1/z , (8.4)2 Recall that, if the set Z orig<strong>in</strong>at<strong>in</strong>g the Blaschke product B is empty, then we def<strong>in</strong>eB(s) = 1, ∀s ∈ .


182 8. SISO Filter<strong>in</strong>gthe set of zeros of M <strong>in</strong>cludes Z y , i.e.,Z M ⊃ Z y , (8.5)<strong>and</strong> Z z is the set of ORHP poles of P <strong>and</strong> M. Denote the Blaschke productscorrespond<strong>in</strong>g to the sets <strong>in</strong> (8.3) by B z , B y <strong>and</strong> B 1/z . It is then easy to seethat we can factor P <strong>and</strong> M asP(s) = ˜P(s)B P (s)/B z (s) ,M(s) = ˜M(s)B M (s)/B z (s) ,(8.6)where ˜P <strong>and</strong> ˜M have no zeros or poles <strong>in</strong> the ORHP. We then obta<strong>in</strong> thefollow<strong>in</strong>g <strong>in</strong>tegral constra<strong>in</strong>ts.Theorem 8.2.1 (Poisson Integral for P). Assume that F <strong>in</strong> (7.5) is a BEE.Let q = σ q + jω q , σ q > 0, be a zero of G y that is not also zero of G z (i.e.,q ∈ Z y <strong>in</strong> (8.3)). Then, assum<strong>in</strong>g that P is proper,∫ ∞−∞σ qlog |P(jω)|σ 2 q + (ω q − ω) 2 dω = π log ∣ ∣B −1P(q)∣ ∣ − π log ∣ ∣B −1z (q) ∣ .(8.7)Proof. The proof follows by us<strong>in</strong>g factorization (8.6) <strong>and</strong> apply<strong>in</strong>g the Poisson<strong>in</strong>tegral formula (Theorem A.6.1 <strong>in</strong> Appendix A) to log ˜P. □Theorem 8.2.2 (Poisson Integral for M). Suppose that F <strong>in</strong> (7.5) is a BEE.Let p = σ p + jω p , σ p > 0, be a pole of G z (i.e., p ∈ Z 1/z <strong>in</strong> (8.3)). Then,assum<strong>in</strong>g that M is proper,∫ ∞−∞σ plog |M(jω)|σ 2 p + (ω p − ω) 2 dω = π log ∣ B−1M (p)∣ ∣∣ − π log ∣B −1z (p) ∣ .(8.8)Proof. Same as the proof of Theorem 8.2.1, this time us<strong>in</strong>g ˜M.□The results <strong>in</strong> Theorems 8.2.1 <strong>and</strong> 8.2.2 are affected by the particularchoice of filter, s<strong>in</strong>ce the Blaschke product B z on the RHSs of (8.7) <strong>and</strong>(8.8) depends on F. In particular, we see that the <strong>in</strong>tegral constra<strong>in</strong>ts on P<strong>and</strong> M are relaxed by nonm<strong>in</strong>imum phase zeros of G z that are not sharedwith FG y . An explanation for this may be given if we th<strong>in</strong>k of a nonm<strong>in</strong>imumphase zero as a “mild time-delay”. 3 Then, a delay <strong>in</strong> the signal tobe estimated, with respect to the measured signals, would allow for moretime to compute a better estimate — <strong>in</strong> the sense discussed <strong>in</strong> §7.2.1 of3 This <strong>in</strong>tuitively follows from the fact that a time-delay, analyzed by its Padé approximations(e.g., Middleton <strong>and</strong> Goodw<strong>in</strong>, 1990), may be seen as conta<strong>in</strong><strong>in</strong>g an arbitrarily largenumber of nonm<strong>in</strong>imum phase zeros.


8.2 Integral Constra<strong>in</strong>ts 183Chapter 7. An extreme case of this phenomenon can be found <strong>in</strong> the processof smooth<strong>in</strong>g, where the signal to be estimated is artificially delayed.See Chapters 10 <strong>and</strong> 11 for more discussion.Constra<strong>in</strong>ts that are completely <strong>in</strong>dependent of the estimator parametersare readily obta<strong>in</strong>ed from these results under an additional condition,as we see <strong>in</strong> the follow<strong>in</strong>g corollary.Corollary 8.2.3. Suppose that P <strong>and</strong> M <strong>in</strong> (7.5) are proper <strong>and</strong> that F isa BEE. Consider the sets def<strong>in</strong>ed <strong>in</strong> (8.3), <strong>and</strong> assume further that if thereexists a complex number s 0 <strong>in</strong> ¢ + such that G z (s 0 ) = 0 then also G y (s 0 ) =0. Then,(i) if q = σ q + jω q ∈ Z y , we have that∫ ∞−∞σ q∣ ∣ ∣∣Blog |P(jω)|σ 2 q + (ω q − ω) 2 dω ≥ π log −1 ∣∣1/z (q) ; (8.9)(ii) if p = σ p + jω p ∈ Z 1/z , we have that∫ ∞−∞σ plog |M(jω)|σ 2 p + (ω p − ω) 2 dω ≥ π log ∣ B−1y (p) ∣ . (8.10)Proof. First notice that the assumption that every nonm<strong>in</strong>imum phase zeroof G z is also a zero of G y implies that the set Z z is empty, so B z disappearsfrom the RHSs of (8.7) <strong>and</strong> (8.8). Then, s<strong>in</strong>ce Z P <strong>in</strong> (8.1) <strong>in</strong>cludes the setZ 1/z def<strong>in</strong>ed <strong>in</strong> (8.3), the Blaschke product of the ORHP zeros of P, B P , hasB 1/z as a factor <strong>and</strong> hence log |B −1P(q)| ≥ log |B−11/z(q)|. Thus, <strong>in</strong>equality(8.9) follows from (8.7). In a similar way, <strong>in</strong>equality (8.10) follows from(8.8) on not<strong>in</strong>g that Z M ⊃ Z y . □From the above results it can be <strong>in</strong>ferred that reduction of |P(jω)| or|M(jω)| <strong>in</strong> one frequency range must be compensated by an <strong>in</strong>crease <strong>in</strong>another frequency range. The presence of ORHP poles <strong>and</strong> zeros <strong>in</strong> thosetransfer functions that l<strong>in</strong>k the process <strong>in</strong>put v to the output y <strong>and</strong> targetsignal z, lead to more str<strong>in</strong>gent requirements. This will be discussed <strong>in</strong>more detail <strong>in</strong> §8.3.Bode’s <strong>in</strong>tegral theorems of feedback control (cf. §3.1.2 <strong>in</strong> Chapter 3)also have filter<strong>in</strong>g analogs, as shown next.Theorem 8.2.4 (Bode Integral for P). Suppose that P <strong>in</strong> (7.5) is proper <strong>and</strong>that F is a BEE. Let Z P = {p i : i = 1, . . . , n p } be the set of zeros of P <strong>in</strong> theORHP <strong>and</strong> let Z z <strong>in</strong> (8.3) be given by Z z = {ζ i : i = 1, . . . , n ζ }. Then, ifP(j∞) ≠ 0∫ ∞0logP(jω)∣P(j∞)∣ dω = π 2 lims→∞s[P(s) − P(∞)]P(∞)n p∑ ∑+ π p i − π ζ i . (8.11)i=1n ζi=1


184 8. SISO Filter<strong>in</strong>gProof. Similar to the proof of (3.1.4) <strong>in</strong> Chapter 3. Note that here the contourof Figure 3.3 will have <strong>in</strong>dentations <strong>in</strong>to the right half plane that avoidthe branch cuts of log P correspond<strong>in</strong>g to both the ORHP zeros <strong>and</strong> polesof P.□Theorem 8.2.5 (Bode Integral for M). Assume that M <strong>in</strong> (7.5) is proper<strong>and</strong> that F is a BEE. Let Z M = {q i : i = 1, . . . , n q } be the set of zeros of M<strong>in</strong> the ORHP <strong>and</strong> let Z z <strong>in</strong> (8.3) be given by Z z = {ζ i : i = 1, . . . , n ζ }. Then,if M(0) ≠ 0∫ ∞0logM(jω)∣ M(0) ∣dωω 2 = π 21M(0) lim dM(s)s→0 dsn q∑+ πi=1n1 ∑ ζ− πq ii=11ζ i. (8.12)Proof. Same as the proof of Theorem 8.2.4.□As we see <strong>in</strong> (8.11) <strong>and</strong> (8.12), nonm<strong>in</strong>imum phase zeros of G z that arenot zeros of FG y have an effect on the Bode <strong>in</strong>tegral constra<strong>in</strong>ts that is consistentwith that discussed above for Poisson <strong>in</strong>tegral constra<strong>in</strong>ts, i.e., theyrelax the constra<strong>in</strong>ts. On the other h<strong>and</strong>, we see <strong>in</strong> (8.12) — as well as <strong>in</strong>(8.8) — that the presence of nonm<strong>in</strong>imum phase zeros of G y that are notzeros of G z can only make the constra<strong>in</strong>ts more severe, s<strong>in</strong>ce Z M ⊃ Z y .This can be <strong>in</strong>terpreted as these nonm<strong>in</strong>imum phase zeros “block<strong>in</strong>g”,this time, the measured signal from which the estimates are constructed.It is not surpris<strong>in</strong>g, then, that more severe limits on the achievable performanceshould be expected. As we will see <strong>in</strong> Chapters 10 <strong>and</strong> 11, theproblem of prediction exemplifies an extreme case of this latter situation.In the follow<strong>in</strong>g section we discuss design implications of the constra<strong>in</strong>tsgiven by the Poisson <strong>in</strong>tegrals on P <strong>and</strong> M.8.3 Design InterpretationsThe objective of this section is to illustrate how the constra<strong>in</strong>ts imposed bythe <strong>in</strong>tegral relations given <strong>in</strong> the §8.2 can be translated <strong>in</strong>to restrictions onthe shape of the frequency responses of the filter<strong>in</strong>g sensitivities.To beg<strong>in</strong> with, note that, assum<strong>in</strong>g G z m<strong>in</strong>imum phase, a direct applicationof (8.9) <strong>and</strong> (8.10) gives lower bounds on the H ∞ norms of P <strong>and</strong>M, i.e.,‖P‖ ∞ ≥ maxq∈Z y∣ ∣∣B −11/z (q) ∣ ∣∣ ,‖M‖ ∞ ≥ maxp∈Z 1/z∣ ∣B −1y (p) ∣ ∣ .(8.13)This implies, for example, that both ‖P‖ ∞ > 1 <strong>and</strong> ‖M‖ ∞ > 1 if G z isunstable <strong>and</strong> m<strong>in</strong>imum phase <strong>and</strong> G y is nonm<strong>in</strong>imum phase. Moreover,


8.3 Design Interpretations 185these bounds hold for all possible designs. More <strong>in</strong>formation, however,can be obta<strong>in</strong>ed if we assume that certa<strong>in</strong> natural design goals are imposed.From the discussion <strong>in</strong> §7.2.1 of Chapter 7, it is clear that the relativeeffect of the process disturbance on the estimation error will be reducedif |P(jω)| is small at the frequencies where the power of the process noisev is concentrated. A similar conclusion holds for |M(jω)| <strong>and</strong> the <strong>in</strong>fluenceof the measurement noise w on the estimation error. In general, asdiscussed <strong>in</strong> §7.2.1, the spectrum of v is typically concentrated at low frequencies,while the spectrum of w is typically more significant at highfrequencies. This suggests that a reasonable approach to design would beto make |P(jω)| small at low frequencies <strong>and</strong> |M(jω)| small at high frequencies.Thus, as previously discussed, a value of |M(jω)| close to one,<strong>in</strong> the range of frequencies where the spectrum of v is large, will avoid adeleterious impact of the process disturbance on the estimate.The forego<strong>in</strong>g discussion <strong>in</strong>dicates that typical requirements on the filter<strong>in</strong>gsensitivities are captured, for example, by the follow<strong>in</strong>g specifications.|P(jω)| < α 1 for ω ∈ [0, ω 1 ] , (8.14)|M(jω)| < α 2 for ω ∈ [ω 2 , ∞] , (8.15)where α 1 <strong>and</strong> α 2 are small positive numbers. Notice that (8.15) can also beset <strong>in</strong> terms of P, tak<strong>in</strong>g <strong>in</strong>to account the complementarity relation (7.6),i.e.,|P(jω)| < 1 + α 2 for ω ∈ [ω 2 , ∞] . (8.16)By the same token, an alternative way of express<strong>in</strong>g (8.14) is|M(jω)| < 1 + α 1 for ω ∈ [0, ω 1 ] . (8.17)Figure 8.1 expresses graphically these requirements on the frequency responseof P. These specifications are similar to the design objectives <strong>in</strong>feedback control, with P <strong>and</strong> M play<strong>in</strong>g the role of the sensitivity S <strong>and</strong>complementary sensitivity T, respectively. In the feedback control problem,open-loop ORHP poles <strong>and</strong> zeros <strong>in</strong>hibit achiev<strong>in</strong>g these objectives,as we have seen <strong>in</strong> §3.3.2 of Chapter 3. We will see below that correspond<strong>in</strong>glimitations apply to filter<strong>in</strong>g problems.Once design requirements such as those stated <strong>in</strong> (8.14) <strong>and</strong> (8.15) havebeen established, the results <strong>in</strong> §8.2 can be used to foresee their effecton achievable filter performance <strong>in</strong> other frequency ranges. For example,whenever |P(jω)| is small, log |P(jω)| is negative. It follows from (8.9), forexample, that if specification (8.14) holds, log |P(jω)| must be positive <strong>in</strong>some other frequency range.More quantitative <strong>in</strong>formation regard<strong>in</strong>g these peaks is given below. LetΘ s0 (ω 1 ) be the weighted length of the <strong>in</strong>terval [−ω 1 , ω 1 ], as def<strong>in</strong>ed <strong>in</strong>(3.33) <strong>in</strong> Chapter 3. We then have the follow<strong>in</strong>g corollary.


186 8. SISO Filter<strong>in</strong>g|P(jω)|1α 2α 1ω 1 ω 2 ωFIGURE 8.1. Frequency specifications for P.Corollary 8.3.1. Suppose that P <strong>and</strong> M <strong>in</strong> (7.5) are proper, G z is m<strong>in</strong>imumphase, <strong>and</strong> F is a BEE. Suppose further that (8.14) <strong>and</strong> (8.15) hold. Then,(i) for each q = σ q + jω q ∈ Z y , we have that‖P‖ ∞ ≥( )Θ q(ω 1 )( ) π−Θq(ω 2)1 Θ q(ω 2 )−Θ q(ω 1 ) 1 Θ q(ω 2 )−Θ q(ω 1 )α 1 1 + α 2∣×∣ π∣∣1/z (q) Θ q(ω 2 )−Θ q(ω 1 ) ; (8.18)∣B −1(ii) for each p = σ p + jω p ∈ Z 1/z , we have that‖M‖ ∞ ≥( ) π−Θp(ω 2)( )Θ p(ω 1 )1 Θ p(ω 2 )−Θ p(ω 1 ) 1 Θ p(ω 2 )−Θ p(ω 1 )α 2 1 + α 1× ∣ ∣ B−1yπ(p) ∣ Θ p(ω 2 )−Θ p(ω 1 ) . (8.19)Proof. We use specification (8.15) on M to get the bound (8.16) on P. It thenfollows from (8.14), (8.16), <strong>and</strong> (8.9) thatΘ q (ω 1 ) log |α 1 | + [Θ q (ω 2 ) − Θ q (ω 1 )] log ‖P‖ ∞∣+ [π − Θ q (ω 2 )] log |1 + α 2 | ≥ π logHence‖P‖ [Θq(ω 2)−Θ q(ω 1 )]∞≥∣B −11/z (q) ∣ ∣∣( 1α 1) Θq(ω 1 ) ( 11 + α 2) π−Θq(ω 2 ) ∣ ∣∣B −11/z (q) ∣ ∣∣π.Inequality (8.18) then follows. In a similar way, (8.19) can be obta<strong>in</strong>ed us<strong>in</strong>g(8.15), (8.17) <strong>and</strong> (8.10).□


8.3 Design Interpretations 187To give an idea of how severe the constra<strong>in</strong>ts given by (8.18) <strong>and</strong> (8.19)may be, we consider the follow<strong>in</strong>g numerical example.Example 8.3.1. Consider the filter<strong>in</strong>g sett<strong>in</strong>g of Figure 7.1 <strong>in</strong> Chapter 7.Assume that the plant is as given <strong>in</strong> Figure 7.2. Further say that G y hasone real nonm<strong>in</strong>imum phase zero at q = 0.5, <strong>and</strong> that G z is stable <strong>and</strong>m<strong>in</strong>imum phase. Suppose that we require specifications (8.14) <strong>and</strong> (8.16)to hold for the filter<strong>in</strong>g sensitivity for this system. Now we propose the follow<strong>in</strong>gexperiment: fix the high-frequency specifications (8.16) with ω 2 =10 <strong>and</strong> α 2 = 0.5, <strong>and</strong> vary the low-frequency specifications, i.e., the b<strong>and</strong>widthω 1 <strong>and</strong> level of reduction, α 1 . Under these conditions, we analyzethe lower bound b P , def<strong>in</strong>ed as the RHS of (8.18), i.e.,b P ( )Θ q(ω 1 )( ) π−Θq(ω 2)1 Θ q(ω 2 )−Θ q(ω 1 ) 1 Θ q(ω 2 )−Θ q(ω 1 )α 1 1 + α 2∣×∣ π∣∣1/z (q) Θ q(ω 2 )−Θ q(ω 1 ) . (8.20)∣B −1Figure 8.2 shows the behavior of b P vs. ω 1 for values of α 1 rang<strong>in</strong>g from0.6 to 0.1. Notice that, s<strong>in</strong>ce G z was assumed stable, there is no contributionof the Blaschke product B 1/z on the RHS of (8.18). As we see <strong>in</strong>this figure, the presence of a nonm<strong>in</strong>imum phase zero <strong>in</strong> G y <strong>in</strong>troducesrestrictions on the m<strong>in</strong>imum achievable value of ‖P‖ ∞ . More specifically,if |P(jω)| is required to be “small” over a range of frequencies where thenonm<strong>in</strong>imum phase zero of G y has nonneglectable phase-lag (<strong>in</strong> roughl<strong>in</strong>es, when q is with<strong>in</strong> [0, ω 1 )), then |P(jω)| will necessarily be “large”at frequencies outside this range. Hence, there is a clear trade-off on theamount of sensitivity reduction that can be obta<strong>in</strong>ed <strong>in</strong> the ranges [0, ω 1 )<strong>and</strong> [ω 1 , ω 2 ).2015b P10α 1 = 0.1 0.3 0.6500 0.5 1 1.5 2ω 1FIGURE 8.2. Lower bound b P vs. ω 1 for different values of α 1 , α 2 = 0.5, ω 2 = 10,<strong>and</strong> a nonm<strong>in</strong>imum phase zero q = 0.5.This trade-off is worsened if, <strong>in</strong> addition, G z has an unstable pole, s<strong>in</strong>cethen the Blaschke product B 1/z <strong>in</strong> (8.20) may have a considerable contri-


188 8. SISO Filter<strong>in</strong>gbution, depend<strong>in</strong>g on the position of the pole relative to the zero of G y .Figure 8.3 aga<strong>in</strong> displays b P vs. ω 1 , this time for a fixed value of α 1 = 0.5,<strong>and</strong> three different values of a s<strong>in</strong>gle real unstable pole of G z , p = 2, 5, 20.As we see, the closer the pole <strong>and</strong> the zero are, the worse the trade-off<strong>in</strong>duced on the values of |P(jω)|.2015b P10p = 2 5 20500 0.5 1 1.5 2ω 1FIGURE 8.3. Lower bound b P vs. ω 1 for different values of the pole p, α 1 = 0.6,α 2 = 0.5, ω 2 = 10, <strong>and</strong> a nonm<strong>in</strong>imum phase zero q = 0.5.Example 8.3.1 illustrates the effects of a real nonm<strong>in</strong>imum phase zero ofG y on the achievable shapes of |P(jω)|. A practical lesson of this exampleis that nonm<strong>in</strong>imum phase zeros closer to the jω-axis will impose tougherrestrictions on P. Similar conclusions apply for complex zeros.It is not difficult to see, after compar<strong>in</strong>g expressions (8.18) <strong>and</strong> (8.19),that unstable poles of G z play a similar role with M, although this timepoles farther from the jω-axis are the most troublesome. We summarizethese conclusions as follows.(i) ORHP zeros of G y constra<strong>in</strong> the magnitude of P. Zeros closer to thejω-axis impose more severe constra<strong>in</strong>ts.(ii) ORHP poles of G z constra<strong>in</strong> the magnitude of M. Poles farther fromthe jω-axis impose more severe constra<strong>in</strong>ts.(iii) The comb<strong>in</strong>ation of ORHP zeros <strong>and</strong> poles of the plant worsen theconstra<strong>in</strong>ts on both sensitivities. More severe constra<strong>in</strong>ts should beexpected if there is a pole close to a zero <strong>in</strong> the ORHP.In connection with practical design specifications as those shown <strong>in</strong> Figure8.1, these constra<strong>in</strong>ts imply that large peaks <strong>in</strong> the sensitivities mayoccur. For example, this will occur if reduction of P is desired over a frequencyrange much larger than any nonm<strong>in</strong>imum phase zero of M. Consequently,high sensitivity to disturbances should be expected at an <strong>in</strong>termediaterange of frequencies.◦


8.4 Examples: Kalman Filter 189It also follows from Corollary 8.3.1 <strong>and</strong> the def<strong>in</strong>ition of the Blaschkeproduct, that nonm<strong>in</strong>imum phase zeros of G z weaken the constra<strong>in</strong>ts onboth sensitivities.In the follow<strong>in</strong>g section, we consider more examples for the case thatparticular design methodologies are used for the filter design.8.4 Examples: Kalman FilterWe consider now a particular design methodology, namely, Kalman filterdesign, to illustrate how the bounds derived <strong>in</strong> the previous section canbe used to assess <strong>and</strong> compare the performance achieved with differentdesigns. The first example studies a Kalman filter design for an open-loopunstable, nonm<strong>in</strong>imum phase system (i.e., one hav<strong>in</strong>g ORHP zeros <strong>in</strong> G y ),which is affected by process noise with spectrum concentrated at low frequencies.Example 8.4.1 (Kalman Filter Design). Consider a system of the formshown <strong>in</strong> Figure 7.1 <strong>in</strong> Chapter 7, where G, of the form (7.1) <strong>in</strong> Chapter 7,is given by the follow<strong>in</strong>g matrices[ ] 0 1A = , B =3 −2[ 01], C 1 = [ 2 1 ] , C 2 = [ −2 1 ] . (8.21)These data give the follow<strong>in</strong>g transfer functions for the partition of theplant shown <strong>in</strong> Figure 7.2,G z (s) =s + 2(s + 3)(s − 1) , <strong>and</strong> G s − 2y(s) =(s + 3)(s − 1) . (8.22)From (8.22), (8.4) <strong>and</strong> (8.5), we have that P <strong>and</strong> M have nonm<strong>in</strong>imumphase zeros, namely, p = 1 ∈ Z P , <strong>and</strong> q = 2 ∈ Z M . Hence, |B −11/z (q)| =|B −1y (p)| = 3 <strong>in</strong> (8.18) <strong>and</strong> (8.19).Arbitrarily choos<strong>in</strong>g ω 1 = 1, α 2 = 0.5 <strong>and</strong> ω 2 = 100 for specifications(8.16)-(8.17), leads to the bounds shown <strong>in</strong> Figure 8.4. In this figure, b Pdenotes the lower bound on ‖P‖ ∞ given by (8.20), <strong>and</strong> b M denotes thelower bound on ‖M‖ ∞ given by the RHS of (8.19) <strong>in</strong> Corollary 8.3.1.The curves <strong>in</strong> Figure 8.4 hold for any l<strong>in</strong>ear filter for the system (7.2) <strong>in</strong>Chapter 7 with data (8.21). As an illustration for a particular estimator, weuse a st<strong>and</strong>ard Kalman filter (see for example Anderson <strong>and</strong> Moore, 1979).With<strong>in</strong> this paradigm, we will analize the effect of chang<strong>in</strong>g the design fordifferent levels of sensitivity attenuation, i.e., different values of α 1 <strong>and</strong>ω 1 <strong>in</strong> Figure 8.1.The estimate provided by the Kalman filter achieves the optimal estimationerror <strong>in</strong> the m<strong>in</strong>imum variance sense, provided that the systemis affected by process <strong>and</strong> measurement noises that are white <strong>and</strong> have


190 8. SISO Filter<strong>in</strong>g765432b Pb M100 1 2 3 4α 1FIGURE 8.4. b P <strong>and</strong> b M for ω 1 = 1, α m = 0.5 <strong>and</strong> ω 2 = 100.zero mean. The filter is computed by solv<strong>in</strong>g an algebraic Riccati equationparametrized by the system matrices <strong>and</strong> by the noises <strong>in</strong>crementalcovariances.In order to build a mechanism to adjust α 1 <strong>and</strong> ω 1 , we add the follow<strong>in</strong>ghypotheses:(i) the measurement noise, w, is white with known <strong>in</strong>cremental varianceR, <strong>and</strong>(ii) the process <strong>in</strong>put, v, is generated by a low-pass model driven bywhite noise,˙v = −av + av 0 , (8.23)where a is a positive number that serves as a free parameter, <strong>and</strong> v 0is a white noise with known <strong>in</strong>cremental variance Q.The Kalman filter is then designed for a composite system that <strong>in</strong>cludesthe noise model (8.23), <strong>and</strong> the number a <strong>and</strong> the ratio Q/R can be usedto adjust the values of α 1 <strong>and</strong> ω 1 achieved <strong>in</strong> various designs.Some typical curves are shown <strong>in</strong> Figures 8.5 <strong>and</strong> 8.6, which illustratethe achieved shapes of |P| <strong>and</strong> |M| for fixed Q <strong>and</strong> R <strong>and</strong> two differentvalues of a. On the other h<strong>and</strong>, Figures 8.7 <strong>and</strong> 8.8 show the achievedshapes of |P| <strong>and</strong> |M| for a fixed value a = 1 <strong>and</strong> two different ratios Q/R.From various designs, we have extracted those cases for which ω 1 couldbe reasonably taken as 1, <strong>and</strong> have compared the correspond<strong>in</strong>g value ofα 1 <strong>and</strong> the peak sensitivities ‖P‖ ∞ <strong>and</strong> ‖M‖ ∞ . These are shown <strong>in</strong> Figures8.9 <strong>and</strong> 8.10, respectively. For the sake of comparison, we also show<strong>in</strong> these figures the bounds b P <strong>and</strong> b M plotted earlier <strong>in</strong> Figure 8.4.As can be seen from these last figures, the bounds b P <strong>and</strong> b M are ratherconservative for this example; the peaks found <strong>in</strong> |P(jω)| <strong>and</strong> |M(jω)| approximatelydouble the bounds predicted.◦Example 8.4.1 shows that the design trade-offs aris<strong>in</strong>g from the essentiallimitations imposed by ORHP poles <strong>and</strong> zeros of the plant can <strong>in</strong> fact


8.4 Examples: Kalman Filter 191665544|P(jω)|32a=1|M(jω)|32a=1a=0.11a=0.11010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.5. |P| result<strong>in</strong>g from theKalman filter for Q = 100, R = 1 <strong>and</strong>two values of a.010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.6. |M| result<strong>in</strong>g from theKalman filter for Q = 100, R = 1 <strong>and</strong>two values of a.8866|P(jω)|42Q=50Q=1000|M(jω)|42Q=50Q=1000010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.7. |P| result<strong>in</strong>g from theKalman filter for a = 1, R = 1 <strong>and</strong> twovalues of Q.010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.8. |M| result<strong>in</strong>g from theKalman filter for a = 1, R = 1 <strong>and</strong> twovalues of Q.767654||P|| ∞54||M|| ∞3b P322b M1102 2.5 3 3.5 4α 1FIGURE 8.9. Achieved H ∞ norm forP <strong>and</strong> predicted bound b P .02 2.5 3 3.5 4α 1FIGURE 8.10. Achieved H ∞ normfor M <strong>and</strong> predicted bound b M .


192 8. SISO Filter<strong>in</strong>gbe significantly worse than those predicted us<strong>in</strong>g the analysis of §8.3. Thestrength of this analysis lies on its generality, <strong>and</strong> the degree of its conservativenessis l<strong>in</strong>ked to the particular design methodology employed.The follow<strong>in</strong>g example compares the filter<strong>in</strong>g sensitivities result<strong>in</strong>g fromthe previous Kalman Filter design with those achieved by a design thatm<strong>in</strong>imizes the H ∞ norm of weighted versions of P <strong>and</strong> M.Example 8.4.2 (Kalman Filter versus H ∞ Designs). Consider aga<strong>in</strong> theplant given by (7.1) <strong>in</strong> Chapter 7 <strong>and</strong> the data (8.21). A filter design thatguarantees bounds on the H ∞ norms of the filter<strong>in</strong>g sensitivities can beachieved us<strong>in</strong>g a parametrization of all BEEs developed <strong>in</strong> Seron <strong>and</strong> Goodw<strong>in</strong>(1995). Under this parametrization P can be written asP = T 1 − LT 2 , (8.24)where T 1 <strong>and</strong> T 2 are proper <strong>and</strong> stable transfer functions computed fromthe data given <strong>in</strong> (8.21), <strong>and</strong> L is an arbitrary proper stable transfer function,which serves as a free parameter. An H ∞ mixed sensitivity problem(Kwakernaak, 1985) is then obta<strong>in</strong>ed if we compute L to m<strong>in</strong>imize∥ W ∥1(T 1 − LT 2 ) ∥∥∥∞, (8.25)W 2 LT 2where the weights W 1 <strong>and</strong> W 2 serve to give an adequate shape to the solutionon the jω-axis. Ideally, |W 1 (jω)| represents the <strong>in</strong>verse of the desiredshape for |P(jω)|, while W 2 serves to impose the condition that M shouldroll off at high frequencies. Here, we have selectedW 1 (s) = s + 1/a 1k(s + 1)<strong>and</strong> W 2 (s) = 0.5s + 110 4 , (8.26)which are plotted <strong>in</strong> Figure 8.11 for k = 10 <strong>and</strong> a 1 = 0.2.00.5−2|W 1 (jω)|0.40.30.20.1log |W 2 (jω)|−4−6−8010 −4 10 −2 10 0ω10 2 10 4−1010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.11. Weighs W 1 , for k = 10 <strong>and</strong> a 1 = 0.2 (left), <strong>and</strong> W 2 (right).By chang<strong>in</strong>g the values of k <strong>and</strong> a 1 <strong>in</strong> W 1 we can obta<strong>in</strong> different designssatisfy<strong>in</strong>g specification (8.14). The selection of W 1 as <strong>in</strong> Figure 8.11


8.5 Example: Inverted Pendulum 193was made to approximately match the shapes obta<strong>in</strong>ed by the Kalman filter<strong>in</strong> Example 8.4.1; <strong>in</strong> particular, note that we are requir<strong>in</strong>g P to havea value close to 1/|W 1 (0)| = 2 at low frequencies. We po<strong>in</strong>t out that theH ∞ method allows for a lower reduction level simply by chang<strong>in</strong>g theparameters of W 1 . Moreover, it turns out that this methodology can yieldsmaller peaks <strong>in</strong> the achieved sensitivities than those obta<strong>in</strong>ed with theKalman filter of the previous example. Figures 8.12 <strong>and</strong> 8.13 compare theabsolute bounds b P <strong>and</strong> b M for this system with the H ∞ norms of P <strong>and</strong>M obta<strong>in</strong>ed with both methodologies for different values of α 1 . The restof the specifications for these curves are ω 1 = 1, ω 2 = 100, <strong>and</strong> α 2 = 0.5.77Bound & ||P|| ∞65432KFH ∞b PBound & ||M|| ∞65432KFH ∞b M1102 2.5 3 3.5 4α 1FIGURE 8.12. Achieved H ∞ normfor P, Kalman filter <strong>and</strong> H ∞ designs,<strong>and</strong> predicted bound b P .02 2.5 3 3.5 4α 1FIGURE 8.13. Achieved H ∞ normfor M, Kalman filter <strong>and</strong> H ∞ designs,<strong>and</strong> predicted bound b M .As can be seen from Figures 8.12 <strong>and</strong> 8.13, <strong>and</strong> as anticipated, the conservativenessof the predicted bounds depends on the particular designtechnique chosen. In this example, the H ∞ method has a water-bed effectmilder than that of the Kalman filter. Sensitivities correspond<strong>in</strong>g to bothdesigns are shown <strong>in</strong> Figures 8.14 <strong>and</strong> 8.15 for α 1 = 2.53. The parametersfor W 1 <strong>in</strong> (8.26) are k = 10 <strong>and</strong> a 1 = 0.2; the parameters for the Kalmanfilter are taken as a = 1, R = 1, <strong>and</strong> Q = 263.We see from this example that the results given <strong>in</strong> this chapter can beeffectively used to compare different filter design methods.◦8.5 Example: Inverted PendulumConsider the <strong>in</strong>verted pendulum analyzed <strong>in</strong> §1.3.3 of Chapter 1, <strong>and</strong> revisited<strong>in</strong> §3.3.3 of Chapter 3. It is well known that this system is observablefrom the carriage position measurement; however, practical evidencesuggests that it is difficult to estimate the angle θ from the available data.We will use the framework of Chapters 7 <strong>and</strong> 8 to study the sensitivitylimitations that apply to the problem of angle estimation from the carriageposition measurement.


194 8. SISO Filter<strong>in</strong>g665544|P(jω)|32H ∞|M(jω)|32H ∞KF1KF1010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.14. |P| result<strong>in</strong>g from theKalman filter <strong>and</strong> the H ∞ norm m<strong>in</strong>imizationdesigns.010 −4 10 −2 10 0ω10 2 10 4FIGURE 8.15. |M| result<strong>in</strong>g from theKalman filter <strong>and</strong> the H ∞ norm m<strong>in</strong>imizationdesigns.Choos<strong>in</strong>g z = θ, the l<strong>in</strong>earized model for this system (e.g., Middleton,1991) has the form (7.1) <strong>in</strong> Chapter 7, where⎡⎤0 1 0 0A = ⎢0 0 −g m 1 /m 2 0⎥⎣0 0 0 1⎦ ,0 0 (1 + m 1 /m 2 ) g/l 0⎡B = ⎢⎣01/m 20−1/(m 2 l)C 1 = [ 0 0 1 0 ] , C 2 = [ 1 0 0 0 ] .Here m 1 is the mass at the end of the pendulum, m 2 is the carriage mass, gis the gravitational constant, <strong>and</strong> l is the pendulum length. These data givethe follow<strong>in</strong>g transfer functions for the partition<strong>in</strong>g of the plant shown <strong>in</strong>Figure 7.2 of Chapter 7,G z =⎤⎥⎦ ,−1m 2 l (s + p)(s − p) , <strong>and</strong> G (s + q)(s − q)y =m 2 s 2 (s + p)(s − p) , (8.27)where q = √ g/l <strong>and</strong> p = q √ 1 + m 1 /m 2 . Say we take q = 1. Choos<strong>in</strong>gthe specification parameters <strong>in</strong> (8.14) <strong>and</strong> (8.15) as α 1 = 0.25, ω 1 = 0.01,α 2 = 0.5, <strong>and</strong> ω 2 = 100, Corollary 8.3.1 gives the follow<strong>in</strong>g lower boundson the peak sensitivities‖S‖ ∞ ≥ 6, <strong>and</strong> ‖T‖ ∞ ≥ 6. (8.28)We next design a filter for the above system follow<strong>in</strong>g the H ∞ mixedsensitivity m<strong>in</strong>imization procedure outl<strong>in</strong>ed <strong>in</strong> Example 8.4.2. In particular,we choose the weights <strong>in</strong> (8.25) asW 1 (s) = 0.1 s + 2s + 0.01 ,


8.6 Summary 195<strong>and</strong> W 2 as given <strong>in</strong> (8.26). Figure 8.16 shows the values of log |P(jω)| <strong>and</strong>log |M(jω)| achieved by this design, for the mass ratio m 1 /m 2 = 0.2, 0.4, 1.For m 1 /m 2 = 1, <strong>in</strong> particular, the peak sensitivities actually achieved arearound two times the values of the lower bounds <strong>in</strong> (8.28). The reasonsfor this conservatism <strong>in</strong>clude the fact that the sectionally constant shapesassumed by Corollary 8.3.1 are actually unrealistic from a practical designpo<strong>in</strong>t of view.10 210 2log |P(jω)|10 010 4 ωmass ratio = 0.2mass ratio = 1log |M(jω)|10 4 ω10 0mass ratio = 1mass ratio = 0.210 −210 −210 −410 −4 10 −2 10 0 10 2 10 410 −410 −4 10 −2 10 0 10 2 10 4FIGURE 8.16. log |P(jω)| <strong>and</strong> log |M(jω)| for the <strong>in</strong>verted pendulum.We note that, for the mass ratio m 1 /m 2 = 0.1, <strong>and</strong> the same specificationparameters as <strong>in</strong> (8.14) <strong>and</strong> (8.15), Corollary 8.3.1 gives the lower bounds<strong>in</strong> (8.28) equal to 44. Recall<strong>in</strong>g that P maps relative <strong>in</strong>put disturbances torelative estimation errors (see §7.2.1 <strong>in</strong> Chapter 7), this validates the claimmade <strong>in</strong> §1.1 of Chapter 1 that relative <strong>in</strong>put errors of the order of 1% willappear as angle relative estimation errors of the order of 50%.8.6 SummaryThis chapter has discussed a new concept <strong>in</strong> the area of filter<strong>in</strong>g, namelythat of sensitivity of the estimation error to process <strong>and</strong> measurementnoise. These sensitivities always satisfy a complementarity constra<strong>in</strong>t, whichcan be used to establish <strong>in</strong>tegral bounds on the magnitude of their frequencyresponses. These bounds give new <strong>in</strong>sights <strong>in</strong>to the fundamentallimits that apply to all l<strong>in</strong>ear, bounded error filters.The utility of the results has been v<strong>in</strong>dicated by specific examples. Inparticular, §8.3 <strong>and</strong> §8.4 have shown that, for a given signal from a systemto be estimated, fundamental design constra<strong>in</strong>ts apply to all filter<strong>in</strong>g problemsirrespective of the method used to obta<strong>in</strong> a specific design. Thus, onecan use the design constra<strong>in</strong>ts to judge, a priori, whether or not, a desiredlevel of performance is achievable. Moreover, the constra<strong>in</strong>ts can only bemodified if the <strong>in</strong>tr<strong>in</strong>sic nature of the problem is changed at source, e.g.,


196 8. SISO Filter<strong>in</strong>gby modify<strong>in</strong>g the location of sensors, which, <strong>in</strong>ter-alia, changes the numeratorpolynomials <strong>and</strong> hence the bounds <strong>in</strong> the sensitivity constra<strong>in</strong>ts.Notes <strong>and</strong> ReferencesThis chapter was ma<strong>in</strong>ly based on Goodw<strong>in</strong> et al. (1995), Seron <strong>and</strong> Goodw<strong>in</strong>(1995) <strong>and</strong> Seron (1995).


9MIMO Filter<strong>in</strong>gThis chapter <strong>in</strong>vestigates sensitivity limitations <strong>in</strong> multivariable l<strong>in</strong>ear filter<strong>in</strong>g.Similar to those obta<strong>in</strong>ed <strong>in</strong> Chapter 4 for the control problem, multivariable<strong>in</strong>tegral constra<strong>in</strong>ts hold for the MIMO version of the filter<strong>in</strong>gsensitivities <strong>in</strong>troduced <strong>in</strong> Chapter 7. Also similar to the control case, thereare different ways to extend the SISO <strong>in</strong>tegrals to a MIMO sett<strong>in</strong>g. The approachfollowed <strong>in</strong> this chapter emphasizes the trade-offs that arise whenthe system is required to satisfy, besides frequency conditions, structuralspecifications on the multivariable sensitivities.In the MIMO case, frequency restrictions similar to those aris<strong>in</strong>g <strong>in</strong> SISOsystems apply to l<strong>in</strong>ear comb<strong>in</strong>ations of the scalar entries of the filter<strong>in</strong>gsensitivities P <strong>and</strong> M. Thus, the penalties imposed by ORHP zeros <strong>and</strong>poles are somehow “shared” by the scalar entries of the sensitivities <strong>and</strong>,<strong>in</strong> this sense, the limitations may “relax” with respect to SISO systems.However, if, <strong>in</strong> addition to frequency specifications, a particular structureis required for these functions (e.g., diagonal, diagonal-dom<strong>in</strong>ant, triangular,etc.), then, as we will see, extra costs may arise.The results presented <strong>in</strong> this chapter can be used to obta<strong>in</strong> performancelimitations <strong>in</strong> a variety of filter<strong>in</strong>g applications where multivariable structuralfeatures are of particular <strong>in</strong>terest. To illustrate, we explore the designtrade-offs aris<strong>in</strong>g <strong>in</strong> problems of detection <strong>and</strong> isolation of system faults.


198 9. MIMO Filter<strong>in</strong>g9.1 Interpolation Constra<strong>in</strong>tsWe consider aga<strong>in</strong> the general sett<strong>in</strong>g <strong>in</strong>troduced <strong>in</strong> Chapter 7. As <strong>in</strong> theprevious chapter, the st<strong>and</strong><strong>in</strong>g assumption here is that of BEE (see §7.3).We summarize the conditions required <strong>in</strong> this chapter for the functions F,G z , <strong>and</strong> G y def<strong>in</strong>ed <strong>in</strong> §7.1, <strong>in</strong> the follow<strong>in</strong>g assumption.Assumption 9.1.(i) F is a BEE, i.e., F <strong>and</strong> G z − FG y are stable.(ii) G z is right <strong>in</strong>vertible <strong>and</strong> m<strong>in</strong>imum phase.(iii) G y G −1zis proper (here G −1z is a right <strong>in</strong>verse of G z ).S<strong>in</strong>ce the plant G is stabilizable, then stability of F <strong>and</strong> G z − FG y is anecessary <strong>and</strong> sufficient condition for the filter to be a BEE, as def<strong>in</strong>ed<strong>in</strong> §7.3 <strong>in</strong> Chapter 8. It can be <strong>in</strong>ferred from condition (i) that unstablemodes shared by G z <strong>and</strong> G y are canceled 1 by a zero of the difference G z −FG y , while those unstable poles of G y that do not appear <strong>in</strong> G z must benecessarily canceled by nonm<strong>in</strong>imum phase zeros of F.In condition (ii), recall that for G z to be right <strong>in</strong>vertible it is necessarythat there be at least as many process <strong>in</strong>puts as signals to be estimated. Theassumption that G z has no CRHP zeros is made to simplify the analysis.Condition (iii) is assumed to guarantee that the sensitivities are proper.It was shown <strong>in</strong> Chapter 8 that, <strong>in</strong> the scalar case, ORHP poles of thesystem transfer function G z are zeros of the filter<strong>in</strong>g sensitivity, P, whilstORHP zeros of the system transfer function G y that are not zeros of G z , arezeros of the filter<strong>in</strong>g complementary sensitivity, M. The follow<strong>in</strong>g lemmageneralizes these results to the MIMO case, emphasiz<strong>in</strong>g the multivariablecharacter of the zeros.Lemma 9.1.1 (Interpolation Constra<strong>in</strong>ts). Suppose that G <strong>and</strong> F satisfythe hypotheses stated <strong>in</strong> Assumption 9.1. Then the follow<strong>in</strong>g conditionsare satisfied by P <strong>and</strong> M:(i) P <strong>and</strong> M are stable <strong>and</strong> proper.(ii) Let p ∈ ¢ +be a pole of G z . Then, there exists a nonzero vector Φ ∈¢ n such thatP(p)Φ = 0, <strong>and</strong> M(p)Φ = Φ. (9.1)(iii) Let q ∈ ¢ +be a zero of G y . Then, there exists a nonzero vectorΨ ∈ ¢ n such thatP(q)Ψ = Ψ, <strong>and</strong> M(q)Ψ = 0. (9.2)◦1 In frequency <strong>and</strong> direction.


9.2 Poisson Integral Constra<strong>in</strong>ts 199(iv) Let q ∈ ¢ +be a zero of G z − FG y . Then there exists a nonzero vectorΨ ∈ ¢ n such thatΨ ∗ P(q) = 0, <strong>and</strong> Ψ ∗ M(q) = Ψ ∗ . (9.3)(v) Let q ¢ ∈ +be a zero of F, <strong>and</strong> suppose further that the productG y G −1z does not have a pole at q. Then there exists a nonzero vectorΨ ¢ ∈ n such thatΨ ∗ P(q) = Ψ ∗ , <strong>and</strong> Ψ ∗ M(q) = 0. (9.4)Proof. Stability of P <strong>and</strong> M follows from the fact that F is a BEE <strong>and</strong> G zis m<strong>in</strong>imum phase; their properness follows from condition (iii). Case (ii)follows <strong>in</strong> a straightforward manner from (7.5) by not<strong>in</strong>g that the termG z − FG y is stable by Assumption 9.1. For case (iii), we have that there existsa nonzero vector Ψ 1 such that G y (q)Ψ 1 = 0. S<strong>in</strong>ce G z is m<strong>in</strong>imumphase, then there exists a nonzero vector Ψ such that G −1z (q)Ψ = Ψ 1 .Hence, s<strong>in</strong>ce F is assumed stable, (9.2) follows. Case (iv) is immediate onnot<strong>in</strong>g that G z is m<strong>in</strong>imum phase. F<strong>in</strong>ally, for case (v) notice that if q isnot a pole of G y G −1z , then necessarily there exists a vector Ψ such thatΨ ∗ F(q)G y (q)G −1z (q) = 0, <strong>and</strong> (9.4) follows. □Remark 9.1.1. Conditions (iv) <strong>and</strong> (v) depend on the particular choice offilter <strong>and</strong> they are stated for completeness. However, our ma<strong>in</strong> focus hereis on results that hold irrespective of the particular filter — this is true ofthe other cases <strong>in</strong> Lemma 9.1.1. Yet, as we will see <strong>in</strong> §9.4, there are filter<strong>in</strong>gapplications where zeros of F are directly determ<strong>in</strong>ed by the unstablepoles of the plant.◦Remark 9.1.2. Notice that s<strong>in</strong>ce the plant is detectable from y, the transferfunction G y has all the unstable poles of G z . In consequence, G y G −1z —<strong>and</strong> hence M — has no zeros at any unstable pole of G.◦9.2 Poisson Integral Constra<strong>in</strong>tsThe <strong>in</strong>terpolation constra<strong>in</strong>ts derived <strong>in</strong> the previous section fix the valuesof the filter<strong>in</strong>g sensitivity functions at po<strong>in</strong>ts <strong>in</strong> the CRHP determ<strong>in</strong>edby poles <strong>and</strong> zeros of the plant <strong>and</strong> filter. In the SISO case addressed <strong>in</strong>Chapter 8, the Poisson <strong>in</strong>tegral was used to translate these constra<strong>in</strong>ts <strong>in</strong>toequivalent <strong>in</strong>tegral relations that must be satisfied by P <strong>and</strong> M along thejω-axis. This section extends the results of Chapter 8 to MIMO filter<strong>in</strong>gproblems us<strong>in</strong>g the technique suggested by Gómez <strong>and</strong> Goodw<strong>in</strong> (1995)<strong>and</strong> developed for the multivariable control case <strong>in</strong> Chapter 4.We first <strong>in</strong>troduce some prelim<strong>in</strong>ary notation. Recall<strong>in</strong>g that P <strong>and</strong> Mare n × n square transfer matrices, we denote by P ik , <strong>and</strong> M ik their respectiveelements <strong>in</strong> the i-row <strong>and</strong> k-column. If Φ is a vector <strong>in</strong> ¢ n , we


200 9. MIMO Filter<strong>in</strong>gdenote its elements by φ i , i = 1, . . . , n, i.e., Φ = [φ ∗ 1 , φ∗ 2 , . . . , φ∗ n] ∗ . Also,we <strong>in</strong>troduce the <strong>in</strong>dex set I Φ {i ∈ : φ i ≠ 0} as the set of <strong>in</strong>dices of thenonzero elements of Φ.As before, given a set Z k = {s i , i = 1, . . . , n k } of complex numbers <strong>in</strong>the ORHP, we def<strong>in</strong>e its Blaschke product as the functionB k (s) =∏n ki=1s − s is + ¯s i.If Z k is empty, we def<strong>in</strong>e B k (s) = 1, ∀s. We then have the follow<strong>in</strong>g result.Theorem 9.2.1 (Poisson Integral for P). Let q = σ q + jω q , σ q > 0, be azero of G y , <strong>and</strong> let Ψ ¢ ∈ n , Ψ ≠ 0, be the correspond<strong>in</strong>g zero directionof M, as described <strong>in</strong> Lemma 9.1.1 (iii). Then, under Assumption 9.1, foreach <strong>in</strong>dex k <strong>in</strong> I Ψ ,∫1 ∞π −∞n∑log∣i=1P ki (jω) ψ iψ k∣ ∣∣∣∣σ qσ 2 dω ≥ 0 . (9.5)q + (ω q − ω)2Proof. The proof follows the same general l<strong>in</strong>e as that <strong>in</strong> Chapter 4 —Gómez <strong>and</strong> Goodw<strong>in</strong> (1995) — <strong>and</strong> is an application of the Poisson <strong>in</strong>tegralformula, as <strong>in</strong> Freudenberg <strong>and</strong> Looze (1985), to the elements ofthe vector function PΨ : → n , which are proper, stable, ¢scalar ¢ rationalfunctions. Pick one of these elements, say ρ k (s) ∑ ni=1 P ki(s)ψ i ,where k is <strong>in</strong> I Ψ , <strong>and</strong> let B k be the Blaschke product of the zeros of ρ k<strong>in</strong> the ORHP. Then, the function ˜ρ(s) B −1k(s)ρ k(s) is proper, stable, <strong>and</strong>m<strong>in</strong>imum phase, which implies that log ˜ρ(s) is analytic <strong>in</strong> the ORHP <strong>and</strong>satisfies the conditions of the Poisson <strong>in</strong>tegral formula (Theorem A.6.1 <strong>in</strong>Appendix A). Evaluat<strong>in</strong>g this at s = q, <strong>and</strong> us<strong>in</strong>g the fact that | ρ˜k (jω)| =|ρ k (jω)| yields∫ ∣ ∣1 ∞ n∑ ∣∣∣∣σ ∣∣∣∣ n∣qlogPπ −∞ ∣ ki (jω)ψ iσ 2 q+(ω q −ω) 2 dω = log B −1k(q) ∑ ∣∣∣∣P ki (q)ψ ii=1i=1= log |B −1k (q)| + log |ψ k| ,(9.6)where the last step follows from the <strong>in</strong>terpolation condition P(q)Ψ = Ψ(notice that ψ k ≠ 0 by assumption). S<strong>in</strong>ce1π∫ ∞−∞σ qσ 2 q + (ω q − ω) 2 dω = 1 ,subtract<strong>in</strong>g log |ψ k | from both sides of (9.6) yields∫ ∣1 ∞ n∑logPπ −∞ ∣ ki (jω) ψ ∣∣∣∣i σ qψ k σ 2 dω = log |B−1q + (ω q − ω)2 k(q)| . (9.7)i=1


9.2 Poisson Integral Constra<strong>in</strong>ts 201The proof is then completed on not<strong>in</strong>g that |B −1k(s)| ≥ 1 at any po<strong>in</strong>t s <strong>in</strong>+ , <strong>and</strong> so log |B −1k(q)| ≥ 0. □¢Theorem 9.2.1 establishes that, for each nonm<strong>in</strong>imum phase zero of theplant G y , there is a set of <strong>in</strong>tegral constra<strong>in</strong>ts that limit the values of P onthe jω-axis <strong>in</strong> an <strong>in</strong>tr<strong>in</strong>sically vectorial fashion. More specifically, depend<strong>in</strong>gon the characteristics of the <strong>in</strong>put directions associated to the zero,up to n <strong>in</strong>tegral constra<strong>in</strong>ts may arise on l<strong>in</strong>ear comb<strong>in</strong>ations of elementsof P <strong>in</strong> the same row. If the zero direction happens to be canonical (i.e.,it has only one nonzero entry) or P is designed to be diagonal, then theconstra<strong>in</strong>ts reduce to those of the SISO case.Recall that, <strong>in</strong> the SISO case, if the plant is nonm<strong>in</strong>imum phase, then requir<strong>in</strong>gthat (as is often desirable) |P(jω)| < 1 over a frequency range, sayΩ, implies that, necessarily, |P(jω)| > 1 at other frequencies. The severityof this trade-off depends on the relative location of the nonm<strong>in</strong>imumphase zero <strong>and</strong> the frequency range Ω. More details <strong>and</strong> <strong>in</strong>terpretationswere given <strong>in</strong> Chapter 8.A similar <strong>in</strong>tegral holds for the filter<strong>in</strong>g complementary sensitivity function,as established below.Theorem 9.2.2 (Poisson Integral for M). Let p = σ p + jω p , σ p > 0, be apole of G z , <strong>and</strong> let Φ ∈ ¢ n , Φ ≠ 0, be the correspond<strong>in</strong>g zero direction ofP, as described <strong>in</strong> Lemma 9.1.1 (ii). Then, under Assumption 9.1, for each<strong>in</strong>dex k <strong>in</strong> I Φ ,∫1 ∞π −∞n∑log∣i=1M ki (jω) φ iφ k∣ ∣∣∣∣σ pσ 2 dω ≥ 0 . (9.8)p + (ω p − ω)2Proof. The proof follows that of Theorem 9.2.1, this time us<strong>in</strong>g the <strong>in</strong>terpolationconstra<strong>in</strong>t (ii) <strong>in</strong> Lemma 9.1.1.□A straightforward corollary of these results emphasizes the vectorialnature of the associated trade-offs. Let Ω 1 [0, ω 1 ] denote a given rangeof frequencies of <strong>in</strong>terest, <strong>and</strong> assume that the k-row of M satisfies thefollow<strong>in</strong>g design specifications:|M ki (jω)| ≤ ɛ ki for ω <strong>in</strong> [ω 1 , ∞], i = 1, . . . , n, (9.9)where ɛ ki , i = 1, . . . , n, are small positive numbers. Let Θ s0 (ω 1 ) be theweighted length of the <strong>in</strong>terval [−ω 1 , ω 1 ], as def<strong>in</strong>ed <strong>in</strong> (3.33) <strong>in</strong> Chapter3. Then, we have the follow<strong>in</strong>g corollary.Corollary 9.2.3. Assume that all the conditions of Theorem 9.2.2 hold.Then, if the k-row of M achieves specifications (9.9), the follow<strong>in</strong>g <strong>in</strong>-


202 9. MIMO Filter<strong>in</strong>gequality must be satisfied,‖M kk ‖ ∞ +⎛⎞n∑∣ φ i ∣∣∣ ⎜ 1∣ ‖M ki ‖ ∞ ≥ ⎝φ k ɛ kk + ∑ ∣⎟n i=1 ∣ φ i ∣∣⎠φ ɛki ki=1i≠ki≠kΘ p(ω 1 )π−Θ p(ω 1 ).(9.10)Proof. The proof follows immediately from Theorem 9.2.2 by us<strong>in</strong>g specifications(9.9) on M (see Corollary 4.3.4 <strong>in</strong> Chapter 4 for more details). □As <strong>in</strong> the SISO case, the corollary shows that the <strong>in</strong>tegral relation (9.8)implies lower bounds on the <strong>in</strong>f<strong>in</strong>ity norm of elements of M. Notice thatthe exponent on the RHS of (9.10) is a positive number <strong>and</strong> its base islikely to be larger than one if the ɛ ki ’s are small enough; hence, the moredem<strong>and</strong><strong>in</strong>g the specifications, the larger these lower bounds are.If the zero direction is not canonical, an important difference <strong>in</strong> theMIMO case is that the lower bounds apply to a comb<strong>in</strong>ation of norms of elements,which somehow relaxes the constra<strong>in</strong>t over the SISO case (wherethere is only one element). Also, the lower bounds are smaller than <strong>in</strong> theSISO case due to the presence of extra positive terms <strong>in</strong> the denom<strong>in</strong>atorof the RHS of (9.10). In consequence, if M is required to be diagonal (asoften occurs <strong>in</strong> a number of applications), the constra<strong>in</strong>ts on the values ofM on the jω-axis worsen s<strong>in</strong>ce the off-diagonal elements disappear. Weanalyze this <strong>in</strong> more detail <strong>in</strong> the follow<strong>in</strong>g section.9.3 The Cost of DiagonalizationAs was concluded from Corollary 9.2.3, when the zero direction is notcanonical the cost that it <strong>in</strong>duces on each row of P or M can be sharedamong various elements of the row. On the other h<strong>and</strong>, as discussed <strong>in</strong>Chapter 4 for the control case, a diagonally decoupled system loses thispotential to alleviate the cost, s<strong>in</strong>ce all the price is paid by the diagonalelements alone.By comb<strong>in</strong><strong>in</strong>g the ideas of Gómez <strong>and</strong> Goodw<strong>in</strong> <strong>and</strong> the parametrizationof all diagonaliz<strong>in</strong>g post-compensators given by K<strong>in</strong>naert <strong>and</strong> Peng(1995), it is possible to make a precise statement about the cost of diagonaldecoupl<strong>in</strong>g. For convenience, we first recall the result of K<strong>in</strong>naert <strong>and</strong>Peng. Given an l × n stable, proper, left-<strong>in</strong>vertible transfer function H, itcan be factorized as[ ] ¯HH = Q , (9.11)0where Q is an l × l bistable <strong>and</strong> biproper transfer function <strong>and</strong> ¯H is an uppertriangular l × l matrix (Dion <strong>and</strong> Commault, 1988). The matrix [ ¯H ∗ 0] ∗


9.3 The Cost of Diagonalization 203is a column Hermite form of H over the r<strong>in</strong>g of stable proper rational functions<strong>and</strong> it is unique up to units of this r<strong>in</strong>g. With the aid of this factorization,K<strong>in</strong>naert <strong>and</strong> Peng (1995) derived a parametrization of all stable,proper postcompensators F such that FH is a stable, proper diagonal matrixwith nonzero diagonal elements. Us<strong>in</strong>g this parametrization for F, theproduct FH has the formFH = ¯QR ¯Q diag[R 1 , R 2 , . . . , R n ] , (9.12)where ¯Q is an arbitrary n×n stable, proper, diagonal matrix with nonzerodiagonal elements, <strong>and</strong> the R k , k = 1, . . . n, are the functions∏ nki=1R k (s) (s − p i)(s + σ k ) n . (9.13)k+r kIn (9.13), p i , i = 1, . . . n k , are the unstable poles of the k-row of ¯H −1 <strong>in</strong>(9.11), σ k is an arbitrary positive real number, <strong>and</strong> r k is determ<strong>in</strong>ed sothatlims→∞ R k(s) (k-row of ¯H −1 ) = a nonzero row vector.Notice that the functions R k conta<strong>in</strong> <strong>in</strong>formation about the zeros of H, <strong>and</strong>their numerators depend on H alone.Assume next that we are given a filter<strong>in</strong>g problem where the designobjective is to achieve a diagonal complementary sensitivity M. This isdesirable, for example, if the filter is to perform detection <strong>and</strong> isolation ofsystem faults, as we will discuss <strong>in</strong> §9.4. We then have the follow<strong>in</strong>g result.Theorem 9.3.1 (Cost of Diagonalization). Let Assumption 9.1 hold. Letp = σ p + jω p , σ p > 0, be a pole of G z , <strong>and</strong> let Φ ¢ ∈ n , Φ ≠ 0, be the correspond<strong>in</strong>gzero direction of P, as described <strong>in</strong> Lemma 9.1.1 (ii). Assumethat G y G −1zHermite formis a stable, proper <strong>and</strong> left <strong>in</strong>vertible transfer function, with[Ḡ ]G y G −1z = Q . (9.14)0Suppose that the filter F has been selected such that M <strong>in</strong> (7.5) is a stable,proper diagonal matrix with nonzero diagonal elements. Then, for each<strong>in</strong>dex k <strong>in</strong> I Φ ,1π∫ ∞log |M kk (jω)|−∞σ pσ 2 p + (ω p − ω)dω = log |B−12 k(p)| , (9.15)where B k is the Blaschke product of the unstable poles of the k-row ofḠ −1 , <strong>and</strong> where Ḡ is def<strong>in</strong>ed <strong>in</strong> (9.14).Proof. Note that the assumptions imply that Theorem 9.2.2 holds specializedto the diagonal case, i.e.,∫1 ∞σ plog |M kk (jω)|π −∞σ 2 p + (ω p − ω) 2 dω ≥ 0 .


204 9. MIMO Filter<strong>in</strong>gThe <strong>in</strong>equality above is turned <strong>in</strong>to an equality similar to (9.7) by add<strong>in</strong>gthe Blaschke product of ORHP zeros of ρ k (def<strong>in</strong>ed <strong>in</strong> the proof of Theorem9.2.1). S<strong>in</strong>ce M has the form (9.12) with H = G y G −1z , this Blaschkeproduct is <strong>in</strong>dependent of the filter <strong>and</strong> equal to the unstable poles of thek-row of Ḡ −1 , with Ḡ def<strong>in</strong>ed <strong>in</strong> (9.14). This completes the proof. □The follow<strong>in</strong>g example illustrates the result.Example 9.3.1. Consider a plant hav<strong>in</strong>g the follow<strong>in</strong>g transfer functions.⎡⎤1 s − 10s + 1 s + 14 − sG y (s) =0 0,⎢(s − 3)(s + 2) ⎥⎣ 44 ⎦s 2 0+ 3s + 5s 2 + 3s + 5⎡⎤(9.16)s − 30s + 2G −1s + 1 s + 1/3z (s) =.⎢s + 3 s + 2 ⎥⎣3 − s⎦0s + 2Note that G −1zhas a zero at p = 3 with <strong>in</strong>put direction[ ] −1Φ = ,1which implies that <strong>in</strong>tegrals of the form (9.8) hold that constra<strong>in</strong> the frequencyresponse of the rows of the complementary sensitivity M. Theseconstra<strong>in</strong>ts are:1π∫ ∞log |M k1 (jω) − M k2 (jω)|−∞Next we compute the product G y G −1z , i.e.,⎡s − 1G y (s)G −1z (s) =3ω 2 dω ≥ 0, k = 1, 2. (9.17)+ 9s + 3⎢⎣ 0s − 5/3⎤s + 1s − 4⎥⎦ ,(s + 2) 20 0which is <strong>in</strong> the Hermite form (9.14) with Q = I. If the filter F is selected toachieve diagonalization of M, then necessarily M has the form (9.12) withH = G y G −1z , i.e.,⎡⎤(s − 1)(s − 4)M(s) = ¯Q(s) ⎢ (s + σ 1 ) 3 0⎥⎣s − 4 ⎦ .0(s + σ 2 ) 2


9.4 Application to Fault Detection 205In this case, Theorem 9.3.1 holds, giv<strong>in</strong>g the follow<strong>in</strong>g <strong>in</strong>tegral constra<strong>in</strong>tson the diagonal elements of M:∫1 ∞log |M 11 (jω)|π −∞1π∫ ∞log |M 22 (jω)|−∞3ω 2 dω = 2.6391,+ 9(9.18)3ω 2 + 9 dω = 1.9459.Now we see that even without impos<strong>in</strong>g specifications on the values of M onthe jω-axis, there are nontrivial bounds aris<strong>in</strong>g from the diagonalization.Indeed, on the one h<strong>and</strong> it follows from (9.17) that‖M 11 − M 12 ‖ ∞ ≥ 1,‖M 21 − M 22 ‖ ∞ ≥ 1.On the other h<strong>and</strong>, it follows from (9.18) — after diagonaliz<strong>in</strong>g, i.e., M 12 =0 = M 21 — that‖M 11 ‖ ∞ ≥ 14,‖M 22 ‖ ∞ ≥ 7.(9.19)Therefore, a diagonal M is obta<strong>in</strong>ed at the cost of large peaks <strong>in</strong> its diagonalentries. These peaks might be highly undesirable if they happen tooccur at frequencies where M is required to be small. Moreover, it is notdifficult to see that if frequency specifications on the entries of M are imposed<strong>in</strong> addition, the design trade-offs aris<strong>in</strong>g from (9.19) will be evenworse.◦Theorem 9.3.1 has immediate application <strong>in</strong> filter<strong>in</strong>g problems wherediagonal sensitivities are desirable. In the next section, we illustrate ourresults by study<strong>in</strong>g the problem of detection <strong>and</strong> isolation of faults <strong>in</strong> thecontext of multivariable filter<strong>in</strong>g.9.4 Application to Fault DetectionFault detection <strong>and</strong> isolation (FDI) f<strong>in</strong>ds application <strong>in</strong> complex multiplecomponent systems where, for reasons of safety or economics, tolerance tocomponent failure is required. An approach to achiev<strong>in</strong>g fault tolerance 2consists <strong>in</strong> exploit<strong>in</strong>g the redundancy <strong>in</strong>herent <strong>in</strong> the system model, whichis known as model-based FDI.A common method for model-based FDI is to design a filter that generatesa residual, or fault-sensitive signal, that is dist<strong>in</strong>guishable from zero2 Avoid<strong>in</strong>g the expense of hardware redundancy.


206 9. MIMO Filter<strong>in</strong>gwhen a component of the system fails but rema<strong>in</strong>s close to zero otherwise.If the residual has different properties for different component faults, thenit is also possible to isolate the faulty component. A filter that generates aresidual that allows both detection <strong>and</strong> isolation of faults is called an FDIfilter. A number of techniques are available for construct<strong>in</strong>g FDI filters; seefor example Massoumnia et al. (1989) <strong>and</strong> the references there<strong>in</strong>.Two issues are important <strong>in</strong> FDI filter design. Firstly, the residual shouldbe sensitive to faults at those frequencies where the energy of the faultis likely to be concentrated. Secondly, the residual should be <strong>in</strong>sensitiveto other process disturbances <strong>and</strong> noise. It is well understood that theseare conflict<strong>in</strong>g objectives, s<strong>in</strong>ce both sensitivity to faults <strong>and</strong> <strong>in</strong>sensitivityto disturbances cannot be achieved at the same frequency. However, theimplications of ORHP poles <strong>and</strong> zeros of G for achiev<strong>in</strong>g design objectivesover a range of frequencies are less well appreciated.In this section we show that, by approach<strong>in</strong>g the FDI problem <strong>in</strong> thecontext of multivariable filter<strong>in</strong>g, the Poisson <strong>in</strong>tegral constra<strong>in</strong>ts of §9.2are useful to quantify the limitations imposed on FDI filters by ORHPpoles <strong>and</strong> zeros of G. Furthermore, <strong>in</strong> connection with §9.3, these constra<strong>in</strong>tsare shown to be more severe due to a structural requirement ofdiagonalization imposed by the condition of isolation.We recast the FDI problem as a filter<strong>in</strong>g problem with<strong>in</strong> the generalframework of Figure 7.1 <strong>in</strong> Chapter 7 by tak<strong>in</strong>g z as the fault to be detected,<strong>and</strong> ^z to be the residual. By choos<strong>in</strong>g the structure of G, it is possibleto model a wide variety of faults. For example, the schemes of Figure 9.1represent the important cases of actuator <strong>and</strong> sensor faults. A fault <strong>in</strong> thei-th actuator (or sensor) is modeled by z i ≠ 0. Noth<strong>in</strong>g is assumed aboutthe mode of fault, i.e., z is an arbitrary function of time.vw✲✲✲G y✲+❤+✻z✲y✲vw✲✲✲G y+ ❄✲ ❥+z✲y✲FIGURE 9.1. Plant models for actuator (left) <strong>and</strong> sensor (right) faults.In this context, M maps the fault z to the residual ^z, <strong>and</strong> P maps the faultto the “detection error” ˜z. Thus, M <strong>and</strong> P quantify the quality of detection.We will say that F is an FDI filter if(i) (detection) for all <strong>in</strong>itial conditions <strong>in</strong> G <strong>and</strong> F, <strong>and</strong> <strong>in</strong> the absence ofdisturbances, i.e., v, w = 0, then ^z(t) → 0; <strong>and</strong>(ii) (isolation) if z i ≠ 0, then ^z i ≠ 0 <strong>and</strong> ^z j = 0, j ≠ i.Condition (i) is equivalent to the requirement that F is a BEE. Condition (ii)is equivalent to the requirement that M is diagonal. This is not the only


9.4 Application to Fault Detection 207structure that allows isolation (Massoumnia et al., 1989), but is often studiedbecause (a) it renders the problem of isolation trivial, <strong>and</strong> (b) it permitsisolation of simultaneous faults.The filter<strong>in</strong>g sensitivity functions correspond<strong>in</strong>g to the problems of actuator<strong>and</strong> sensor FDI have the forms given by Table 9.1.PMActuator FDI I − FG y FG ySensor FDI I − F FTABLE 9.1. Filter<strong>in</strong>g sensitivity functions for actuator <strong>and</strong> sensor FDI.We illustrate the ideas for the case of sensor faults; similar analyses canbe performed for the case of actuator faults. Notice that, at first sight, theproblem of sensor FDI may appear to be unconstra<strong>in</strong>ed, s<strong>in</strong>ce P <strong>and</strong> Monly depend on the filter, which is the variable of design. Yet, even for detectionalone, F must <strong>in</strong>corporate <strong>in</strong>formation of the unstable dynamics ofthe plant, s<strong>in</strong>ce the filter is a BEE. To see this, let G y = ˜D −1 Ñ = ND −1 ,where ( ˜D, Ñ), <strong>and</strong> (N, D) are left <strong>and</strong> right coprime factorizations of G y(Vidyasagar, 1985). Then, us<strong>in</strong>g a parametrization of all BEEs (see e.g.,Seron <strong>and</strong> Goodw<strong>in</strong>, 1995), it is easy to show that F necessarily has theform F = ˜Q ˜D, where ˜Q is any stable, proper transfer function of appropriatedimensions. Therefore, M <strong>in</strong> the case of sensor FDI has zeros at theunstable poles of the plant. In addition, if isolation is required, then thefilter must not only be a BEE, but also diagonalize the filter<strong>in</strong>g complementarysensitivity M. As we have seen <strong>in</strong> §9.3, this has the associatedcost — quantified by (9.15) — of further worsen<strong>in</strong>g the design limitationsaris<strong>in</strong>g from unstable poles of the plant with noncanonical directions.For sensor FDI, M not only measures the quality of detection as mentionedabove, but it also maps measurement noise w <strong>and</strong> the relative effectof the <strong>in</strong>put disturbance v to both ^z <strong>and</strong> ˜z, the residual <strong>and</strong> detection errorrespectively. Hence P <strong>and</strong> M may be used to specify the properties of thefilter to adequately detect faults <strong>and</strong> reject disturbances. These objectivescan sometimes be conflict<strong>in</strong>g, as is the case for <strong>in</strong>put disturbances <strong>and</strong>low frequency faults, but can successfully be achieved, for example, <strong>in</strong> thecase of low frequency faults <strong>and</strong> high frequency disturbances. In the lattercase, a suitable shape for the diagonal elements of M will require that|M kk (jω)| be close to one at low frequencies 3 <strong>and</strong> close to zero at highfrequencies where the power spectrum of the disturbance concentrates itsenergy. These objectives can be translated <strong>in</strong>to the follow<strong>in</strong>g design specifications:(i) |M kk (jω)| ≈ 1 on [0, ω 1 ],3 For the k-residual to be sensitive to the k-fault, we require ˜z k to be small, <strong>and</strong> therefore|M kk (jω)| to be close to one at frequencies where the fault is assumed to occur.


208 9. MIMO Filter<strong>in</strong>g(ii) |M kk (jω)| < ɛ k < 1 on [ω 2 , ∞],with ω 1 < ω 2 . Us<strong>in</strong>g these requirements <strong>and</strong> Theorem 9.3.1 it is easy tosee that the follow<strong>in</strong>g lower bound holds for the peak of M kk between ω 1<strong>and</strong> ω 2 .Corollary 9.4.1. Assume that the conditions of Theorem 9.3.1 hold. Thenthe follow<strong>in</strong>g <strong>in</strong>equality is satisfied:( ) π−Θp(ω 2)1 Θ p(ω 2 )−Θ p(ω 1 )sup |M kk (jω)| ≥. (9.20)ω∈[ω 1 ,ω 2 ]ɛ kProof. Immediate from Theorem 9.3.1 <strong>and</strong> the given specifications.□We can argue from the above corollary that a large peak may occur ifɛ k is too small, or if ω 1 is too close to ω 2 . As just discussed, this mayhave a deleterious impact on detection if there are <strong>in</strong>put disturbances withfrequency content with<strong>in</strong> [ω 1 , ω 2 ].Corollary 9.4.1 shows the trade-offs <strong>in</strong> sensor FDI due to the constra<strong>in</strong>tsimposed by unstable poles of the plant G y . Similar trade-offs occur <strong>in</strong> thecase of actuator FDI <strong>in</strong> connection with nonm<strong>in</strong>imum phase zeros of G y .These results may also be extended to the case of general component FDI,where there are trade-offs due to both ORHP zeros <strong>and</strong> poles of the plant.9.5 SummaryIn this chapter we have presented <strong>in</strong>tegral constra<strong>in</strong>ts for multivariablefilter<strong>in</strong>g problems. In contrast to the SISO case, a new dimension arises <strong>in</strong>the analysis of MIMO filter<strong>in</strong>g limitations s<strong>in</strong>ce the structural propertiesof these functions have nonneglectable impact. In particular, problems <strong>in</strong>which diagonalization is required may <strong>in</strong>troduce additional limitations.As one possible application, we have considered problems of fault detection<strong>and</strong> isolation <strong>in</strong> a multivariable filter<strong>in</strong>g sett<strong>in</strong>g.Notes <strong>and</strong> ReferencesThe results <strong>in</strong> this chapter follow Braslavsky et al. (1996). This latter work is themultivariable extension along the l<strong>in</strong>es of Gómez <strong>and</strong> Goodw<strong>in</strong> (1995) of the results<strong>in</strong> Seron <strong>and</strong> Goodw<strong>in</strong> (1995).Other results on limitations for multivariable filter<strong>in</strong>g are given <strong>in</strong> Weller (1996),where <strong>in</strong>tegral constra<strong>in</strong>ts on the maximum s<strong>in</strong>gular values of the sensitivity functionsare obta<strong>in</strong>ed. This latter work follows the ideas of Chen (1995).


10Extensions to SISO PredictionIn Chapter 7 we def<strong>in</strong>ed filter<strong>in</strong>g sensitivities, P <strong>and</strong> M, <strong>and</strong> showed thatthey satisfy a complementarity constra<strong>in</strong>t. Furthermore, for the class ofBEEs, we derived, <strong>in</strong> Chapters 8 <strong>and</strong> 9, <strong>in</strong>terpolation <strong>and</strong> <strong>in</strong>tegral constra<strong>in</strong>tsthat these sensitivities must satisfy. As seen, these constra<strong>in</strong>ts quantifyfundamental limits on the filter achievable performance.Equivalent sensitivities <strong>and</strong> constra<strong>in</strong>ts are obta<strong>in</strong>ed for the closely relatedestimation problem of BEE-based l<strong>in</strong>ear prediction, which is the subjectof the present chapter. As we will see, the additional cost associatedwith the process of prediction is clearly quantifiable us<strong>in</strong>g the results summarizedbelow by Theorems 10.5.1 <strong>and</strong> 10.5.2. In particular, we will discussthe <strong>in</strong>fluence of the prediction horizon on the achievable performance,as measured by the prediction sensitivity functions. The ma<strong>in</strong> conclusionsare illustrated by examples where we analyze limitations <strong>in</strong> predictorsbased on Kalman filters.10.1 General Prediction ProblemThe problem of prediction consists of estimat<strong>in</strong>g a signal z at time t + τ,where τ > 0 is the prediction horizon, given observations up to time t. Similarto the filter<strong>in</strong>g case, we will assume here that z is a l<strong>in</strong>ear comb<strong>in</strong>ationof states of a LTI system, of which a different comb<strong>in</strong>ation of states is availablefor measurement.


210 10. Extensions to SISO PredictionIt is always possible to construct predictors from a given filter. To thisend, suppose that we want to predict the partial state z of the systemẋ = Ax + Bv , x(0) = x 0 ,z = C 1 x + D 1 v ,y = C 2 x + D 2 v + w ,(10.1)where the pair (A, C 2 ) is detectable. For simplicity, we will also assumethat the pair (A, B) is stabilizable. As <strong>in</strong> the filter<strong>in</strong>g case, we partition theplant (10.1) as shown <strong>in</strong> Figure 10.1.❜ v ✲❜ w ✲✲✲G zG y✲+❣+✻✲zy✲FIGURE 10.1. Structure of the plant.Suppose that we have already designed a full-state filter hav<strong>in</strong>g the(transfer function) formWe will then consider predictors for z of the form^X(s) = F(s)Y(s) . (10.2)^z(t + τ|t) = C 1 e Aτ^x(t) , (10.3)where ^x(t) is the estimate given by the filter (10.2) at time t, <strong>and</strong> A is theevolution matrix of the system (10.1). The notation ^z(t + τ|t) <strong>in</strong>dicates aprediction of z at time t + τ given <strong>in</strong>formation up to time t.To motivate the predictor given by (10.3), note from the system equations(10.1) thatz(t + τ) = C 1 e Aτ x(t) +∫ t+τtC 1 e A(t+τ−σ) Bv(σ)dσ . (10.4)Thus, future states are a particular l<strong>in</strong>ear comb<strong>in</strong>ation of the current statex plus a function of the process <strong>in</strong>put v over the <strong>in</strong>terval [t, t + τ]. If weassume that at time t we have no <strong>in</strong>formation about v(·) over the <strong>in</strong>terval[t, t + τ], a natural τ-predictor is constructed as <strong>in</strong> (10.3).Example 10.1.1. Under the same assumptions that guarantee the optimality<strong>in</strong> the least-squares sense of the Kalman filter (Kalman <strong>and</strong> Bucy, 1961),Kailath (1968) showed that the least-squares optimal predictor for the system(10.1) has the form (10.3), where ^x(t) is the state estimate given by theKalman filter.◦


10.1 General Prediction Problem 211More generally, (10.3) is a sensible choice as a predictor on the assumptionthat we have already chosen ^x(t) to be reasonable <strong>in</strong> some sense.We def<strong>in</strong>e the prediction error to be the difference between the futurevalue of z, i.e., z(t + τ), <strong>and</strong> the output of the predictor, namely<strong>and</strong>, us<strong>in</strong>g (10.3)˜z(t + τ|t) z(t + τ) − ^z(t + τ|t),˜z(t + τ|t) = z(t + τ) − C 1 e Aτ^x(t) . (10.5)In the case of prediction, it is convenient to def<strong>in</strong>e a modified Laplacetransform. For a function h(t) we def<strong>in</strong>e it to be∫ ∞H(s) e −st h(t)dt .−τWe do this to avoid an extra term <strong>in</strong> the transform of the shifted statez(t + τ), which corresponds to the transform of the state z(t) truncatedto the <strong>in</strong>terval [0, τ]. This additional term is not aff<strong>in</strong>e <strong>in</strong> the transformsof the <strong>in</strong>put signals (see Appendix C), <strong>and</strong> hence would prevent us fromobta<strong>in</strong><strong>in</strong>g an expression for the mapp<strong>in</strong>g from <strong>in</strong>put v(t) to predictor error˜z(t + τ|t), H ˜zv , as a multiplication operator.Tak<strong>in</strong>g the (modified) Laplace transform of the prediction error <strong>in</strong> (10.5),we have,˜Z(s) = e sτ Z(s) − C 1 e Aτ ^X(s) . (10.6)We <strong>in</strong>troduce the transfer functionF p (s) C 1 e Aτ F(s) , (10.7)which maps y to ^z. Us<strong>in</strong>g (10.7) <strong>and</strong> (10.2) <strong>in</strong> (10.6), we have˜Z(s) = e sτ Z(s) − F p (s)Y(s) . (10.8)The prediction loop depict<strong>in</strong>g the above equation is shown <strong>in</strong> Figure 10.2.❜ v ✲z✲✲ ❥ ✲˜ze sτ +G✻−❜ w ✲y✲^zF pFIGURE 10.2. General configuration for prediction.As <strong>in</strong> the filter<strong>in</strong>g case, the def<strong>in</strong>ition of the prediction sensitivities usesthe mapp<strong>in</strong>gs H^zv <strong>and</strong> H ˜zv . In the case of prediction, note that we considerH^zv as mapp<strong>in</strong>g <strong>in</strong>put v(t) to predictor estimate ^z(t + τ|t), <strong>and</strong> H ˜zv as


212 10. Extensions to SISO Predictionmapp<strong>in</strong>g <strong>in</strong>put v(t) to predictor error ˜z(t+τ|t). The operator H ˜zv can thenbe derived from (10.8) asH ˜zv (s) = e sτ H zv (s) − F p (s)H yv (s) . (10.9)Similarly, from (10.3), (10.2) <strong>and</strong> (10.7), we haveH^zv (s) = F p (s)H yv (s) . (10.10)In the follow<strong>in</strong>g section we use the above mapp<strong>in</strong>gs <strong>in</strong> the def<strong>in</strong>ition ofsensitivity functions for the problem of prediction.10.2 Sensitivity FunctionsSimilarly to the filter<strong>in</strong>g case, we def<strong>in</strong>e the prediction sensitivity <strong>and</strong> complementarysensitivity functions, denoted by P <strong>and</strong> M, respectively, as 1P(s) e −sτ H ˜zv (s)H −1zv (s) ,M(s) e −sτ H^zv (s) H −1zv (s) ,(10.11)where H ˜zv <strong>and</strong> H^zv are the transfer functions given <strong>in</strong> (10.9) <strong>and</strong> (10.10),respectively.Therefore, we obta<strong>in</strong> the complementarity constra<strong>in</strong>t for prediction statedbelow.Theorem 10.2.1. The prediction sensitivity <strong>and</strong> complementary sensitivitydef<strong>in</strong>ed <strong>in</strong> (10.11) satisfy:P(s) + M(s) = I, (10.12)at any f<strong>in</strong>ite complex frequency s that is not a pole of P <strong>and</strong> M.Proof. Us<strong>in</strong>g (10.10) <strong>in</strong> (10.9) yieldsH ˜zv + H^zv = e sτ H zv . (10.13)The result then follows on us<strong>in</strong>g the def<strong>in</strong>itions (10.11).Us<strong>in</strong>g the partition of Figure 10.1, we have that H zv = G z <strong>and</strong> H yv =G y , <strong>and</strong> thus P <strong>and</strong> M are alternatively expressed byP(s) = [G z (s) − e −sτ F p (s)G y (s)]G −1z (s) ,M(s) = e −sτ F p (s)G y (s)G −1z (s) .□(10.14)In the follow<strong>in</strong>g section, we establish a result concern<strong>in</strong>g predictors derivedfrom BEEs, which essentially shows that the BEE-derived predictor<strong>in</strong>herits properties from the orig<strong>in</strong>at<strong>in</strong>g filter.1 In Appendix C, we show the expression of the prediction sensitivities us<strong>in</strong>g the usualLaplace transform.


10.3 BEE Derived Predictors 21310.3 BEE Derived PredictorsFor convenience, we recall from Chapter 7 that a stable filter F <strong>in</strong> (10.2) isa BEE if <strong>and</strong> only if the transfer function (7.18), i.e.,˜G 0 (s) = [I − F(s)C 2 ](sI − A) −1 , (10.15)is stable. This, <strong>in</strong> turn, implies that the filter<strong>in</strong>g error transfer function H ˜zv ,from process <strong>in</strong>put to filter<strong>in</strong>g error, is stable.We will show here that the prediction error transfer function H ˜zv <strong>in</strong>(10.9) does not have f<strong>in</strong>ite CRHP poles whenever the generat<strong>in</strong>g filter, ^x,is a BEE. Note that, due to the presence of the the entire function 2 e sτ <strong>in</strong>the expression of H ˜zv for prediction, we no longer refer to the stability ofthe transfer function, but rather to its analyticity for each f<strong>in</strong>ite complexnumber s <strong>in</strong> the CRHP.Lemma 10.3.1 (Analyticity of H ˜zv ). Suppose that (10.2) is a BEE, <strong>and</strong> considerthe predictor given by (10.2), (10.3), for the system (10.1). Then thetransfer function H ˜zv <strong>in</strong> (10.9) is analytic for each f<strong>in</strong>ite complex number s<strong>in</strong> the CRHP.Proof. From (10.1) <strong>and</strong> Figure 10.1, we haveH zv (s) = C 1 (sI − A) −1 B + D 1 ,H yv (s) = C 2 (sI − A) −1 B + D 2 .(10.16)Us<strong>in</strong>g (10.16) <strong>in</strong> (10.9), we can writeH ˜zv (s) = ˜H(s) + [e sτ D 1 − e Aτ F(s)D 2 ] ,where˜H(s) e sτ C 1 [I − e −(sI−A)τ F(s)C 2 ](sI − A) −1 B . (10.17)S<strong>in</strong>ce F is stable, the term between square brackets <strong>in</strong> the expression of H ˜zvis analytic for each f<strong>in</strong>ite complex number s <strong>in</strong> the CRHP. We hence focuson show<strong>in</strong>g that the same is true for ˜H.For each f<strong>in</strong>ite complex number s, we can exp<strong>and</strong> e −(sI−A)τ <strong>in</strong> a powerseries as (cf. (A.64) <strong>in</strong> Appendix A)e −(sI−A)τ = I +∞∑ (−τ) k(sI − A) k .k!k=12 We recall that an entire function is a complex valued function of the complex variable sthat is analytic for all f<strong>in</strong>ite s.


214 10. Extensions to SISO PredictionUs<strong>in</strong>g the above expression <strong>and</strong> (10.15), ˜H <strong>in</strong> (10.17) can be written as∞∑˜H(s) = e sτ C 1 ˜G 0 (s)B − e sτ (−τ) kC 1 (sI − A) k F(s)C 2 (sI − A) −1 B .k!k=1(10.18)The first term <strong>in</strong> (10.18) is analytic for each f<strong>in</strong>ite s <strong>in</strong> the CRHP. This isbecause ˜G 0 (s) belongs to a BEE by construction, <strong>and</strong> hence it is a stabletransfer function.As for the second term <strong>in</strong> (10.18), the only possibility of unboundedness<strong>in</strong> the f<strong>in</strong>ite CRHP comes from its value at the unstable poles of (sI−A) −1 ,i.e., unstable eigenvalues of the matrix A. Let p be one such eigenvalue.We will show that the second term <strong>in</strong> (10.18) is actually bounded whenevaluated at s = p. 3S<strong>in</strong>ce the generat<strong>in</strong>g filter F is a BEE, we know from Chapter 8 that thetransfer matrix (10.15) is stable. Pre-multiply<strong>in</strong>g (10.15) from the left by(sI − A), does not <strong>in</strong>troduce extra unstable poles. Hence,(sI − A)[I − F(s)C 2 ](sI − A) −1 = I − (sI − A)F(s)C 2 (sI − A) −1is analytic <strong>in</strong> the f<strong>in</strong>ite CRHP. Thus, s = p cannot be a pole of the aboveexpression. Evaluat<strong>in</strong>g its RHS at s = p, we have that the term (pI −A)F(p)C 2 (pI − A) −1 is bounded. S<strong>in</strong>ce p is f<strong>in</strong>ite, it then follows that thevalue at s = p of the sum <strong>in</strong> (10.18), namely∞∑ (−τ) k(pI − A) k F(p)C 2 (pI − A) −1 B, (10.19)k!k=1is bounded, show<strong>in</strong>g that the second term <strong>in</strong> (10.18) is also bounded foreach f<strong>in</strong>ite s <strong>in</strong> the CRHP. This completes the proof that H ˜zv is analytic <strong>in</strong>the f<strong>in</strong>ite CRHP.□The above result will be used to establish <strong>in</strong>terpolation constra<strong>in</strong>ts <strong>in</strong>the follow<strong>in</strong>g section.10.4 Interpolation Constra<strong>in</strong>tsIn this section we show that the prediction sensitivities satisfy similar <strong>in</strong>terpolationconstra<strong>in</strong>ts to the ones that affect the filter<strong>in</strong>g sensitivities, plusadditional ones <strong>in</strong>troduced by the prediction process. In the rema<strong>in</strong>der of3 Note that C 2 (sI − A) −1 B may not have a pole at s = p if (A, B) is not assumed to bestabilizable. In this case, the second term <strong>in</strong> (10.18) is trivially bounded. The analysis thatfollows, however, is <strong>in</strong> fact <strong>in</strong>dependent of B, <strong>and</strong> hence shows boundedness of the secondterm for B = I.


10.4 Interpolation Constra<strong>in</strong>ts 215the analysis for the case of prediction, we will assume that the orig<strong>in</strong>at<strong>in</strong>gfilter is a BEE <strong>and</strong> we focus on scalar systems.Recall first expressions (10.14) of P <strong>and</strong> M us<strong>in</strong>g the partition of Figure10.1. As was the case for the filter<strong>in</strong>g sensitivities, P <strong>and</strong> M are notnecessarily analytic <strong>in</strong> the f<strong>in</strong>ite ORHP, s<strong>in</strong>ce G z may have ORHP zerosthat are not canceled <strong>in</strong> the division. The follow<strong>in</strong>g result gives necessary<strong>and</strong> sufficient conditions for P <strong>and</strong> M to be analytic functions <strong>in</strong> the (f<strong>in</strong>ite)ORHP.Lemma 10.4.1. Consider the prediction sensitivities given <strong>in</strong> (10.14) <strong>and</strong>assume that F <strong>in</strong> (10.7) is a BEE. Then P <strong>and</strong> M are analytic <strong>in</strong> the (f<strong>in</strong>ite)ORHP if <strong>and</strong> only if one of the follow<strong>in</strong>g conditions hold:(i) G z is m<strong>in</strong>imum phase, or(ii) every ORHP zero of G z is also a zero of the product F p G y .Proof. Immediate from (10.14).□The follow<strong>in</strong>g lemma establishes the <strong>in</strong>terpolation constra<strong>in</strong>ts that P<strong>and</strong> M must satisfy at the ORHP poles <strong>and</strong> zeros of G y <strong>and</strong> G z .Lemma 10.4.2 (Interpolation Constra<strong>in</strong>ts). Assume that the orig<strong>in</strong>at<strong>in</strong>gfilter F <strong>in</strong> (10.7) is a BEE. Then P <strong>and</strong> M must satisfy the follow<strong>in</strong>g conditions.(i) If p ∈ ¢ + is a pole of G z , thenP(p) = 0, <strong>and</strong> M(p) = 1.(ii) If q ∈ ¢ + is a zero of G y that is not also zero of G z , thenP(q) = 1, <strong>and</strong> M(q) = 0.(iii) If q ∈ ¢ + is a zero of G z that is not also zero of F p G y , then P(s) <strong>and</strong>M(s) have a pole at s = q.Proof. If p ¢ ∈ + is a pole of G z = C 1 (sI − A) −1 B + D 1 , then p is aneigenvalue of A. It follows from Lemma 10.3.1 that H ˜zv (s) is analytic at s =p. Case (i) then follows from (10.11), s<strong>in</strong>ce H zv = G z . Case (ii) is immediatefrom (10.14) s<strong>in</strong>ce F — <strong>and</strong> hence F p <strong>in</strong> (10.7) — is stable. F<strong>in</strong>ally, case (iii)follows from Lemma 10.4.1.□As <strong>in</strong> §8.1 of Chapter 8, we denote by Z P <strong>and</strong> Z M the sets of ORHPzeros of P <strong>and</strong> M respectively, repeated accord<strong>in</strong>g to their multiplicities.Then, Lemma 10.4.2 identifies subsets of Z P <strong>and</strong> Z M . It is easy to see thatZ M is completed by those zeros of F p that are not also zeros of G z . Asfor P, it may have (possibly <strong>in</strong>f<strong>in</strong>itely many) other ORHP zeros, which are


216 10. Extensions to SISO Predictiona result of the prediction action. Specifically, the function M <strong>in</strong> (10.14) isanalytic <strong>in</strong> a neighborhood of <strong>in</strong>f<strong>in</strong>ity, <strong>and</strong>, due to the factor e −sτ , it hasan essential s<strong>in</strong>gularity at <strong>in</strong>f<strong>in</strong>ity (see Example A.8.3 <strong>in</strong> Appendix A). Itthen follows from the Great Picard Theorem (Conway, 1973) that, <strong>in</strong> eachneighborhood of <strong>in</strong>f<strong>in</strong>ity, M assumes the value 1 (<strong>and</strong> hence P assumesthe value zero) an <strong>in</strong>f<strong>in</strong>ite number of times.The <strong>in</strong>f<strong>in</strong>ite sequence of zeros of P can be further studied by means ofLemma A.11.1 <strong>in</strong> Appendix A. Indeed, note that the numerator of P <strong>in</strong>(10.14) has the form of f <strong>in</strong> (A.87), for g 1 equal to m<strong>in</strong>us the numerator ofF p G y /G z <strong>and</strong> g 2 equal to the denom<strong>in</strong>ator of F p G y /G z . Then Lemma A.11.1applies withδ = RD F pG y, (10.20)G zwhere RD H denotes the relative degree of the transfer function H. If δ = 0,then η is given byη = − lims→∞G z (s)F p (s)G y (s) . (10.21)In §10.5, we restrict our attention to those cases for which δ ≥ 0, where δis def<strong>in</strong>ed <strong>in</strong> (10.20). Us<strong>in</strong>g Lemma A.11.1 we conclude that P has <strong>in</strong>f<strong>in</strong>itelymany zeros <strong>in</strong> the ORHP when δ = 0 <strong>and</strong> |η| < 1 <strong>in</strong> (10.21). When δ > 0,Lemma A.11.1 <strong>in</strong>dicates that the high frequency zeros of P have negativereal part. However, P may still have “low frequency” zeros of positive realpart, as shown <strong>in</strong> the follow<strong>in</strong>g example.Example 10.4.1. Let P <strong>in</strong> (10.14) be given byP = 1 − e −sτ e aτ b,e 1 s + e 2where e 1 , e 2 , b, a are positive real constants. Assume further that b > e 2 .The above sensitivity corresponds to prediction of a scalar plant hav<strong>in</strong>gan unstable pole at s = 1/a.The numerator of P isf(s) = e 1 s + e 2 − be aτ e −sτ .It was shown <strong>in</strong> Example A.11.1 of Appendix A that f will have ORHPzeros <strong>in</strong>side a semicircular contour of radius R < (be aτ −e 2 )/(e 1 +τ) <strong>in</strong> theORHP. Moreover, the number of those zeros <strong>in</strong>creases with the predictionhorizon τ.◦In summary, Z P <strong>in</strong> (8.1), <strong>in</strong> contrast to Z M , may have an <strong>in</strong>f<strong>in</strong>ite numberof elements, which are zeros of P <strong>in</strong>troduced by the prediction process.Although some <strong>in</strong>formation about these zeros can be obta<strong>in</strong>ed <strong>in</strong> specificcases (see §10.6.2), their exact location is, <strong>in</strong> general, difficult to compute.In the follow<strong>in</strong>g section, we will use the <strong>in</strong>terpolation constra<strong>in</strong>ts providedby Lemma 10.4.2 to derive <strong>in</strong>tegral relations on the frequency responsesof the prediction sensitivities.


10.5 Integral Constra<strong>in</strong>ts 21710.5 Integral Constra<strong>in</strong>tsBefore stat<strong>in</strong>g the Poisson <strong>in</strong>tegral theorems for prediction, we <strong>in</strong>troducesome notation. Let the sets (8.1) be given byZ P = {p i : i = 1, . . . , n p } ,Z M = {q i : i = 1, . . . , n q } ,<strong>and</strong> let the correspond<strong>in</strong>g Blaschke products be 4B P (s) =n p∏i=1ns − p i∏ q, <strong>and</strong> B M (s) =s + p ii=1s − q is + q i. (10.22)Note that, accord<strong>in</strong>g to the discussion <strong>in</strong> the previous section, n p , i.e., thenumber of ORHP zeros of P, may be <strong>in</strong>f<strong>in</strong>ity.Follow<strong>in</strong>g the filter<strong>in</strong>g case <strong>in</strong> Chapter 8, we <strong>in</strong>troduce the sets of zeros:Z z {s ∈ ¢ + : G z (s) = 0 <strong>and</strong> F p (s)G y (s) ≠ 0} ,Z y {s ∈ ¢ + : G y (s) = 0 <strong>and</strong> G z (s) ≠ 0} ,Z 1/z {s ∈ ¢ + : G −1z (s) = 0} ,(10.23)<strong>and</strong> denote the correspond<strong>in</strong>g Blaschke products by B z , B y <strong>and</strong> B 1/z . Notethat Z z is the set of ORHP poles of P <strong>and</strong> M.In order to restrict the behavior at <strong>in</strong>f<strong>in</strong>ity of the sensitivities, we requirethe follow<strong>in</strong>gAssumption 10.1. δ = RD F pG yG z≥ 0. ◦Under Assumption 10.1 the function B z P is analytic <strong>and</strong> bounded <strong>in</strong> theORHP. It follows from De Branges (1968, Theorem 8, p. 20) that P <strong>in</strong> (10.14)can be factorized <strong>in</strong> the formP = ˜PB P /B z , (10.24)where ˜P has no zeros or poles <strong>in</strong> the ORHP. The convergence of the Blaschkeproducts <strong>in</strong> (10.24) also follows from De Branges (1968, Theorem 8, p. 20),even when n p = ∞.Similarly, M <strong>in</strong> (10.14) can be factorized asM = e −sτ ˜MB M /B z , (10.25)where ˜M has no zeros or poles <strong>in</strong> the ORHP.The follow<strong>in</strong>g results use the factorizations (10.24) <strong>and</strong> (10.25) to obta<strong>in</strong><strong>in</strong>tegral constra<strong>in</strong>ts on the functions log |P| <strong>and</strong> log |M|.4 Recall that, if the set Z orig<strong>in</strong>at<strong>in</strong>g the Blaschke product B is empty, we def<strong>in</strong>e B(s) = 1,∀s ∈ .


218 10. Extensions to SISO PredictionTheorem 10.5.1 (Poisson Integral for P). Suppose that F <strong>in</strong> (10.7) is a BEE<strong>and</strong> consider the prediction sensitivity, P, def<strong>in</strong>ed <strong>in</strong> (10.14). Let q = σ q +jω q , σ q > 0, be a zero of G y that is not also zero of G z (i.e., q ∈ Z y <strong>in</strong>(10.23)). Then, under Assumption 10.1,∫ ∞−∞σ qlog |P(jω)|σ 2 q + (ω q − ω) 2 dω = π log ∣ B−1P(q)∣ ∣∣ − π log ∣B −1z (q) ∣ .(10.26)Proof. ˜P <strong>in</strong> (10.24) has no zeros or poles <strong>in</strong> the ORHP. Also, under Assumption10.1, B z P is bounded <strong>in</strong> the ORHP, <strong>and</strong> thus it is not difficult to seethat the same is true for ˜P. Hence, we can apply the Poisson <strong>in</strong>tegral formulato log ˜P as <strong>in</strong> Theorem 3.3.1 of Chapter 3.□Theorem 10.5.2 (Poisson Integral for M). Suppose that F <strong>in</strong> (10.7) is aBEE <strong>and</strong> consider the prediction complementary sensitivity, M, def<strong>in</strong>ed<strong>in</strong> (10.14). Let p = σ p + jω p , σ p > 0, be a pole of G z (i.e., p ∈ Z 1/z <strong>in</strong>(10.23)). Then, under Assumption 10.1,∫ ∞−∞σ plog |M(jω)|σ 2 p + (ω p − ω) 2 dω = πσ pτ + π log ∣ ∣B −1M (p)∣ ∣ −π log ∣ B−1z (p) ∣ (10.27).Proof. ˜M <strong>in</strong> (10.25) has no zeros or poles <strong>in</strong> the ORHP <strong>and</strong>, s<strong>in</strong>ce Assumption10.1 holds, it is bounded <strong>in</strong> the ORHP. We can then apply the Poisson<strong>in</strong>tegral formula to log ˜M as <strong>in</strong> Theorem 3.3.1.□The results <strong>in</strong> Theorems 10.5.1 <strong>and</strong> 10.5.2 are affected by the particularchoice of predictor. However, we readily obta<strong>in</strong> the follow<strong>in</strong>g corollary,which presents a constra<strong>in</strong>t that is <strong>in</strong>dependent of the estimator parameters.Corollary 10.5.3. Consider the prediction sensitivities, P <strong>and</strong> M, given <strong>in</strong>(10.14), <strong>and</strong> suppose that F <strong>in</strong> (10.7) is a BEE. Consider the sets of zerosdef<strong>in</strong>ed <strong>in</strong> (10.23), <strong>and</strong> assume further that every ORHP zero of G z is alsoa zero of G y (i.e., Z z = ∅). Then, under Assumption 10.1,(i) if q = σ q + jω q ∈ Z y , we have that∫ ∞−∞σ q∣ ∣ ∣∣Blog |P(jω)|σ 2 q + (ω q − ω) 2 dω ≥ π log −1 ∣∣1/z (q) ; (10.28)(ii) if p = σ p + jω p ∈ Z 1/z , we have that if, <strong>in</strong> addition, Z z = ∅, then∫ ∞−∞σ plog |M(jω)|σ 2 p + (ω p − ω) 2 dω ≥ πσ pτ + π log ∣ B−1y (p) ∣ .


10.6 Effect of the Prediction Horizon 219Proof. Same as the proof of Corollary 8.2.3.□Note that the <strong>in</strong>equality (10.28) would become an equality if we <strong>in</strong>cludeda term correspond<strong>in</strong>g to the Blaschke product of the zeros <strong>in</strong>troducedby the prediction process, which are <strong>in</strong> general not known.Clearly from the above results, the problem of prediction is subject tosimilar constra<strong>in</strong>ts <strong>in</strong> performance — aris<strong>in</strong>g from ORHP poles <strong>and</strong> zerosof the plant — as those present <strong>in</strong> filter<strong>in</strong>g. As a matter of fact, the restrictions<strong>in</strong> prediction are <strong>in</strong>herently more severe, as can be directly seen, forexample, from the <strong>in</strong>tegral constra<strong>in</strong>t on M, which worsens as the predictionhorizon τ exp<strong>and</strong>s. Indeed, a worse constra<strong>in</strong>t means unavoidablehigher sensitivity to noise, which is naturally expected, s<strong>in</strong>ce the forecastof <strong>in</strong>formation, i.e., prediction, generally becomes more difficult <strong>in</strong> noisyenvironments (Anderson <strong>and</strong> Moore, 1979, p. 11).10.6 Effect of the Prediction HorizonAs a way of <strong>in</strong>terpret<strong>in</strong>g Theorems 10.5.1 <strong>and</strong> 10.5.2, we provide here amore detailed discussion of the <strong>in</strong>fluence of the prediction horizon on thefrequency responses of the prediction sensitivities <strong>and</strong> their correspond<strong>in</strong>g<strong>in</strong>tegral constra<strong>in</strong>ts. In order to keep the analysis simple, we assumethat G z is m<strong>in</strong>imum phase, i.e., the prediction sensitivities are analytic <strong>in</strong> theORHP. We consider two cases separately: (i) large values of τ, which canbe studied directly from the expressions of the sensitivities; <strong>and</strong> (ii) <strong>in</strong>termediatevalues of τ, which we study us<strong>in</strong>g Theorems 10.5.1 <strong>and</strong> 10.5.2.10.6.1 Large Values of τConsider the expressions of P <strong>and</strong> M given <strong>in</strong> (10.14), evaluated on theimag<strong>in</strong>ary axis. Some prelim<strong>in</strong>ary conclusions may be drawn for largevalues of the prediction horizon by just analys<strong>in</strong>g the expressions of theprediction sensitivities. S<strong>in</strong>ce the analysis varies if the plant — more precisely,G z — is stable or not, we treat these two cases separately.Unstable G zIf the system matrix A is unstable <strong>and</strong> at least one unstable eigenvalue isobservable from z = C 1 x + D 1 v (i.e., G z is unstable), then, from the def<strong>in</strong>itionof F p <strong>in</strong> (10.7), C 1 e Aτ will <strong>in</strong>crease exponentially with τ. Thus, for asufficiently large prediction horizon, the second term <strong>in</strong> the expression ofP <strong>in</strong> (10.14) will become dom<strong>in</strong>ant, <strong>and</strong> hence the magnitude of both sensitivitieson the imag<strong>in</strong>ary axis will tend to be equally large, which meansthat there will be equally high sensitivity to noise <strong>in</strong> both the estimate <strong>and</strong>the prediction error.


220 10. Extensions to SISO PredictionStable G zIf the system matrix A is stable, then |M(jω)| will become negligible (i.e.,|M(jω)| → 0) when τ goes beyond several multiples of the dom<strong>in</strong>ant timeconstant of the system 5 observable from z. Correspond<strong>in</strong>gly, the magnitudeof P(jω) will tend to one at all frequencies. This is the opposite to theideal desired situation, <strong>and</strong> may be attributed to the fact that, after sucha large τ, no more <strong>in</strong>formation about the system is carried by the filterestimate that gives orig<strong>in</strong> to the predictor, <strong>and</strong> hence there is no po<strong>in</strong>t <strong>in</strong>predict<strong>in</strong>g beyond this time.10.6.2 Intermediate Values of τFor <strong>in</strong>termediate values of the prediction horizon, we can obta<strong>in</strong> further<strong>in</strong>formation from the <strong>in</strong>tegral constra<strong>in</strong>ts (10.26) <strong>and</strong> (10.27). Once more,we analyze both unstable <strong>and</strong> stable cases separately, assum<strong>in</strong>g alwaysthat G y has nonm<strong>in</strong>imum phase zeros.Unstable G zThe first situation that we consider is where both P <strong>and</strong> M have zeros <strong>in</strong>dependentof the prediction process. This is the case when G z is unstable<strong>and</strong> G y is nonm<strong>in</strong>imum phase (recall that we assume that G z is m<strong>in</strong>imumphase). Note that both sensitivities are constra<strong>in</strong>ed already <strong>in</strong> the equivalentfilter<strong>in</strong>g problem, as seen from Theorems 8.2.1 <strong>and</strong> 8.2.2 <strong>in</strong> Chapter 8.Under these conditions, the term σ p τ on the RHS of (10.27), (where σ p isthe real part of each unstable pole of G z ), will be present. Hence, <strong>in</strong>dependentof the existence of additional ORHP zeros of P orig<strong>in</strong>at<strong>in</strong>g from theprediction process, the value of the weighted <strong>in</strong>tegral of M, <strong>in</strong> equation(10.27), <strong>in</strong>creases directly proportional to the prediction horizon τ, giv<strong>in</strong>gan additional cost to that for filter<strong>in</strong>g under the same conditions.The correspond<strong>in</strong>g situation for P is not as straightforward, but we canstill draw some conclusions. From §10.4, we know that P has <strong>in</strong>f<strong>in</strong>itelymany zeros, <strong>and</strong> the asymptotic location of these zeros depends on therelative degree δ given <strong>in</strong> (10.20). When δ > 0, the sequence of zeros ofP converge to the OLHP. However, as shown <strong>in</strong> Example 10.4.1 (see alsoExample 10.6.1 below), the first zeros of the sequence are likely to lie <strong>in</strong> theORHP, <strong>and</strong> even <strong>in</strong>crease <strong>in</strong> number with τ. Thus, the term depend<strong>in</strong>g onthe <strong>in</strong>verse of the Blaschke product B P (q) (where q is an ORHP zero of G y )on the RHS of (10.26) may be <strong>in</strong>creased with respect to the correspond<strong>in</strong>gfilter<strong>in</strong>g term.5 The dom<strong>in</strong>ant time constant of a stable system can be taken as the <strong>in</strong>verse of the real partof the eigenvalue with smallest magnitude for the real part.


10.6 Effect of the Prediction Horizon 221When δ = 0, the asymptotic locus of the sequence of prediction zeros ofP depends on the value of η <strong>in</strong> (10.21), i.e., ofG z (s)η = − lims→∞ C 1 e Aτ F(s)G y (s) , (10.29)where we have replaced F p us<strong>in</strong>g (10.7). Indeed, the sequence of zeroswill converge to the ORHP if |η| < 1 <strong>in</strong> (10.29). Note that η has the vectorC 1 e Aτ <strong>in</strong> its denom<strong>in</strong>ator. S<strong>in</strong>ce some unstable eigenvalues of A are observablefrom z (s<strong>in</strong>ce G z is unstable), a similar analysis as before tells usthat the denom<strong>in</strong>ator of |η| will grow exponentially with τ. It follows thatthere exists a sufficiently large value of τ for which |η| will become smallerthan 1, <strong>and</strong> then the term depend<strong>in</strong>g on the <strong>in</strong>verse of the Blaschke productB P (q) (where q is an ORHP zero of G y ) on the RHS of (10.26) will be<strong>in</strong>creased with respect to the correspond<strong>in</strong>g filter<strong>in</strong>g term. This producesan <strong>in</strong>crease <strong>in</strong> the value of the weighted <strong>in</strong>tegral of P, though this time notdirectly proportional to τ. The follow<strong>in</strong>g example illustrates some of thesepo<strong>in</strong>ts.Example 10.6.1. We consider aga<strong>in</strong> the Kalman filter design given <strong>in</strong> Example8.4.1 of Chapter 8. Follow<strong>in</strong>g the idea described <strong>in</strong> §10.1, here wederive a predictor from the Kalman filter obta<strong>in</strong>ed with weights Q = 100<strong>and</strong> R = 1. We take the parameter a <strong>in</strong> (8.23) as a = 1.In contrast with the filter<strong>in</strong>g case, the prediction sensitivity has an <strong>in</strong>f<strong>in</strong>itenumber of zeros that may constra<strong>in</strong>t its magnitude on the jω-axis.As seen <strong>in</strong> Lemma A.11.1 <strong>in</strong> Appendix A, the asymptotic location of thesezeros depends on the sign of the number δ <strong>in</strong> (10.20). In this case we havethat δ is positive, <strong>and</strong> hence it follows that the high frequency zeros of Pconverge to the OLHP. However, P has low frequency ORHP zeros whosenumber <strong>in</strong>crease with the prediction horizon. Figures 10.3 <strong>and</strong> 10.4 showplots of log |P(s)| for this example <strong>in</strong> the case of filter<strong>in</strong>g (τ = 0) <strong>and</strong> predictionwith τ = 0.41, respectively. Notice that the negative peaks <strong>in</strong> theplots <strong>in</strong>dicate the position of the zeros of P. As we see from these figures,while there is a s<strong>in</strong>gle ORHP zero at s = 1 <strong>in</strong> the filter<strong>in</strong>g case, there arefive (one real <strong>and</strong> two complex pairs) <strong>in</strong> the case of prediction.Figure 10.5 shows the number of zeros of P <strong>in</strong> the ORHP as a function ofthe prediction horizon τ on the <strong>in</strong>terval [0, 1]. Clearly, the number of ORHPzeros of P is an <strong>in</strong>creas<strong>in</strong>g function of the prediction horizon. This fact willimply that the weighted <strong>in</strong>tegral, on the LHS of (10.26), also <strong>in</strong>creases withthe prediction horizon.As for M, we know from (10.27) <strong>and</strong> the previous discussion, that itsweighted <strong>in</strong>tegral <strong>in</strong>creases with the prediction horizon as well.The frequency responses of |P| <strong>and</strong> |M| are shown <strong>in</strong> Figures 10.6 <strong>and</strong> 10.7,respectively, for three different values of the horizon τ. Note that the filter<strong>in</strong>gsensitivities correspond to τ = 0 <strong>in</strong> those figures. Also note that, asτ <strong>in</strong>creases, so does the peak <strong>in</strong> the sensitivity functions, thus confirm<strong>in</strong>gthe additional cost associated with prediction.


222 10. Extensions to SISO Prediction22log |P(σ + jω)|0−2log |P(σ + jω)|0−2−44020jω0−20−20σ24−44020jω0−20−20σ24FIGURE 10.3. ORHP zeros of the filter<strong>in</strong>gsensitivity for the system of Examdictionsensitivity for the system of Ex-FIGURE 10.4. ORHP zeros of the preple10.6.1, τ = 0.ample 10.6.1, τ = 0.41.Nr. of ORHP zeros1816141210864200 0.2 0.4 0.6 0.8 1τFIGURE 10.5. ORHP zeros of P versus the prediction horizon τ.10886|P(jω)|642τ=0.5τ=0.25τ=0|M(jω)|42τ=0.5τ=0.25τ=0010 −4 10 −2 10 0ω10 2 10 4FIGURE 10.6. |P| achieved by theKalman filter-based predictor for theunstable plant of Example 10.6.1.010 −4 10 −2 10 0ω10 2 10 4FIGURE 10.7. |M| achieved by theKalman filter-based predictor for theunstable plant of Example 10.6.1.


10.6 Effect of the Prediction Horizon 223It is <strong>in</strong>terest<strong>in</strong>g to see that, although the magnitude of the predictionsensitivities exceeds that of the filter<strong>in</strong>g sensitivities over almost everyfrequency range, their shapes approximately follow those of the filter<strong>in</strong>gsensitivities. This phenomenon is <strong>in</strong> total accordance with what we discussedbefore: the same weighted <strong>in</strong>tegrals on both sensitivities, whichalready constra<strong>in</strong>ed the filter<strong>in</strong>g problem, now achieve larger values forprediction.◦Summariz<strong>in</strong>g, for the case of an unstable <strong>and</strong> nonm<strong>in</strong>imum phase plant,— i.e., when both sensitivities are already constra<strong>in</strong>ed <strong>in</strong> the filter<strong>in</strong>g case— the prediction process worsens the constra<strong>in</strong>ts imposed by ORHP poles <strong>and</strong>zeros of the plant with respect to the correspond<strong>in</strong>g filter<strong>in</strong>g case. Indeed, predictionadds positive terms to the RHSs of (10.27) <strong>and</strong> (<strong>in</strong> general) of(10.26) too. This means that the same weighted <strong>in</strong>tegrals on the LHS (withthe prediction sensitivities replac<strong>in</strong>g the filter<strong>in</strong>g sensitivities, of course)will achieve higher values for prediction. Additional <strong>in</strong>tegral constra<strong>in</strong>ts onM also appear when there are new ORHP zeros of P <strong>in</strong>troduced by prediction.Stable G zWe analize now the effect of τ when the plant is stable <strong>and</strong> G y has ORHPzeros. In this case only M has zeros <strong>in</strong>dependent of the prediction horizon,<strong>and</strong> therefore, only P is already constra<strong>in</strong>ed <strong>in</strong> the correspond<strong>in</strong>g filter<strong>in</strong>gproblem. Concern<strong>in</strong>g the sequence of zeros of P for τ ≠ 0, when δ > 0,the zeros converge to the OLHP <strong>and</strong>, for a stable plant, it is not clear thatthe first zeros of the sequence would lie <strong>in</strong> the ORHP. In this case, Theorems10.5.1 <strong>and</strong> 10.5.2 do not <strong>in</strong>dicate additional costs associated withprediction. The follow<strong>in</strong>g example illustrates this.Example 10.6.2 (Stable Plant. Case δ > 0). Consider the plant given bythe follow<strong>in</strong>g state-space model,[ ] [ −2 −3 1A =, B = , C1 00]1 = [ 1 2 ] , C 2 = [ 1 −2 ] . (10.30)For this stable plant, we construct a predictor based on a Kalman filterobta<strong>in</strong>ed with the weights Q = 100 <strong>and</strong> R = 1, <strong>and</strong> the same noise modelof Example 8.4.1 with the parameter a <strong>in</strong> (8.23) taken as a = 1.S<strong>in</strong>ce G y has a zero at q = 2, then the weighted <strong>in</strong>tegral of P <strong>in</strong> (10.26)can be evaluated with the RHS equal to 1. This, however, is the same caseas that of filter<strong>in</strong>g for the same problem. The achieved |P(jω)| is shown <strong>in</strong>Figure 10.9 for <strong>in</strong>creas<strong>in</strong>g values of the prediction horizon. It is seen fromthis figure that, although the value of the weighted <strong>in</strong>tegral <strong>in</strong> (10.26) isthe same as that for filter<strong>in</strong>g, the shape of |P(jω)| varies with τ.For this example, δ <strong>in</strong> (10.21) is equal to one. This says that the sequenceof <strong>in</strong>f<strong>in</strong>ite zeros of P converges to the OLHP. Further, no low frequency


224 10. Extensions to SISO PredictionORHP zero was found for several values of τ smaller than the dom<strong>in</strong>anttime constant of the system. Then, the weighted <strong>in</strong>tegral of M <strong>in</strong> (10.27)cannot even be stated. Thus, for this example, no additional cost associatedwith prediction is given by Theorems 10.5.1 <strong>and</strong> 10.5.2. Note that, theachieved |M(jω)|, shown <strong>in</strong> Figure 10.8, is seen to decrease to zero as τ<strong>in</strong>creases, whilst the achieved |P(jω)| tends to 1.11.50.8|P(jω)|τ=21τ=0.2τ=010 −4 10 −2 10 0 10 2 10 4ω|M(jω)|0.60.40.2τ=0τ=0.2τ=2010 −4 10 −2 10 0ω10 2 10 4FIGURE 10.8. |P| result<strong>in</strong>g from theKalman filter-based predictor for thestable plant of Example 10.6.2.FIGURE 10.9. |M| achieved by theKalman filter-based predictor for thestable plant of Example 10.6.2.In the case of a stable plant, the <strong>in</strong>terest<strong>in</strong>g situation is then when δ =0. Assume further that for a prediction horizon much smaller than thedom<strong>in</strong>ant time constant of the system observable from z, |η| <strong>in</strong> (10.29) issmaller than 1. For such a τ, then, P has an <strong>in</strong>f<strong>in</strong>ite sequence of zeros, p isay, converg<strong>in</strong>g to the ORHP. Hence, the term depend<strong>in</strong>g on the <strong>in</strong>verseof the Blaschke product B P (q) (where q is an ORHP zero of G y ) on theRHS of (10.26) will be <strong>in</strong>creased with respect to the correspond<strong>in</strong>g filter<strong>in</strong>gterm. Similarly, the <strong>in</strong>tegral constra<strong>in</strong>t (10.27) holds for each zero p i , withthe terms Re(p i )τ <strong>and</strong> log |B −1y (p i )|, on its RHS. The values of both RHSs<strong>in</strong> (10.26) <strong>and</strong> (10.27) are thus clear functions of τ. But this stops when τis large enough so that C 1 e Aτ <strong>in</strong> the denom<strong>in</strong>ator of η <strong>in</strong> (10.29) turns thevalue of |η| larger than 1. Thus, unless P has low frequency zeros <strong>in</strong> theORHP, the RHSs of both weighted <strong>in</strong>tegrals become <strong>in</strong>dependent of τ. Inparticular, the <strong>in</strong>tegral constra<strong>in</strong>t (10.27) on M disappears.Example 10.6.3 (Stable Plant. Case δ = 0.). Consider a Kalman filterbasedpredictor (Q = 100, R = 1 <strong>and</strong> a = 1) for the system given by A, B<strong>and</strong> C 2 <strong>in</strong> (10.30), but with C 1 = [0 1]. In this case, δ = 0 <strong>in</strong> (10.20). Moreover,it was found that, for τ smaller than approximately 0.77, |η| <strong>in</strong> (10.29)was smaller than one. Thus, for 0 < τ < 0.77, P has an <strong>in</strong>f<strong>in</strong>ite sequenceof zeros, p i say, converg<strong>in</strong>g to the ORHP. Hence, the term depend<strong>in</strong>g onthe <strong>in</strong>verse of the Blaschke product B P (q), for q = 2 (i.e., the zero of G y ),on the RHS of (10.26) will be <strong>in</strong>creased with respect to the correspond<strong>in</strong>g◦


10.6 Effect of the Prediction Horizon 225filter<strong>in</strong>g term. Similarly, the <strong>in</strong>tegral constra<strong>in</strong>t (10.27) holds for each zerop i , with the terms Re(p i )τ <strong>and</strong> log |B −1y (p i )|, on its RHS.Note that the worst constra<strong>in</strong>ts for M come from the first zeros of the<strong>in</strong>f<strong>in</strong>ite sequence of zeros of P, which are the closest to the nonm<strong>in</strong>imumphase zero of M at q = 2. This is because the magnitude of the <strong>in</strong>verseof a Blashke product achieves it maximum at the complex po<strong>in</strong>t equal tothe zero that generates it, <strong>and</strong> then decreases monotonically to 1. It wasfound by plott<strong>in</strong>g that, for τ = 0.25 the first pair of prediction zeros ofP where approximately p 1,2 = 1.9 ± j14.2. For τ = 0.5 the first pair ofprediction zeros of P where approximately p 1,2 = 0.6 ± j7.8. Then, theRHS of (10.27) is larger for τ = 0.25 than for τ = 0.5, particularly due to theterm Re(p i )τ. This effect is seen from Figure 10.11, where the magnitudeof |M(jω)| decreases as τ <strong>in</strong>creases.Figure 10.10 shows the achieved shapes of |P(jω)| for three values of τ<strong>in</strong> the above-mentioned range. Note that peaks greater than one cont<strong>in</strong>ueto appear. However, s<strong>in</strong>ce it is expected that the level of low frequencyreduction achieved by the filter<strong>in</strong>g sensitivity (τ = 0) should be worsenedby the process of prediction, then the peaks outside this low frequencyrange do not exceed those achieved for filter<strong>in</strong>g.322.51.52|P(jω)|1.5|M(jω)|110.5τ=0.5τ=0.25τ=00.5τ=0τ=0.25τ=5010 −4 10 −2 10 0ω10 2 10 4FIGURE 10.10. |P| result<strong>in</strong>g from theKalman filter-based predictor for thestable plant of Example 10.6.3.010 −4 10 −2 10 0ω10 2 10 4FIGURE 10.11. |M| result<strong>in</strong>g from theKalman filter-based predictor for thestable plant of Example 10.6.3.Summariz<strong>in</strong>g, for the case of stable <strong>and</strong> nonm<strong>in</strong>imum phase plants, onlythe filter<strong>in</strong>g sensitivity P is constra<strong>in</strong>ed. In the case of δ = 0, <strong>and</strong> for a certa<strong>in</strong>range of values of the prediction horizon, the prediction process adds aterm to the RHSs of (10.26) with respect to the correspond<strong>in</strong>g value forfilter<strong>in</strong>g. This means that the same weighted <strong>in</strong>tegral on the LHS of (10.26)will achieve higher values for prediction. Also, the complementary sensitivity Mwill start to be constra<strong>in</strong>ed. Moreover, <strong>in</strong>f<strong>in</strong>itely many <strong>in</strong>tegral constra<strong>in</strong>ts(10.27) can be stated on M, one for each of the prediction zeros.◦


226 10. Extensions to SISO Prediction10.7 SummaryThis chapter has extended the sensitivity analysis developed <strong>in</strong> Chapter 8for the problem of filter<strong>in</strong>g with bounded error estimators to the case ofprediction.The results show that there is an additional quantifiable cost associatedwith prediction, summarized <strong>in</strong> Theorems 10.5.1 <strong>and</strong> 10.5.2. The exact valuesof the weighted <strong>in</strong>tegrals of P <strong>and</strong> M are, <strong>in</strong> general, not computable.However, we have shown via different situations that the <strong>in</strong>tegral constra<strong>in</strong>tscan be used to study the effect of the prediction horizon on thefrequency responses of the sensitivities.Notes <strong>and</strong> ReferencesThe sensitivity results of this chapter are based on Seron (1995). Additional materialwas taken from Kailath (1981) <strong>and</strong> Kailath (1968).


11Extensions to SISO Smooth<strong>in</strong>gFollow<strong>in</strong>g similar developments to those <strong>in</strong> the previous chapter, a theoryof design limitations can also be extended to problems of fixed-lagsmooth<strong>in</strong>g. In this chapter, we def<strong>in</strong>e appropriate complementary sensitivities<strong>and</strong> obta<strong>in</strong> fundamental limitations that apply to scalar smoothersderived from BEEs.As opposed to the case of prediction, which <strong>in</strong>duces additional coststo that encountered <strong>in</strong> the correspond<strong>in</strong>g filter<strong>in</strong>g problem, smooth<strong>in</strong>gaccounts for an improvement <strong>in</strong> performance proportional to the extentof the smooth<strong>in</strong>g lag. This is certa<strong>in</strong>ly unsurpris<strong>in</strong>g, s<strong>in</strong>ce more <strong>in</strong>formationis taken <strong>in</strong>to account <strong>in</strong> order to produce a smoothed estimate, <strong>and</strong> sosmoothers will generally be expected to perform better than filters (Anderson<strong>and</strong> Moore, 1979, Chapter 7). Here, we cast <strong>in</strong> a sensitivity sett<strong>in</strong>gthis well-known improvement phenomenon due to smooth<strong>in</strong>g.11.1 General Smooth<strong>in</strong>g ProblemThe fixed lag smooth<strong>in</strong>g problem consists of estimat<strong>in</strong>g a signal z at timet − τ, where τ > 0 is the smooth<strong>in</strong>g lag, given measurements up to timet. Hence, smooth<strong>in</strong>g may be seen as the counterpart of the problem ofprediction dealt with <strong>in</strong> the preced<strong>in</strong>g chapter.Many solutions to the cont<strong>in</strong>uous time smooth<strong>in</strong>g problem appeared<strong>in</strong> the literature dur<strong>in</strong>g the 1960’s. Kailath <strong>and</strong> Frost (1968) showed thatmany of the already exist<strong>in</strong>g formulae could be derived us<strong>in</strong>g the <strong>in</strong>nova-


228 11. Extensions to SISO Smooth<strong>in</strong>gtions approach to least-squares estimation. In their work it was also establishedthat the least-squares smooth<strong>in</strong>g solution is completely determ<strong>in</strong>edby the results for the least-squares filter<strong>in</strong>g problem. This result is validfor a general second order (f<strong>in</strong>ite-variance) signal process <strong>in</strong> white noise.The class of smoothers that we consider here is <strong>in</strong>spired by the generalsmooth<strong>in</strong>g formula of Kailath <strong>and</strong> Frost (see also Kailath, 1981).As before, we assume that the signal z is the partial state of the systemẋ = Ax + Bv , x(0) = x 0 ,z = C 1 x + D 1 v ,y = C 2 x + D 2 v + w ,(11.1)where the pair (A, C 2 ) is detectable <strong>and</strong> the pair (A, B) is stabilizable. Supposethat we have constructed the follow<strong>in</strong>g full-state filter for (11.1):˙ξ = ^Aξ + K y y, ξ(0) = ξ 0 ,^x = ^Cξ ,(11.2)<strong>and</strong> let its Laplace transform be^X(s) = F(s)Y(s) . (11.3)We def<strong>in</strong>e the associated <strong>in</strong>novations process as (see §7.2.1 <strong>in</strong> Chapter 7)ι y − C 2^x , (11.4)where y is the measured output of (11.1). The <strong>in</strong>novations process ι(t)may be regarded as the new <strong>in</strong>formation that is brought <strong>in</strong>to the systemat time t (Kailath, 1968).In the present problem of smooth<strong>in</strong>g, we want to estimate z(t−τ) us<strong>in</strong>gall the data up to time t. A natural way to do this is to add a l<strong>in</strong>ear functionof the <strong>in</strong>novations process over the <strong>in</strong>terval [t − τ, t] to the filtered estimateat time t − τ. Motivated by the structure of the optimal least-squaressmoother (Kailath <strong>and</strong> Frost, 1968), we will then consider smoothers of theform∫ t^z(t − τ|t) = C 1^x(t − τ) + C 1 V 1 e ^A ′ (σ−t+τ) V 2 ι(σ)dσ , (11.5)t−τwhere ^A ′ is the transpose of the state transition matrix of the filter (11.2),<strong>and</strong> V 1 , V 2 are constant matrices of appropriate dimensions. The notation^z(t − τ|t) <strong>in</strong>dicates estimation at time t − τ given <strong>in</strong>formation up to time t.Note that, for consistency with the def<strong>in</strong>ition of the filter (11.2), the matrixV 1 must be of the form V 1 = ^C ¯V 1 , for some matrix ¯V 1 .Smoothers of the form (11.5) <strong>in</strong>clude, for example, the least-squaressmoother of Kailath <strong>and</strong> Frost (1968).


11.1 General Smooth<strong>in</strong>g Problem 229Example 11.1.1. Assume that, <strong>in</strong> (11.1), D 1 = D 2 = 0, <strong>and</strong> v <strong>and</strong> w areuncorrelated white noises with <strong>in</strong>cremental covariances equal to Q <strong>and</strong> R,respectively. Further, assume that the <strong>in</strong>itial condition x(0) is a zero-meanr<strong>and</strong>om variable uncorrelated with the process noise v.Let ^x(t) be the state estimate given by the Kalman filter for the system(11.1) <strong>and</strong> let Φ ≥ 0 be a stabiliz<strong>in</strong>g solution to the Riccati equationAΦ + ΦA ′ + BQB ′ − ΦC ′ 2R −1 C 2 Φ = 0. (11.6)Us<strong>in</strong>g the <strong>in</strong>novations technique, Kailath <strong>and</strong> Frost (1968) showed that afixed lag (τ > 0) least-squares smoother for the full-state x can be derivedfrom the Kalman filter as follows∫ t^x(t − τ|t) = ^x(t − τ) + Φe ^A ′ (σ−t+τ) C 2R ′ −1 ι(σ)dσ , (11.7)t−τi.e., <strong>in</strong> the general expression (11.5), the special choices V 1 = Φ <strong>and</strong> V 2 =C2 ′ R−1 are made <strong>in</strong> this particular design methodology.◦We def<strong>in</strong>e the smooth<strong>in</strong>g error to be the difference between the pastvalue of z, i.e., z(t − τ), <strong>and</strong> the output of the smoother, namelyUs<strong>in</strong>g (11.5), we have˜z(t − τ|t) = z(t − τ) − C 1^x(t − τ) −˜z(t − τ|t) z(t − τ) − ^z(t − τ|t) .∫ tC 1 V 1 e ^A ′ (σ−t+τ) V 2 ι(σ)dσ . (11.8)t−τAs before, we <strong>in</strong>troduce the <strong>in</strong>put-output operators H^zv , mapp<strong>in</strong>g <strong>in</strong>putv(t) to smoother estimate ^z(t − τ|t), <strong>and</strong> H ˜zv , mapp<strong>in</strong>g <strong>in</strong>put v(t) tosmooth<strong>in</strong>g error ˜z(t − τ|t). In order to def<strong>in</strong>e the smooth<strong>in</strong>g sensitivityfunctions, we require the Laplace transforms of these operators. We startderiv<strong>in</strong>g the transform of the <strong>in</strong>tegral <strong>in</strong> (11.8). Mak<strong>in</strong>g the change of variableα = t − σ <strong>in</strong> this <strong>in</strong>tegral, we obta<strong>in</strong>, assum<strong>in</strong>g that ι(t) = 0, t < 0,∫ t ∫ τC 1 V 1 e ^A ′ (σ−t+τ) V 2 ι(σ)dσ = C 1 V 1 e ^A ′ (τ−α) V 2 ι(t − α)dαt−τ0=∫ t0∫ t0C 1 V 1 e ^A ′ (τ−α) V 2 ι(t − α)dαH s (α) ι(t − α) dα= (H s ∗ ι)(t) , (11.9)where the symbol “∗” denotes real convolution <strong>and</strong> where the function H sis def<strong>in</strong>ed as (Anderson <strong>and</strong> Doan, 1977){C 1 V 1 e ^A ′ (τ−t) V 2 if 0 ≤ t ≤ τ,H s (t) =0 otherwise.


230 11. Extensions to SISO Smooth<strong>in</strong>gIt thus follows that the Laplace transform of (11.9) is H s (s)I(s), where Idenotes the Laplace transform of the <strong>in</strong>novations, <strong>and</strong> whereH s (s) C 1 V 1 (sI + ^A [ ]′ ) −1 e τ ^A ′ − e −sτ I V 2 . (11.10)The transforms of the smoother (11.5), <strong>and</strong> of the smooth<strong>in</strong>g error (11.8),are then given by^Z(s) = C 1 e −sτ ^X(s) + H s (s)I(s),˜Z(s) = e −sτ Z(s) − C 1 e −sτ ^X(s) − H s (s)I(s) .(11.11)Next, note from (11.4) <strong>and</strong> (11.3) that the transform of the <strong>in</strong>novations isgiven byI(s) = [I − C 2 F(s)]Y(s) . (11.12)Hence, us<strong>in</strong>g (11.3) <strong>and</strong> (11.12), the equations <strong>in</strong> (11.11) can be written as^Z(s) = F s (s)Y(s) ,˜Z(s) = e −sτ Z(s) − F s (s)Y(s) ,(11.13)where we have used the def<strong>in</strong>itionF s (s) e −sτ C 1 F(s) + H s (s)[I − C 2 F(s)] , (11.14)which is the transfer function that maps y to ^z. The smooth<strong>in</strong>g loop depict<strong>in</strong>gequations (11.13) is shown <strong>in</strong> Figure 11.1.❜ v ✲z +✲ ✲ ❥ ✲˜ze −sτG✻−❜ w ✲y✲^zF sFIGURE 11.1. General configuration for smooth<strong>in</strong>g.The operators H^zv <strong>and</strong> H ˜zv can now be derived from (11.13) asH^zv (s) = F s (s)H yv (s) , (11.15)H ˜zv (s) = e −sτ H zv (s) − F s (s)H yv (s) . (11.16)In the follow<strong>in</strong>g section we use the above mapp<strong>in</strong>gs <strong>in</strong> the def<strong>in</strong>ition ofsensitivity functions for fixed-lag smooth<strong>in</strong>g.11.2 Sensitivity FunctionsAs for the filter<strong>in</strong>g <strong>and</strong> prediction problems, we def<strong>in</strong>e the smooth<strong>in</strong>g sensitivity<strong>and</strong> complementary sensitivity functions, denoted also by P <strong>and</strong> M,


11.3 BEE Derived Smoothers 231respectively, asP(s) e sτ H ˜zv (s)H −1zv (s) ,M(s) e sτ H^zv (s) H −1zv (s) ,(11.17)where H ˜zv <strong>and</strong> H^zv are the transfer functions given <strong>in</strong> (11.16) <strong>and</strong> (11.15),respectively.Therefore, we obta<strong>in</strong> the complementarity constra<strong>in</strong>t for smooth<strong>in</strong>g statedbelow.Theorem 11.2.1. The smooth<strong>in</strong>g sensitivity <strong>and</strong> complementary sensitivitydef<strong>in</strong>ed <strong>in</strong> (11.17) satisfy:P(s) + M(s) = I , (11.18)at any f<strong>in</strong>ite complex frequency s that is not a pole of P <strong>and</strong> M.Proof. Us<strong>in</strong>g (11.15) <strong>in</strong> (11.16) yieldsH ˜zv + H^zv = e −sτ H zv . (11.19)The result then follows on us<strong>in</strong>g the def<strong>in</strong>itions (11.17).□In the follow<strong>in</strong>g section, we establish a result concern<strong>in</strong>g smoothers derivedfrom BEEs.11.3 BEE Derived SmoothersWe will show here that the smooth<strong>in</strong>g error transfer function H ˜zv <strong>in</strong> (11.16)does not have f<strong>in</strong>ite CRHP poles whenever the generat<strong>in</strong>g filter, F <strong>in</strong> (11.3),is a BEE. Recall from Chapter 7 that a stable full-state filter F is a BEE if <strong>and</strong>only if the transfer function (7.18) is stable.We first show that the smoother transfer function F s <strong>in</strong> (11.14) is analytic<strong>in</strong> the f<strong>in</strong>ite CRHP if the generat<strong>in</strong>g filter F is stable.Lemma 11.3.1. If F is stable then F s (s) <strong>in</strong> (11.14) is analytic at each f<strong>in</strong>itecomplex number s <strong>in</strong> the CRHP.Proof. S<strong>in</strong>ce F <strong>in</strong> (11.14) is stable, it only rema<strong>in</strong>s to show that H s is analytic<strong>in</strong> the f<strong>in</strong>ite CRHP. In fact, H s is an entire function, i.e., analytic for allf<strong>in</strong>ite complex frequency s. This is because the factor [e τ ^A ′− e −sτ I] <strong>in</strong>the expression for H s <strong>in</strong> (11.10) cancels the unstable poles of (sI + ^A ′ ) −1 .Indeed,e τ ^A ′ − e −sτ I = e −sτ [e τ(sI+ ^A ′) − I]∑ ∞= e −sτ (−τ) k(sI + ^A ′ ) k ,k!k=1


232 11. Extensions to SISO Smooth<strong>in</strong>g<strong>and</strong> soH s = C 1 V 1 (sI + ^A ′ ) −1 [e τ ^A ′ − e −sτ I]V 2∑ ∞= e −sτ (−τ) kC 1 V 1 (sI + ^A ′ ) k−1k!k=1is analytic <strong>in</strong> the whole f<strong>in</strong>ite complex plane. Hence the result follows.□We then have the follow<strong>in</strong>g result on analyticity of H ˜zv .Lemma 11.3.2. Consider the smoother given by (11.3), (11.5), for the system(11.1). Suppose that (11.3) is a BEE. Then the transfer function H ˜zv <strong>in</strong>(11.16) is analytic for each f<strong>in</strong>ite complex number s <strong>in</strong> the CRHP.Proof. Replac<strong>in</strong>g F s from (11.14) <strong>in</strong>to (11.16), we can writeH ˜zv (s) = e −sτ ˜H 1 (s) − H s (s) ˜H 2 (s) , (11.20)where˜H 1 H zv − C 1 FH yv ,˜H 2 (I − C 2 F)H yv .Us<strong>in</strong>g the expressions for H zv <strong>and</strong> H yv given <strong>in</strong> (10.16), yields˜H 1 (s) = C 1 [I − F(s)C 2 ](sI − A) −1 B + D 1 − C 1 F(s)D 2 ,˜H 2 (s) = C 2 [I − F(s)C 2 ](sI − A) −1 B + [I − C 2 F(s)]D 2 ,which are stable transfer functions s<strong>in</strong>ce F is a BEE (i.e., F <strong>and</strong> [I−F(s)C 2 ](sI−A) −1 are stable). S<strong>in</strong>ce H s is entire (as shown <strong>in</strong> the proof of Lemma 11.3.1),<strong>in</strong>spection of (11.20) then shows that H ˜zv <strong>in</strong> is analytic for each f<strong>in</strong>ite complexnumber s <strong>in</strong> the CRHP. This completes the proof.□The above results will be used to establish <strong>in</strong>terpolation constra<strong>in</strong>ts <strong>in</strong>the follow<strong>in</strong>g section.11.4 Interpolation Constra<strong>in</strong>tsIn this section we extend the result of §10.4 <strong>in</strong> Chapter 10 to the case ofsmooth<strong>in</strong>g. In the rema<strong>in</strong>der of the analysis of the smooth<strong>in</strong>g problem,we will assume that the orig<strong>in</strong>at<strong>in</strong>g filter is a BEE <strong>and</strong> we focus on scalarsystems.We first express P <strong>and</strong> M <strong>in</strong> (10.11) us<strong>in</strong>g the partition of Figure 10.1.S<strong>in</strong>ce H zv = G z <strong>and</strong> H yv = G y , we have thatP(s) = [G z (s) − e sτ F s (s)G y (s)]G −1z (s) ,M(s) = e sτ F s (s)G y (s)G −1z (s) .(11.21)


11.4 Interpolation Constra<strong>in</strong>ts 233As was the case for the filter<strong>in</strong>g <strong>and</strong> prediction problems, P <strong>and</strong> M are notnecessarily analytic <strong>in</strong> the f<strong>in</strong>ite ORHP. The follow<strong>in</strong>g result gives necessary<strong>and</strong> sufficient conditions for P <strong>and</strong> M to be analytic functions <strong>in</strong> the(f<strong>in</strong>ite) ORHP.Lemma 11.4.1. Consider the smooth<strong>in</strong>g sensitivities given <strong>in</strong> (11.21) <strong>and</strong>assume that F <strong>in</strong> (11.14) is a BEE. Then P <strong>and</strong> M are analytic <strong>in</strong> the (f<strong>in</strong>ite)ORHP if <strong>and</strong> only if one of the follow<strong>in</strong>g conditions hold:(i) G z is m<strong>in</strong>imum phase, or(ii) every ORHP zero of G z is also a zero of the product F s G y .Proof. Immediate from (11.21) <strong>and</strong> Lemma 11.3.1.□The follow<strong>in</strong>g lemma establishes the <strong>in</strong>terpolation constra<strong>in</strong>ts that P<strong>and</strong> M must satisfy at the ORHP poles <strong>and</strong> zeros of G y <strong>and</strong> G z .Lemma 11.4.2 (Interpolation Constra<strong>in</strong>ts). Assume that the orig<strong>in</strong>at<strong>in</strong>gfilter F <strong>in</strong> (11.14) is a BEE. Then P <strong>and</strong> M must satisfy the follow<strong>in</strong>g conditions.(i) If p ∈ ¢ + is a pole of G z , thenP(p) = 0, <strong>and</strong> M(p) = 1.(ii) If q ∈ ¢ + is a zero of G y that is not also zero of G z , thenP(q) = 1, <strong>and</strong> M(q) = 0.(iii) If q ∈ ¢ + is a zero of G z that is not also zero of F s G y , then P(s) <strong>and</strong>M(s) have a pole at s = q.Proof. Case (i) follows from (11.17), on not<strong>in</strong>g that H ˜zv (s) is analytic <strong>in</strong> theORHP by Lemma 11.3.2 <strong>and</strong> hence it cannot cancel any unstable pole ofG z . Case (ii) is immediate from (11.21) s<strong>in</strong>ce F s is stable by Lemma 11.3.1.F<strong>in</strong>ally, case (iii) follows from Lemma 11.4.1.□Recall from §8.1 <strong>in</strong> Chapter 8 that we denote by Z P <strong>and</strong> Z M the sets ofORHP zeros of P <strong>and</strong> M respectively, repeated accord<strong>in</strong>g to their multiplicities.Then, Lemma 11.4.2 identifies subsets of Z P <strong>and</strong> Z M . However,these zeros are <strong>in</strong> general only a proper subset of the complete set of zerosof the smooth<strong>in</strong>g sensitivities. In the case of prediction, we have shown<strong>in</strong> §10.4 of Chapter 10, that P may have ORHP zeros other than the polesof G z . Here, both sensitivities P <strong>and</strong> M may <strong>in</strong>corporate other ORHP zerosthat are <strong>in</strong>herent to the process of smooth<strong>in</strong>g. Indeed, it is easy to see thatP <strong>and</strong> M can be written asP = e sτ f 1d 1, <strong>and</strong> M = e sτ f 2d 2,


234 11. Extensions to SISO Smooth<strong>in</strong>gwhere d 1 <strong>and</strong> d 2 are polynomials <strong>in</strong> s, <strong>and</strong> f 1 <strong>and</strong> f 2 have the form of(A.87) <strong>in</strong> Appendix A, i.e.,f i (s) = g i 1(s)e −sτ + g i 2(s) , i = 1, 2.Therefore, from Lemma A.11.1 <strong>in</strong> Appendix A (on zeros of entire functions),we have that both P <strong>and</strong> M have an <strong>in</strong>f<strong>in</strong>ite sequence of zeros converg<strong>in</strong>gto <strong>in</strong>f<strong>in</strong>ity. Furthermore, accord<strong>in</strong>g to the relative orders of thepairs (g i 1 , gi 2), i = 1, 2, (<strong>and</strong> depend<strong>in</strong>g on the ratio between the highestorder coefficients if they have the same order), some of these zeros(possibly <strong>in</strong>f<strong>in</strong>itely many) may have positive real parts. A further analysis1 shows that P has generically <strong>in</strong>f<strong>in</strong>itely many zeros converg<strong>in</strong>g tothe ORHP, whilst the situation for M cannot, <strong>in</strong> general, be predicted.However, we remark that the complementary sensitivity M achieved bythe smoother derived from the Kalman filter (see Example 11.1.1) has <strong>in</strong>f<strong>in</strong>itelymany zeros converg<strong>in</strong>g to the OLHP.In summary, both sets of zeros Z P <strong>and</strong> Z M may have an <strong>in</strong>f<strong>in</strong>ite numberof elements — other than those identified <strong>in</strong> Lemma 11.4.2 — which arezeros of P <strong>and</strong> M <strong>in</strong>troduced by the smooth<strong>in</strong>g process.In the follow<strong>in</strong>g section, we will use the <strong>in</strong>formation of Lemma 11.4.2to derive Poisson <strong>in</strong>tegral constra<strong>in</strong>ts on the frequency responses of thesmooth<strong>in</strong>g sensitivities.11.5 Integral Constra<strong>in</strong>tsLet the Blaschke products correspond<strong>in</strong>g to the sets Z P <strong>and</strong> Z M be givenby (10.22) <strong>in</strong> Chapter 10, where now both n p <strong>and</strong> n q may be <strong>in</strong>f<strong>in</strong>ity. Also,similar to Chapter 10, we <strong>in</strong>troduce the sets of zeros:Z z {s ∈ ¢ + : G z (s) = 0 <strong>and</strong> F s (s)G y (s) ≠ 0} ,Z y {s ∈ ¢ + : G y (s) = 0 <strong>and</strong> G z (s) ≠ 0} ,Z 1/z {s ∈ ¢ + : G −1z (s) = 0} ,(11.22)<strong>and</strong> also denote the correspond<strong>in</strong>g Blaschke products by B z , B y <strong>and</strong> B 1/z .In order to restrict the behavior at <strong>in</strong>f<strong>in</strong>ity of the sensitivities, we requirethe follow<strong>in</strong>g assumption on the relative degree of the ratio G y /G z .Assumption 11.1. F <strong>in</strong> (11.14) is proper <strong>and</strong> RD G yG z≥ 0.Under Assumption 11.1, the rational factors of P <strong>and</strong> M <strong>in</strong> (11.21) areproper. It can then be seen by <strong>in</strong>spection of (11.21) <strong>and</strong> (11.14) that, underAssumption 11.1, the functions e −sτ P <strong>and</strong> e −sτ M will be bounded◦1 See Seron (1995) for more details.


11.5 Integral Constra<strong>in</strong>ts 235functions for s → ∞ <strong>in</strong> the ORHP. It thus follows from De Branges (1968,Theorem 8, p.20) that P <strong>and</strong> M <strong>in</strong> (11.21) can be factored asP = e sτ ˜PB P /B z ,M = e sτ ˜MB M /B z ,(11.23)where ˜P <strong>and</strong> ˜M have no zeros or poles <strong>in</strong> the ORHP. The convergence ofthe Blaschke products <strong>in</strong> (11.23) also follows from De Branges (1968), evenwhen n p = ∞ <strong>and</strong>/or n q = ∞.The smooth<strong>in</strong>g sensitivities satisfy the follow<strong>in</strong>g <strong>in</strong>tegral constra<strong>in</strong>ts.Theorem 11.5.1 (Poisson Integral for P). Suppose that F <strong>in</strong> (11.14) is aBEE <strong>and</strong> consider the smooth<strong>in</strong>g sensitivity, P, def<strong>in</strong>ed <strong>in</strong> (11.21). Let q =σ q + jω q , σ q > 0, be a zero of G y that is not also zero of G z (i.e., q ∈ Z y <strong>in</strong>(11.22)). Then, under Assumption 11.1,∫ ∞−∞σ qlog |P(jω)|σ 2 q + (ω q − ω) 2 dω = π log ∣ B−1P(q)∣ ∣∣ − π log ∣B −1z (q) ∣ −πσ q τ.(11.24)Proof. ˜P <strong>in</strong> (11.23) has no zeros or poles <strong>in</strong> the ORHP. Also, under Assumption11.1, e −sτ B z P is bounded <strong>in</strong> the ORHP, <strong>and</strong> it is not difficult tosee that the same is true for ˜P. Hence, we can apply the Poisson <strong>in</strong>tegralformula to log ˜P as <strong>in</strong> Theorem 3.3.1 of Chapter 3.□Theorem 11.5.2 (Poisson Integral for M). Suppose that F <strong>in</strong> (11.14) is aBEE <strong>and</strong> consider the smooth<strong>in</strong>g complementary sensitivity, M, def<strong>in</strong>ed<strong>in</strong> (11.21). Let p = σ p + jω p , σ p > 0, be a pole of G z (i.e., p ∈ Z 1/z <strong>in</strong>(11.22)). Then, under Assumption 11.1,∫ ∞−∞σ plog |M(jω)|σ 2 p + (ω p − ω) 2 dω = π log ∣ B−1M (p)∣ ∣∣ − π log ∣B −1z (p) ∣ −πσ p τ.(11.25)Proof. Similar to the proof of Theorem 11.5.1.□The follow<strong>in</strong>g corollary gives <strong>in</strong>tegral constra<strong>in</strong>ts that are <strong>in</strong>dependentof the estimator parameters.Corollary 11.5.3. Consider the smooth<strong>in</strong>g sensitivities, P <strong>and</strong> M, given <strong>in</strong>(11.21), <strong>and</strong> suppose that F <strong>in</strong> (11.14) is a BEE. Consider the sets of zerosdef<strong>in</strong>ed <strong>in</strong> (11.22), <strong>and</strong> assume further that every ORHP zero of G z is alsoa zero of G y (i.e., Z z = ∅). Then, under Assumption 11.1,


236 11. Extensions to SISO Smooth<strong>in</strong>g(i) if q = σ q + jω q ∈ Z y , we have that∫ ∞−∞σ q∣ ∣ ∣∣Blog |P(jω)|σ 2 q + (ω q − ω) 2 dω ≥ π log −1 ∣∣1/z (q) − πσq τ;(ii) if p = σ p + jω p ∈ Z 1/z , we have that∫ ∞−∞σ p(11.26)log |M(jω)|σ 2 p + (ω p − ω) 2 dω ≥ π log ∣ B−1y (p) ∣ − πσp τ.Proof. Same as the proof of Corollary 8.2.3 <strong>in</strong> Chapter 8.Note that a term proportional to the smooth<strong>in</strong>g lag appears with a negativesign on the RHS of the <strong>in</strong>tegral constra<strong>in</strong>ts for both sensitivities. Thiseffect is analyzed next.11.5.1 Effect of the Smooth<strong>in</strong>g LagAs we discussed for the prediction problem, the effect of smooth<strong>in</strong>g willbe more dramatic when the orig<strong>in</strong>al filter<strong>in</strong>g problem has both sensitivitiesconstra<strong>in</strong>ed. This is the case, for example, when G z is m<strong>in</strong>imum phasebut has an unstable pole, p say, <strong>and</strong> G y has a nonm<strong>in</strong>imum phase zero, qsay. Under these conditions, there is an <strong>in</strong>tegral constra<strong>in</strong> on P of the form(11.24) where σ q = Re(q) is fixed <strong>in</strong>dependently of τ. Analogously, an <strong>in</strong>tegralconstra<strong>in</strong>t of the form (11.25) can be stated on M, with σ p = Re(p)fixed for all τ.From the discussion at the end of §11.4, we can argue that P will generallyhave an <strong>in</strong>f<strong>in</strong>ite sequence of zeros converg<strong>in</strong>g to the ORHP. Also,we may assume that the <strong>in</strong>f<strong>in</strong>ite sequence of zeros of M converges to theOLHP (this is the case, for example, for the smoother derived from theKalman filter). Then, the term log |B −1P(q)| on the RHS of (11.24) will be enlargedby the <strong>in</strong>verse Blaschke product of the <strong>in</strong>f<strong>in</strong>ite zeros of P, whereasthe term log |B −1M(q)| on the RHS of (11.25) will probably not be larger thanthe correspond<strong>in</strong>g term for filter<strong>in</strong>g.The improvement due to smooth<strong>in</strong>g is now clear for M. Indeed, theterm −σ p τ, for σ p = Re(p) fixed for all τ, will cont<strong>in</strong>uously reduce thevalue of the weighted <strong>in</strong>tegral (11.25) as the smooth<strong>in</strong>g lag <strong>in</strong>creases, untilthe constra<strong>in</strong>t f<strong>in</strong>ally disappears. This means that the same weighted <strong>in</strong>tegralon the LHS (with the smooth<strong>in</strong>g complementary sensitivity M replac<strong>in</strong>gthe correspond<strong>in</strong>g one for filter<strong>in</strong>g) will achieve a lower value for smooth<strong>in</strong>g.The <strong>in</strong>tegral constra<strong>in</strong>t (11.24) for P also has the negative l<strong>in</strong>ear term<strong>in</strong> τ, −σ q τ, but the enlarged term log |B −1P(q)| is add<strong>in</strong>g to the value ofthe <strong>in</strong>tegral. Evidently, the lower bound given by the <strong>in</strong>equality <strong>in</strong> (11.26)decreases with the smooth<strong>in</strong>g lag. However, s<strong>in</strong>ce we cannot, <strong>in</strong> general,evaluate the Blaschke product, we do not have an exact estimate of howtight this bound may be.□


11.6 Sensitivity Improvement of the Optimal Smoother 23711.6 Sensitivity Improvement of the OptimalSmootherThe improvement that smooth<strong>in</strong>g represents upon filter<strong>in</strong>g is a well-knownfact <strong>in</strong> the filter<strong>in</strong>g literature. Indeed, it has been proven for the case of optimalsmoothers that the smooth<strong>in</strong>g error variance is always lower thanthe filter<strong>in</strong>g error variance (Anderson <strong>and</strong> Chirarattananon, 1971).As we have discussed <strong>in</strong> §11.5.1, there is also an improvement <strong>in</strong> sensitivityassociated with smooth<strong>in</strong>g, <strong>in</strong> the sense that design restrictionsimposed by ORHP poles <strong>and</strong> zeros of the plant relax with the smooth<strong>in</strong>glag. However, even for a large smooth<strong>in</strong>g lag, these results only show lessstr<strong>in</strong>gent design constra<strong>in</strong>ts, <strong>and</strong> do not guarantee the reduction of peaks<strong>in</strong> the values of |P| <strong>and</strong> |M| on the jω-axis.In this section, we give a more precise result concern<strong>in</strong>g improvement<strong>in</strong> sensitivity by smooth<strong>in</strong>g. We will see that for the class of smoothersbased on the Kalman filter, i.e., smoothers that are optimal <strong>in</strong> the leastsquaressense, the frequency responses of both smooth<strong>in</strong>g sensitivities P<strong>and</strong> M have no peak values greater than one provided that the smooth<strong>in</strong>glag is sufficiently large. In fact, we will see that a smooth<strong>in</strong>g lag of severaltimes the dom<strong>in</strong>ant time constant of the Kalman filter, essentially achievesall the possible improvement <strong>in</strong> sensitivity reduction. We then illustratethese results with two numerical examples.Consider then the smoother based on least squares presented <strong>in</strong> Example11.1.1, where the plant is given by (11.1) with D 1 = D 2 = 0. We willassume that the solution, Φ, of the Riccati equation (11.6) is positive def<strong>in</strong>ite.Further, we assume that the system evolution matrix A <strong>in</strong> (11.1) hasno eigenvalues on the imag<strong>in</strong>ary axis. Also, we take the process noise <strong>in</strong>crementalcovariance R = 1 for simplicity of notation.Let τ = ¯τ ≫ τ max ( ^A), where τ max ( ^A) is the dom<strong>in</strong>ant time constant 2of the Kalman filter. It is shown <strong>in</strong> Lemma D.0.3 <strong>in</strong> Appendix D that, forτ = ¯τ, the scalar smooth<strong>in</strong>g sensitivities <strong>in</strong> (11.21) are approximately givenbyM[ ¯τ](s) = QH ιv (−s)H ιv (s) ,P[ ¯τ](s) = 1 − QH ιv (−s)H ιv (s) ,(11.27)where H ιv is the transfer function from the process <strong>in</strong>put, v, to the <strong>in</strong>novationsprocess ι = y − C 2^x correspond<strong>in</strong>g to the Kalman filter (see (D.2) <strong>in</strong>Appendix D), <strong>and</strong> given byH ιv = H(sI − ^A) −1 B . (11.28)2 See the footnote on page 220.


238 11. Extensions to SISO Smooth<strong>in</strong>gThe follow<strong>in</strong>g result shows that the modulus of M[ ¯τ] <strong>and</strong> P[ ¯τ] on theimag<strong>in</strong>ary axis is bounded above by one.Theorem 11.6.1. Consider the expressions (11.27) of the smooth<strong>in</strong>g sensitivitiesfor a smooth<strong>in</strong>g lag τ = ¯τ ≫ τ max ( ^A). Assume that the evolutionmatrix A of the system to be estimated has no eigenvalues on the imag<strong>in</strong>aryaxis. Then‖M[ ¯τ]‖ ∞ < 1 ,‖P[ ¯τ]‖ ∞ = 1 .(11.29)Proof. We first show that ‖H ιv√ Q‖∞ < 1, where H ιv is given <strong>in</strong> (11.28).S<strong>in</strong>ce ^A has no imag<strong>in</strong>ary eigenvalues, we can use the result that‖H ιv√Q‖∞ < 1if <strong>and</strong> only if the Hamiltonian matrix[]^A BQBA H ′−C2 ′ C 2 − ^A ′has no eigenvalues on the imag<strong>in</strong>ary axis (Willems, 1971b).We thus compute|sI − A H | =∣ sI − ∣^A −BQB ′ ∣∣∣C2 ′ C 2 sI + ^A ′(11.30)[ ] [ ] [ ]∣ =I 0 sI − ^A −BQB ′ I 0 ∣∣∣∣ 0 Φ C2 ′ C 2 sI + ^A ′ 0 Φ −1=∣ sI − ^AΦC2 ′ C 2∣−BQB ′ Φ −1 ∣∣∣sI + Φ ^A ′ Φ −1=∣ sI − A + ΦC 2 ′ C 2ΦC2 ′ C 2∣−BQB ′ Φ −1 ∣∣∣sI − A − BQB ′ Φ −1=∣ sI − A 0ΦC2 ′ C 2 sI − A − BQB ′ Φ −1 + ΦC2 ′ C ∣2=∣ sI − A 0∣ ∣∣∣ΦC2 ′ C 2 sI + ΦA ′ Φ −1 ,where we have used the Riccati equation (11.6) <strong>in</strong> the form Φ ^A ′ Φ −1 =Φ(A ′ − C ′ 2 C 2Φ)Φ −1 = −(A + BQB ′ Φ −1 ), <strong>and</strong> applied some elementaryalgebra of determ<strong>in</strong>ants.


11.6 Sensitivity Improvement of the Optimal Smoother 239It follows that A H has no eigenvalues on the imag<strong>in</strong>ary axis s<strong>in</strong>ce theevolution matrix of the Kalman filter, ^A, is a stability matrix, <strong>and</strong> A has noeigenvalues on the imag<strong>in</strong>ary axis by assumption. Hence, ‖H ιv√ Q‖∞ < 1<strong>and</strong> thus, from (11.27),‖M[ ¯τ]‖ ∞ ≤ ‖H ιv√Q‖2∞ < 1.Now note that 0 ≤ Q|H ιv (jω)| 2 < 1 implies that P[ ¯τ] <strong>in</strong> (11.27) satisfies0 < P[ ¯τ](jω) ≤ 1. Thus‖P[ ¯τ]‖ ∞ = 1s<strong>in</strong>ce H ιv is strictly proper.Theorem 11.6.1 shows that the frequency responses of the smooth<strong>in</strong>gsensitivities will experience no peaks above one if the smooth<strong>in</strong>g lag ischosen several times greater than the dom<strong>in</strong>ant time constant of the filter.We emphasize that the result holds for unstable <strong>and</strong> nonm<strong>in</strong>imum phasesystems, which were shown <strong>in</strong> Chapter 8 to produce filter<strong>in</strong>g sensitivitieswith large peaks above one.The follow<strong>in</strong>g examples illustrate Theorem 11.6.1 for the cases of unstable<strong>and</strong> stable plants.Example 11.6.1 (Unstable Plant). We consider aga<strong>in</strong> the Kalman filtergiven <strong>in</strong> Example 8.4.1 of Chapter 8. Here we construct a smoother as <strong>in</strong>(11.7), derived from the Kalman filter obta<strong>in</strong>ed for the weights Q = 100<strong>and</strong> R = 1. We take the parameter a <strong>in</strong> (8.23) as a = 1.The frequency responses of |P| <strong>and</strong> |M| are shown <strong>in</strong> Figures 11.2 <strong>and</strong>11.3, respectively, for three different values of the smooth<strong>in</strong>g lag τ. Notethat the filter<strong>in</strong>g sensitivities correspond to τ = 0.□665544|P(jω)|3|M(jω)|3τ=02 τ=02τ=0.51 τ=0.51τ=3τ=3 010 −4 10 −2 10 0 10 2 10 4ω010 −4 10 −2 10 0ω10 2 10 4FIGURE 11.2. |P| achieved by theKalman filter-based smoother for theunstable plant of Example 11.6.1.FIGURE 11.3. |M| achieved by theKalman filter-based smoother for theunstable plant of Example 11.6.1.These plots clearly <strong>in</strong>dicate that the process of smooth<strong>in</strong>g reduces theconstra<strong>in</strong>ts on the frequency responses of both sensitivities, which tend to“ideal” shapes as we <strong>in</strong>crease the smooth<strong>in</strong>g lag.◦


240 11. Extensions to SISO Smooth<strong>in</strong>gExample 11.6.2 (Stable Plant). Consider aga<strong>in</strong> the stable plant used <strong>in</strong>Example 8.4.2 of Chapter 8 <strong>and</strong> revisited <strong>in</strong> §10.6.2 of Chapter 10. For thisplant, we construct a smoother as <strong>in</strong> (11.7). The generat<strong>in</strong>g Kalman filteris aga<strong>in</strong> designed for an aggregated plant that <strong>in</strong>cludes the disturbancemodel (8.23) with the parameter a = 1. The weights are chosen as Q = 100<strong>and</strong> R = 0.1.The plots of Figures 11.4 <strong>and</strong> 11.5 show the frequency responses of P<strong>and</strong> M for three values of τ. These figures show essentially the same effectof an improved frequency response as for the unstable system <strong>in</strong> Example11.6.1.Note that the poles of the generat<strong>in</strong>g Kalman filter for this exampleare −2.1035 ± j2.5701 <strong>and</strong> −1.8335, which implies that τ = 0.8 is approximately0.9 times the filter dom<strong>in</strong>ant time constant, taken as τ f ≈1/1.8335 ≈ 0.55, whilst τ = 2 is approximately 3.6 times τ f . As seen<strong>in</strong> §11.6, tak<strong>in</strong>g a smooth<strong>in</strong>g lag of several times (approximately 5 timesaccord<strong>in</strong>g to Anderson (1969)) the dom<strong>in</strong>ant time constant of the filter,gives all the improvement possible us<strong>in</strong>g smooth<strong>in</strong>g. We can see that thisphenomenon of “saturation” of improvement is also present <strong>in</strong> the frequencyresponses of the error sensitivities.221.51.5|P(jω)|1τ=0|M(jω)|1τ=3τ=0.50.5τ=0.50.5τ=0τ=3010 −4 10 −2 10 0ω10 2 10 4FIGURE 11.4. |P| achieved by theKalman filter-based smoother for thestable plant of Example 11.6.2.010 −4 10 −2 10 0ω10 2 10 4FIGURE 11.5. |M| achieved by theKalman filter-based smoother for thestable plant of Example 11.6.2.◦11.7 SummaryThis chapter has extended the sensitivity analysis developed <strong>in</strong> Chapter 8for the problem of filter<strong>in</strong>g with bounded error estimators to the case offixed-lag smooth<strong>in</strong>g. The results <strong>in</strong>dicate that, <strong>in</strong> contrast to the problemof prediction, the constra<strong>in</strong>ts <strong>in</strong> sensitivity are reduced as the smooth<strong>in</strong>glag <strong>in</strong>creases.


11.7 Summary 241In particular, notice that if one considers the estimation horizon to be eitherpositive (for prediction) or negative (for smooth<strong>in</strong>g), then the resultsare seen to be consistent. This is clear from the structural relations for filter<strong>in</strong>g,given by (7.7), for prediction, given by (10.13) <strong>and</strong> (10.10), <strong>and</strong> forsmooth<strong>in</strong>g, given by (11.19) <strong>and</strong> (11.15). These relations are recalled <strong>in</strong> Table11.1.Filter<strong>in</strong>gPredictionSmooth<strong>in</strong>gH ˜zv + FH yv = H zvH ˜zv + F p H yv = e sτ H zvH ˜zv + F s H yv = e −sτ H zvTABLE 11.1. Structural relations <strong>in</strong> filter<strong>in</strong>g, prediction <strong>and</strong> smooth<strong>in</strong>g.The structural relations shown <strong>in</strong> Table 11.1 are the basis for the def<strong>in</strong>itionof the correspond<strong>in</strong>g sensitivities. It should be noticed, however, thatthe nature of the error transfer function H ˜zv <strong>in</strong> prediction <strong>and</strong> smooth<strong>in</strong>gis not dual.Notes <strong>and</strong> ReferencesThe sensitivity results of this chapter are based on Seron (1995). Background <strong>in</strong>formationon smooth<strong>in</strong>g was collected from Kailath (1981), Kailath <strong>and</strong> Frost (1968)<strong>and</strong> Anderson <strong>and</strong> Doan (1977).


Part IV<strong>Limitations</strong> <strong>in</strong> Nonl<strong>in</strong>ear<strong>Control</strong> <strong>and</strong> Filter<strong>in</strong>g


12Nonl<strong>in</strong>ear OperatorsIn Chapters 13 <strong>and</strong> 14 we will <strong>in</strong>vestigate fundamental constra<strong>in</strong>ts thathold for the problems of nonl<strong>in</strong>ear feedback control <strong>and</strong> nonl<strong>in</strong>ear filter<strong>in</strong>g.The general framework used is that of <strong>in</strong>put-output nonl<strong>in</strong>ear operatorsact<strong>in</strong>g on l<strong>in</strong>ear signal spaces. With<strong>in</strong> this framework, the l<strong>in</strong>earconcepts of nonm<strong>in</strong>imum phase zeros <strong>and</strong> unstable poles of transfer functionsare easily h<strong>and</strong>led us<strong>in</strong>g ideas of defect <strong>in</strong> the doma<strong>in</strong> <strong>and</strong> range ofthe nonl<strong>in</strong>ear operators. Some properties of this approach essential to ouranalysis are reviewed <strong>in</strong> this chapter.12.1 Nonl<strong>in</strong>ear OperatorsMuch of the research effort to date <strong>in</strong> nonl<strong>in</strong>ear control theory has focusedon controller design. This area has experienced substantial progress, giv<strong>in</strong>grise to numerous systematic procedures for controller design applicableto different classes of nonl<strong>in</strong>ear systems. For a recent survey on thistopic see Coron et al. (1995). The theory for nonl<strong>in</strong>ear observers is more<strong>in</strong>cipient than that for nonl<strong>in</strong>ear control, though some techniques for observerconstruction have become available <strong>in</strong> the recent years. Most ofthese nonl<strong>in</strong>ear controller <strong>and</strong> observer design methods use a state spacedescription of the system.The state space approach provides satisfactory answers when deal<strong>in</strong>gwith synthesis problems, yet it does not appear suitable for a theory oflimitations imposed by the structure of the feedback loop. For this prob-


246 12. Nonl<strong>in</strong>ear Operatorslem, a more natural framework is provided by the <strong>in</strong>put-output operatorapproach. This approach received considerable attention dur<strong>in</strong>g the 60’s<strong>and</strong> 70’s (S<strong>and</strong>berg, 1965; Zames, 1966a,b; Willems, 1971a; Desoer <strong>and</strong>Vidyasagar, 1975; Hill <strong>and</strong> Moylan, 1980). In this <strong>in</strong>put-output framework,systems are represented by nonl<strong>in</strong>ear operators act<strong>in</strong>g on normed signalspaces. Two important measures related to these operators are the maximumga<strong>in</strong> 1 over the <strong>in</strong>put signal space, <strong>and</strong> the maximum <strong>in</strong>cremental ga<strong>in</strong>,also called Lipschitz constant or Lipschitz norm. In the l<strong>in</strong>ear case, both measurescollapse <strong>in</strong>to one. Moreover, for l<strong>in</strong>ear time-<strong>in</strong>variant systems (LTI)on L 2 , 2 they co<strong>in</strong>cide with the H ∞ norm of the transfer function. Thus,both the L 2 -ga<strong>in</strong> <strong>and</strong> the Lipschitz norm are possible nonl<strong>in</strong>ear extensionsof the H ∞ norm of l<strong>in</strong>ear systems.We next review properties of nonl<strong>in</strong>ear operators act<strong>in</strong>g on l<strong>in</strong>ear, Banach<strong>and</strong> Hilbert spaces.12.1.1 Nonl<strong>in</strong>ear Operators on a L<strong>in</strong>ear SpaceLet X be a l<strong>in</strong>ear space <strong>and</strong> let H be a nonl<strong>in</strong>ear operator on X, i.e., a mapp<strong>in</strong>gbetween its doma<strong>in</strong> D(H) ⊂ X <strong>in</strong>to X. The doma<strong>in</strong> <strong>and</strong> range of H aredef<strong>in</strong>ed as follows:D(H) {x ∈ X : Hx ∈ X},R(H) {Hx : x ∈ D(H)}.(12.1)If H 1 <strong>and</strong> H 2 are nonl<strong>in</strong>ear operators on X <strong>and</strong> D(H 1 ) ∩ D(H 2 ) ≠ ∅, theaddition H 1 + H 2 : D(H 1 + H 2 ) → X is def<strong>in</strong>ed by(H 1 + H 2 )x = H 1 x + H 2 x, ∀x ∈ D(H 1 + H 2 ) = D(H 1 ) ∩ D(H 2 ).If R(H 2 ) ∩ D(H 1 ) ≠ ∅, the composition H 1 H 2 : D(H 1 H 2 ) → X is def<strong>in</strong>ed by(H 1 H 2 )x = H 1 (H 2 x), ∀x ∈ D(H 1 H 2 ) = H −12 (D(H 1)).Here, the notation H −1 (U), for a set U ⊂ X, <strong>in</strong>dicates the pre-image set ofU through H, i.e.,H −1 (U) = {x ∈ D(H) : Hx ∈ U}.Similar to the notation used for the l<strong>in</strong>ear case, the symbol H ba will beused to represent an open-loop nonl<strong>in</strong>ear operator mapp<strong>in</strong>g signal a tosignal b. Also, H ba will st<strong>and</strong> for the total nonl<strong>in</strong>ear operator mapp<strong>in</strong>gsignal a to signal b.1 I.e., the maximum ratio of the norm of the output signal to the norm of the <strong>in</strong>put signal.2 The def<strong>in</strong>ition of the signal space L 2 is recalled <strong>in</strong> §12.1.3.


12.1 Nonl<strong>in</strong>ear Operators 24712.1.2 Nonl<strong>in</strong>ear Operators on a Banach SpaceLet X be a Banach space. We say that the operator H is stable if its doma<strong>in</strong>is X <strong>and</strong> we say that H is unstable if its doma<strong>in</strong> is a strict subset of X. Thisdef<strong>in</strong>ition of operator stability/<strong>in</strong>stability can be <strong>in</strong>terpreted as the usualsystem stability/<strong>in</strong>stability when X is taken to be a space of “physicallymean<strong>in</strong>gful” signals, such as L 2 or its isomorphic equivalent H 2 .We say that H is nonm<strong>in</strong>imum phase if the closure of its range is a strictsubset of X.Example 12.1.1. To motivate the concept of nonm<strong>in</strong>imum phase nonl<strong>in</strong>earoperator, consider the simple case of a scalar, LTI operator on H 2 , representedby a transfer function, H, say. Assume further that H is stable <strong>and</strong>proper. Then, if H has a nonm<strong>in</strong>imum phase zero, the range of such anoperator is the set of signals <strong>in</strong> H 2 that have a zero at the same frequency,which is not dense <strong>in</strong> H 2 .◦The closure of a set U ⊂ X will be denoted by cl U. We denote byLip(D, X) the class of all operators H : D(H) = D ⊂ X → X such that‖H‖ L sup {|Hx − Hy|/|x − y| : x, y ∈ D, x ≠ y} < ∞, (12.2)where | · | is the norm <strong>in</strong> X. A member H of Lip(D, X) is called a Lipschitzoperator <strong>and</strong> ‖H‖ L is the Lipschitz constant, Lipschitz ga<strong>in</strong>, or <strong>in</strong>crementalga<strong>in</strong> of H. When D = X, the class will be denoted by Lip(X). Note thatmembers of Lip(X) are stable operators, s<strong>in</strong>ce their doma<strong>in</strong> is X.It is easy to see, that if H is a l<strong>in</strong>ear operator, then ‖H‖ L <strong>in</strong> (12.2) reducesto the operator norm <strong>in</strong>duced by the norm <strong>in</strong> X. Indeed, it is claimed <strong>in</strong>Zarantonello (1967) that the ord<strong>in</strong>ary norm for l<strong>in</strong>ear operators naturallyextends to the Lipschitz norm <strong>in</strong> the nonl<strong>in</strong>ear case. To be precise, the Lipschitznorm is <strong>in</strong> fact a semi-norm, thus the space (Lip(X), ‖ · ‖ L ) is a sem<strong>in</strong>ormedl<strong>in</strong>ear space.Lip(X) is closed under operator composition. Moreover, if H 1 , H 2 ∈Lip(X) then the composition H 1 H 2 ∈ Lip(X) <strong>and</strong> the sub-multiplicativeproperty ‖H 1 H 2 ‖ L ≤ ‖H 1 ‖ L ‖H 2 ‖ L holds for the Lipschitz norm.A member H of Lip(X) is said to be <strong>in</strong>vertible <strong>in</strong> Lip(X) if there is anoperator H i ∈ Lip(X) such that HH i = H i H = I. We denote H i = H −1 .Clearly, it is necessary that R(H) = X for H to be <strong>in</strong>vertible <strong>in</strong> Lip(X) <strong>and</strong>thus a nonm<strong>in</strong>imum phase operator is not <strong>in</strong>vertible <strong>in</strong> Lip(X). The follow<strong>in</strong>gresult gives a condition for the <strong>in</strong>vertibility of members of Lip(X).Lemma 12.1.1. Let X be a Banach space <strong>and</strong> let H ∈ Lip(X). Supposethat ‖H‖ L < 1. Then I − H is <strong>in</strong>vertible <strong>in</strong> Lip(X) <strong>and</strong> ‖(I − H) −1 ‖ L ≤(1 − ‖H‖ L ) −1 .Proof. See Mart<strong>in</strong> Jr. (1976, Theorem 2.2, p.66).Lemma 12.1.1 appears to have been stated for the first time <strong>in</strong> the controlliterature by Zames 1966a; 1966b, although it was probably already known□


248 12. Nonl<strong>in</strong>ear Operatorsamong mathematicians. This important result is the basis of the proof ofthe small ga<strong>in</strong> theorem <strong>in</strong> its Lipschitz version (S<strong>and</strong>berg, 1965; Zames,1966a,b).d + e❜ ✲ ✐✲+ ✻bH∆a✛FIGURE 12.1. Basic perturbation model.In §13.5 we will address the issue of stability robustness <strong>and</strong> refer tothe basic perturbation model shown <strong>in</strong> Figure 12.1, where H : D(H) → X<strong>and</strong> ∆ : D(∆) → X are nonl<strong>in</strong>ear operators. We say that the feedbackloop of Figure 12.1 is stable if e, a, b ∈ X whenever d ∈ X. If H, ∆ ∈Lip(X), we also say that the feedback loop of Figure 12.1 is Lipschitz stableif the operator H ed ∈ Lip(X). It follows from Lemma 12.1.1 that, providedH, ∆ ∈ Lip(X), the loop <strong>in</strong> Figure 12.1 is Lipschitz stable if ‖∆H‖ L < 1.Moreover,‖H ed ‖ L = ‖(I − ∆H) −1 ‖ L ≤ (1 − ‖∆H‖ L ) −1 (12.3)<strong>and</strong>‖H ad ‖ L = ‖H(I − ∆H) −1 ‖ L ≤ ‖H‖(1 − ‖∆H‖ L ) −1 . (12.4)12.1.3 Nonl<strong>in</strong>ear Operators on a Hilbert SpaceThis section reviews some notation <strong>and</strong> term<strong>in</strong>ology that will be used <strong>in</strong>§13.4 of Chapter 13. Let L 2 be the st<strong>and</strong>ard Hilbert space of real-valuedmeasurable square-<strong>in</strong>tegrable functions def<strong>in</strong>ed on ¡ + (the positive reall<strong>in</strong>e) with norm ‖ · ‖ L2 <strong>and</strong> <strong>in</strong>ner product < f, g > L2 . For f ∈ L 2 , f denotesits Fourier transform. Let Ω ⊂ ¡ be a set with nonzero measure. For f ∈L 2 , we def<strong>in</strong>e ‖f‖ Ω as‖f‖ Ω ( ∫1/2 1|f(jω)| dω) 2 .2π ΩNote that ‖ · ‖ Ω represents the signal-energy distributed over the frequencyrange Ω (Shamma, 1991). A sequence {f i } ⊂ L 2 is said to weaklyconverge to f 0 ∈ L 2 if for all g ∈ L 2 ,limi< f i , g > L2 =< f 0 , g > L2 .The L 2 -weak-closure of a set U is denoted wk−cl U.


12.2 Nonl<strong>in</strong>ear Cancelations 249A mapp<strong>in</strong>g H : L 2 → L 2 is an I/O operator if it is unbiased (i.e., H 0 = 0)<strong>and</strong> causal. The doma<strong>in</strong> <strong>and</strong> range of an I/O operator H are def<strong>in</strong>ed as <strong>in</strong>(12.1) with X replaced with L 2 .✲K✲uG❜ d✲❄+− ✐e+ y❄ w✐✛❜+FIGURE 12.2. Feedback <strong>Control</strong> Loop.The feedback system shown <strong>in</strong> Figure 12.2 is said to be stable if u <strong>and</strong> ebelong to L 2 whenever d <strong>and</strong> w belong to L 2 ; <strong>in</strong> other words, the nonl<strong>in</strong>earoperators that map d <strong>and</strong> w <strong>in</strong>to u <strong>and</strong> e are stable nonl<strong>in</strong>ear operators.In this case, the controller K is said to stabilize the plant G.12.2 Nonl<strong>in</strong>ear CancelationsFor the problem of nonl<strong>in</strong>ear feedback control, to be discussed <strong>in</strong> Chapter13, stability of closed-loop operators is achieved via feedback, <strong>and</strong>hence it is not necessary to address the issue of unstable “nonl<strong>in</strong>ear zeropole(pole-zero) cancelation”. Therefore, the background on nonl<strong>in</strong>ear operatorsgiven so far is enough to treat this problem. By way of contrast,the filter<strong>in</strong>g problem is of open-loop nature 3 , <strong>and</strong> therefore, if the plant isunstable, the stability of the estimation error operator (see (14.10) <strong>in</strong> Chapter14) has to be achieved by some nonl<strong>in</strong>ear extension of the l<strong>in</strong>ear notionof unstable cancelations.To ga<strong>in</strong> <strong>in</strong>sight <strong>in</strong>to what a possible def<strong>in</strong>ition of unstable “nonl<strong>in</strong>earzero-pole cancelation” may be, let us <strong>in</strong>formally discuss the case of a scalar,LTI operator on H 2 , represented by a transfer function, H 1 , say. Assumefurther that H 1 is stable <strong>and</strong> proper. Then, if H 1 has a nonm<strong>in</strong>imum phasezero, we have discussed <strong>in</strong> Example 12.1.1 that the closure of the range ofH 1 does not cover H 2 , <strong>and</strong> this has motivated the def<strong>in</strong>ition of nonm<strong>in</strong>imumphase nonl<strong>in</strong>ear operator used <strong>in</strong> §12.1.2. However, note also thatH 1 has the additional property that there exist signals outside H 2 (fromits orthogonal complement, H ⊥ 2 , <strong>in</strong> fact) that are mapped by H 1 <strong>in</strong>to H 2 ,(i.e., signals that have poles at the nonm<strong>in</strong>imum phase zero of H 1 ). Now,assume that H 1 is multiplied by (composed with) another transfer function,H 2 say, that has an unstable pole at the same frequency where H 1 hasa nonm<strong>in</strong>imum phase zero. If <strong>in</strong>stead of restrict<strong>in</strong>g the doma<strong>in</strong> of H 2 (ac-3 See comments at the end of §7.2.2 <strong>in</strong> Chapter 7.


250 12. Nonl<strong>in</strong>ear Operatorscord<strong>in</strong>g to the def<strong>in</strong>ition of unstable operator given <strong>in</strong> §12.1.2) we let H 2act on all of H 2 , then bounded signals will be mapped <strong>in</strong>to unboundedsignals. However, when composed with H 1 , these unbounded signals areprecisely those that H 1 maps <strong>in</strong>to H 2 , giv<strong>in</strong>g a stable product H 1 H 2 .From the above discussion, it is clear that the framework used <strong>in</strong> §12.1is <strong>in</strong>sufficient to h<strong>and</strong>le “nonl<strong>in</strong>ear cancelations”, s<strong>in</strong>ce signals that areunbounded are not formally def<strong>in</strong>ed. The use of extended spaces, whereunbounded signals lie, becomes then necessary.12.2.1 Nonl<strong>in</strong>ear Operators on Extended Banach SpacesWe will consider nonl<strong>in</strong>ear operators def<strong>in</strong>ed on an extended l<strong>in</strong>ear spaceX e ⊃ X, where X is a Banach space. The way <strong>in</strong> which X e “extends” X isarbitrary. The doma<strong>in</strong> <strong>and</strong> range of the operator H : X e → X e are def<strong>in</strong>edas <strong>in</strong> (12.1) <strong>in</strong> §12.1.1. Also, the def<strong>in</strong>itions of stable <strong>and</strong> unstable operatorsare the same used <strong>in</strong> §12.1.1.Given a set D ⊂ X e , the symbol D e will denote its complement <strong>in</strong>X e , i.e.,D e {x ∈ X e : x /∈ D}.Similarly, for a set D ⊂ X, D will <strong>in</strong>dicate its complement <strong>in</strong> X, i.e.,D {x ∈ X : x /∈ D}.The set X e , i.e., the complement of X <strong>in</strong> X e , will be considered to conta<strong>in</strong>“unbounded” signals.In this context of extended spaces, we say that H : X e → X e is nonm<strong>in</strong>imumphase (NMP) if the follow<strong>in</strong>g two conditions hold:(i) (defect <strong>in</strong> range) the closure of the range of H, cl R(H), is a strictsubset of X.(ii) (excess <strong>in</strong> doma<strong>in</strong>) the extended doma<strong>in</strong> of H, def<strong>in</strong>ed as the setis not empty.E(H) {x ∈ X e : Hx ∈ X}Property (ii) implies that a nonempty set of unbounded signals are mapped<strong>in</strong>to bounded images by H. Note that this corresponds to the property ofLTI systems previously discussed.It may be possible for a nonl<strong>in</strong>ear operator to satisfy only one of theproperties of NMP operators. If (i) holds, we will say that the operator hasthe the defect-<strong>in</strong>-range property, <strong>and</strong> that it is of D-NMP type. Alternatively,if (ii) holds, we will say that the operator has the excess-<strong>in</strong>-doma<strong>in</strong> property,<strong>and</strong> that it is of E-NMP type. Figure 12.3 illustrates a NMP operator <strong>in</strong> thecontext of extended spaces, i.e., with both the defect-<strong>in</strong>-range <strong>and</strong> excess<strong>in</strong>-doma<strong>in</strong>properties, as just described.


12.2 Nonl<strong>in</strong>ear Cancelations 251E(H)HD(H)HR(H)XXX eFIGURE 12.3. Set <strong>in</strong>terpretation of a NMP nonl<strong>in</strong>ear operator.In order to allow for cancelations, we need to adjust the def<strong>in</strong>ition ofdoma<strong>in</strong> of the composition of nonl<strong>in</strong>ear operators to this sett<strong>in</strong>g of extendedspaces. Let H 1 <strong>and</strong> H 2 be nonl<strong>in</strong>ear operators on X e such thatR(H 2 ) ∩ D(H 1 ) ≠ ∅. Then the doma<strong>in</strong> of the composition H 1 H 2 is def<strong>in</strong>edaswhereD(H 1 H 2 ) = H −12 (D(H 1)) ∪ A(H 1 H 2 ), (12.5)H −12 (D(H 1)) = {x ∈ D(H 2 ) : H 2 x ∈ D(H 1 )},<strong>and</strong> A(H 1 H 2 ) is the added doma<strong>in</strong> of the composition, given byA(H 1 H 2 ) {x ∈ D(H 2 ) : H 2 x ∈ E(H 1 )}. (12.6)Note that A(H 1 H 2 ) ≠ ∅ only if H 2 is unstable <strong>and</strong> H 1 is E-NMP. This isbecause then there exists a signal <strong>in</strong> the complement of the doma<strong>in</strong> of H 2that is mapped by H 2 <strong>in</strong>to E(H 1 ), which is then nonempty. On the otherh<strong>and</strong>, if H 1 is E-NMP, then E(H 1 ) ≠ ∅ <strong>and</strong> then A(H 1 H 2 ) may be nonemptyif H 2 is unstable.When the set A(H 1 H 2 ) is nonempty, we say that there is an unstable nonl<strong>in</strong>earzero-pole cancelation <strong>in</strong> the composition H 1 H 2 . The nonl<strong>in</strong>ear zeropolecancelation idea is depicted <strong>in</strong> Figure 12.4.Note that the def<strong>in</strong>ition of composition doma<strong>in</strong> used <strong>in</strong> §12.1.1, consistedonly of the first term of the set union on the RHS of (12.5), i.e., theset H −12 (D(H 1)). This first term is different from D(H 2 ) only if H 1 is unstable.When H 1 is unstable <strong>and</strong> H −12 (D(H 1)) = D(H 2 ), we say that thereis an unstable nonl<strong>in</strong>ear pole-zero cancelation <strong>in</strong> the composition H 1 H 2 .The concepts considered <strong>in</strong> this subsection will be used <strong>in</strong> Chapter 14 toaddress the issue of stability of the estimation error <strong>in</strong> nonl<strong>in</strong>ear filter<strong>in</strong>g.


252 12. Nonl<strong>in</strong>ear OperatorsH 2H 1E(H 1 )H 1A(H 1 H 2 )XD(H 2 )H2D(H 1 )XX eXFIGURE 12.4. Set <strong>in</strong>terpretation of unstable nonl<strong>in</strong>ear zero-pole cancelation.12.3 SummaryWe have presented background on nonl<strong>in</strong>ear operators, necessary for thedevelopments <strong>in</strong> the next two chapters. In particular, we have describedproperties of nonl<strong>in</strong>ear operators as mapp<strong>in</strong>gs on Banach or Hilbert spaces,with emphasis on Lipschitz operators. A set-based framework has beenprovided to h<strong>and</strong>le nonl<strong>in</strong>ear extensions of the l<strong>in</strong>ear concepts of nonm<strong>in</strong>imumphase systems, unstable systems, <strong>and</strong> unstable zero-pole (polezero)cancelations.Notes <strong>and</strong> ReferencesThe material of §12.1 is ma<strong>in</strong>ly taken from Mart<strong>in</strong> Jr. (1976). §12.1.2 <strong>and</strong> §12.1.3 alsohave <strong>in</strong>put from Doyle et al. (1993) <strong>and</strong> Shamma (1991). The set-based frameworkof §12.2 is taken from Seron (1995).The def<strong>in</strong>ition of unstable operator as one hav<strong>in</strong>g a defect <strong>in</strong> its doma<strong>in</strong> is used<strong>in</strong> Doyle et al. (1993) <strong>and</strong> it is common <strong>in</strong> references deal<strong>in</strong>g with the <strong>in</strong>put-outputapproach to feedback control (e.g., Desoer <strong>and</strong> Vidyasagar, 1975). The concept ofnonm<strong>in</strong>imum phaseness that we use here is similar to the one given <strong>in</strong> Shamma(1991).


13Nonl<strong>in</strong>ear <strong>Control</strong>This chapter analyzes performance limitations <strong>and</strong> stability robustness ofthe unity feedback configuration of Figure 12.2 considered <strong>in</strong> Chapter 12.We use the material on nonl<strong>in</strong>ear operator theory developed <strong>in</strong> §12.1.1 ofChapter 12.13.1 Review of L<strong>in</strong>ear Sensitivity RelationsFor convenience, we review here the central result on l<strong>in</strong>ear sensitivitylimitations that we will extend to the nonl<strong>in</strong>ear case <strong>in</strong> §13.3. This resultshows that the <strong>in</strong>f<strong>in</strong>ity norm of S is bounded below by one for nonm<strong>in</strong>imumphase systems <strong>and</strong> the <strong>in</strong>f<strong>in</strong>ity norm of T is bounded below by onefor open-loop unstable systems.We use the notions of zeros <strong>and</strong> poles of multivariable systems <strong>in</strong>troduced<strong>in</strong> §2.1.1 of Chapter 2. The symbol | · | denotes here the Euclideannorm <strong>in</strong> ¢ n . If M is a matrix <strong>in</strong> ¢ n×n , ‖M‖ 2 denotes its matrix norm <strong>in</strong>ducedby the Euclidean vector norm.The follow<strong>in</strong>g result is well-known for l<strong>in</strong>ear systems. 1Theorem 13.1.1. Let L be the transfer matrix of a LTI open-loop system.Assume that the closed-loop sensitivity S = (I + L) −1 <strong>and</strong> complementarysensitivity T = L(I + L) −1 are stable transfer functions. Then:1 It can be <strong>in</strong>ferred, for example, from the <strong>in</strong>equality (4.34) <strong>in</strong> <strong>and</strong> 4.


254 13. Nonl<strong>in</strong>ear <strong>Control</strong>(i) If L is nonm<strong>in</strong>imum phase, then ‖S‖ ∞ ≥ 1.(ii) If L is unstable, then ‖T‖ ∞ ≥ 1.Proof. The result can be derived from Chen (1995, Corollary 4.2), but wegive here a simpler proof exploit<strong>in</strong>g the fact that the H ∞ norm is the operatornorm on H 2 , the space of Laplace transforms of signals <strong>in</strong> L 2 . 2(i) Let L(s) have a zero at s = q <strong>in</strong> the ORHP with output directionΨ ∈ ¢ n , such that Ψ ∗ Ψ = 1; i.e., Ψ ∗ L(q) = 0. Then Ψ ∗ S(q) = Ψ ∗ (cf.(4.5) <strong>in</strong> Chapter 4). The <strong>in</strong>f<strong>in</strong>ity norm of S satisfies‖S‖ ∞ =sup ‖S(s)‖ 2Re(s)>0= supRe(s)>0≥ |Ψ ∗ S(q)|= 1.max |η ∗ S(s)|η∈n|η|=1(ii) Let L(s) have a pole at s = p <strong>in</strong> the ORHP. Then, us<strong>in</strong>g (4.4) <strong>in</strong> Chapter4, we have that Φ ∗ T(p) = Φ ∗ <strong>and</strong> the proof follows as <strong>in</strong> (i).□The proof of Theorem 13.1.1 uses the fact that the operator norm onan <strong>in</strong>f<strong>in</strong>ite dimensional space can be bounded below by the matrix normon ¢ n with the Euclidean norm. Then, the zeros <strong>and</strong> poles <strong>in</strong> the ORHPof the open-loop system give particular <strong>in</strong>terpolation values of the closedloopoperators, which provide lower bounds on the matrix <strong>in</strong>duced norm.Although this property generally does not hold for nonl<strong>in</strong>ear operators,we will see that nonm<strong>in</strong>imum phase <strong>and</strong> unstable open-loop behavior, asdef<strong>in</strong>ed <strong>in</strong> §12.1.2, still represent constra<strong>in</strong>ts on the closed-loop system.13.2 A Complementarity Constra<strong>in</strong>tConsider the feedback <strong>in</strong>terconnection of plant G <strong>and</strong> controller K as displayed<strong>in</strong> Figure 13.1.We def<strong>in</strong>e the nonl<strong>in</strong>ear sensitivity operator, S, <strong>and</strong> the nonl<strong>in</strong>ear complementarysensitivity operator, T, asS = H yd | (w=0) ,T = −H yw | (d=0) ,(13.1)2 In fact, H 2 (for signals) is the Banach space of functions x : → n , which are analytic<strong>in</strong> the ORHP, <strong>and</strong> satisfy sup σ>0 (2π) −1 ∫ ∞−∞ |x(σ + jω)|2 dω < ∞.


13.2 A Complementarity Constra<strong>in</strong>t 255✲K✲uG✲−❜ d❄+✐e+ y❄ w✐✛❜+FIGURE 13.1. Feedback <strong>Control</strong> Loop.where the notation H ba | (c=0) st<strong>and</strong>s for the total map from signal a tosignal b when signal c is identically zero. In words, S is the mapp<strong>in</strong>g betweenoutput disturbance <strong>and</strong> system output <strong>in</strong> the absence of the sensornoise, <strong>and</strong> T is the mapp<strong>in</strong>g between sensor noise <strong>and</strong> system output <strong>in</strong>the absence of output disturbances.We will consider S <strong>and</strong> T as nonl<strong>in</strong>ear operators on some l<strong>in</strong>ear spaceX, i.e., S : D(S) ⊂ X → X <strong>and</strong> T : D(T) ⊂ X → X. In fact, the doma<strong>in</strong>s of S<strong>and</strong> T are the same, as shown <strong>in</strong> the follow<strong>in</strong>g result.Lemma 13.2.1. Assume that S <strong>and</strong> T def<strong>in</strong>ed <strong>in</strong> (13.1) are nonl<strong>in</strong>ear operatorson a l<strong>in</strong>ear space X. Then D(S) = D(T).Proof. Consider Figure 13.1 <strong>and</strong> assume that d = 0 <strong>and</strong> w ∈ D(S). Thene = Sw ∈ X.It follows that Tw = −y = w − e ∈ X s<strong>in</strong>ce X is a l<strong>in</strong>ear space. Thusw ∈ D(T) <strong>and</strong> therefore D(S) ⊂ D(T). In a similar way it can be shownthat D(T) ⊂ D(S). Hence D(S) = D(T).□The operators S <strong>and</strong> T satisfy the follow<strong>in</strong>g complementarity constra<strong>in</strong>t.Theorem 13.2.2. Consider the operators def<strong>in</strong>ed <strong>in</strong> (13.1) as nonl<strong>in</strong>ear operatorson a l<strong>in</strong>ear space. Then on D(S) = D(T)S + T = I. (13.2)Proof. From the def<strong>in</strong>ition (13.1) <strong>and</strong> Figure 13.1, the sensitivity operatoris given byS = (GK + I) −1 , (13.3)<strong>and</strong> the complementarity sensitivity operator is given byClearly, on D(S) = D(T),T = GK(GK + I) −1 = GKS. (13.4)S + T = S + GKS = (GK + I) S = I,by def<strong>in</strong>ition of addition of operators.□


256 13. Nonl<strong>in</strong>ear <strong>Control</strong>Theorem 13.2.2 clearly <strong>in</strong>dicates that the complementarity constra<strong>in</strong>tgiven by (13.2) is purely determ<strong>in</strong>ed by the structure of the feedback controlloop together with the additive nature of the disturbance <strong>in</strong>puts. Hence,this constra<strong>in</strong>t is <strong>in</strong>dependent of whether the plant or controller are l<strong>in</strong>earor nonl<strong>in</strong>ear operators.The follow<strong>in</strong>g result is straightforward.Corollary 13.2.3. Let X be a Banach space <strong>and</strong> assume that S <strong>and</strong> T <strong>in</strong>(13.1) are nonl<strong>in</strong>ear operators on X. Then S + T = I on X if <strong>and</strong> only if S<strong>and</strong> T are stable operators on X.Proof. Immediate from Theorem 13.2.2 <strong>and</strong> the def<strong>in</strong>ition of stable operatoron page 247.□13.3 Sensitivity <strong>Limitations</strong>We consider <strong>in</strong> this section that the open loop L GK is a nonl<strong>in</strong>ear operatorL : D(L) ⊂ X → X, where X is a Banach space.Before obta<strong>in</strong><strong>in</strong>g the ma<strong>in</strong> result of this section on sensitivity limitations,we need a prelim<strong>in</strong>ary lemma, which is a nonl<strong>in</strong>ear generalization of the<strong>in</strong>terpolation constra<strong>in</strong>ts satisfied by the l<strong>in</strong>ear sensitivities.Lemma 13.3.1. Let L : D(L) ⊂ X → X be such that the operators S <strong>and</strong> Thave a nonempty doma<strong>in</strong>. Then(i) If L is nonm<strong>in</strong>imum phase, then T is nonm<strong>in</strong>imum phase.(ii) If L is unstable <strong>and</strong> D(L) is closed, then S is nonm<strong>in</strong>imum phase.Proof.(i) Assume that L is nonm<strong>in</strong>imum phase, i.e., cl R(L) is a strict subsetof X. From (13.4) it follows that R(T) ⊂ R(L) <strong>and</strong> hence cl R(T) ⊂cl R(L), which means that T is nonm<strong>in</strong>imum phase.(ii) Assume that L is unstable, i.e., D(L) is a strict subset of X. S<strong>in</strong>ce Tmaps D(T) = D(S) <strong>in</strong>to X, it follows from (13.4) that R(S) ⊂ D(L).Hence cl R(S) ⊂ cl D(L) = D(L) (s<strong>in</strong>ce D(L) is closed) <strong>and</strong> thus S isnonm<strong>in</strong>imum phase.□Note that, for (ii) to hold, it is essential that D(T) = D(S). Indeed, ifS <strong>and</strong> T were any nonl<strong>in</strong>ear operators satisfy<strong>in</strong>g T = LS, then D(T) =S −1 (D(L)) {x ∈ D(S) : Sx ∈ D(L)} <strong>and</strong> hence the relation R(S) ⊂ D(L)does not hold <strong>in</strong> general.The follow<strong>in</strong>g theorem gives lower bounds on the Lipschitz constantsof the nonl<strong>in</strong>ear sensitivity operators for open-loop nonm<strong>in</strong>imum phase<strong>and</strong> unstable systems.


13.3 Sensitivity <strong>Limitations</strong> 257Theorem 13.3.2. Let L : D(L) ⊂ X → X be such that the sensitivity operatorS is <strong>in</strong> Lip(X). Then(i) If L is nonm<strong>in</strong>imum phase, then ‖S‖ L ≥ 1.(ii) If L is unstable, then ‖T‖ L ≥ 1.Proof. Note first that s<strong>in</strong>ce S ∈ Lip(X), it is a stable operator. It followsfrom Corollary 13.2.3 that S + T = I on X. Then, S ∈ Lip(X) ⇒ T = I − S ∈Lip(X) s<strong>in</strong>ce Lip(X) is a l<strong>in</strong>ear space (Mart<strong>in</strong> Jr., 1976). Next:(i) Assume that L is nonm<strong>in</strong>imum phase. If ‖S‖ L < 1, it follows fromLemma 12.1.1 that I−S = T is <strong>in</strong>vertible <strong>in</strong> Lip(X), which is not possibles<strong>in</strong>ce T is nonm<strong>in</strong>imum phase by Lemma 13.3.1. Hence necessarily‖S‖ L ≥ 1.(ii) Assume that L is unstable. S<strong>in</strong>ce S is <strong>in</strong> Lip(X), it is possible to showthat L is a closed operator (cf. Seron, 1995, Theorem 6.4.3), which impliesthat D(L) is a closed set. Now, if ‖T‖ L < 1, then by Lemma 12.1.1,I − T = S is <strong>in</strong>vertible <strong>in</strong> Lip(X), which is not possible s<strong>in</strong>ce S is nonm<strong>in</strong>imumphase by Lemma 13.3.1. Hence ‖T‖ L ≥ 1.□Theorem 13.3.2 is the nonl<strong>in</strong>ear extension of Theorem 13.1.1 for the caseof a closed-loop system belong<strong>in</strong>g to the class of Lipschitz operators. Wepo<strong>in</strong>t out that the bounds given <strong>in</strong> Theorem 13.3.2 do not assume sensitivityreduction <strong>and</strong> hence they represent limits for any possible controldesign.The assumption that the sensitivities of <strong>in</strong>terest are Lipschitz operatorsis a rather strong requirement. A relaxation would be, for example, to assumethat they have a f<strong>in</strong>ite L 2 -ga<strong>in</strong>. We stress the fact, however, that theLipschitz assumption br<strong>in</strong>gs a substantial amount of mathematical structureto the problem, s<strong>in</strong>ce many tools from l<strong>in</strong>ear functional analysis haveimmediate counterparts <strong>in</strong> Lipschitz operator theory.As a f<strong>in</strong>al remark, we note that the results on Lipschitz sensitivity limitationsgiven <strong>in</strong> this section, also hold if the extended-space sett<strong>in</strong>g of§12.2 of Chapter 12 is used. Indeed, the crucial fact is that Lemma 13.3.1on <strong>in</strong>terpolation constra<strong>in</strong>ts is still valid s<strong>in</strong>ce D(S) = D(T).To see this, recall from (13.4) that T = GKS = LS. Then, us<strong>in</strong>g (12.5), thedoma<strong>in</strong> of T can be computed asD(T) = D(LS)= S −1 (D(L)) ∪ A(LS)= {x ∈ D(S) : Sx ∈ D(L)} ∪ {x ∈ D(S) : Sx ∈ E(L)}.S<strong>in</strong>ce D(S) = D(T), it follows that A(LS) = ∅. Thus, the analysis reducesto the def<strong>in</strong>ition of composition doma<strong>in</strong> used <strong>in</strong> §12.1. This means that the


258 13. Nonl<strong>in</strong>ear <strong>Control</strong>conclusions of Theorem 13.3.2 are valid when the operators are assumedto be def<strong>in</strong>ed on an extended Banach space.Interest<strong>in</strong>gly, if the open-loop system, L, is unstable, then, accord<strong>in</strong>g tothe term<strong>in</strong>ology <strong>in</strong>troduced <strong>in</strong> §12.2, there is an unstable nonl<strong>in</strong>ear “polezero”cancelation between L <strong>and</strong> S. This cancelation, however, is achievedvia feedback.13.4 The Water-Bed EffectAn important <strong>in</strong>itial result on performance limitations imposed by nonm<strong>in</strong>imumphase open-loop plants, was obta<strong>in</strong>ed by Shamma (1991) forthe problem of nonl<strong>in</strong>ear sensitivity reduction. In this section we will statethis result <strong>and</strong> give its counterpart for the complementary sensitivity.Consider aga<strong>in</strong> the feedback loop <strong>in</strong> Figure 13.1. Let D be a given classof f<strong>in</strong>ite-energy disturbances. We def<strong>in</strong>e the performance measureµ S (G, K, Ω, D) sup ‖Sd‖ Ω , (13.5)d∈Dwhere S is the nonl<strong>in</strong>ear sensitivity operator <strong>and</strong> ‖ · ‖ Ω denotes the signalenergydistributed over the range Ω. As mentioned <strong>in</strong> Shamma (1991),µ S (G, K, Ω, D) expresses the maximum effect of a disturbance d ∈ D onthe energy of y = Sd <strong>in</strong> the frequency <strong>in</strong>terval Ω.Shamma (1991) considered the problem of m<strong>in</strong>imization of (13.5) <strong>and</strong>derived the nonl<strong>in</strong>ear counterpart of the water-bed phenomenon experiencedby nonm<strong>in</strong>imum phase plants as stated below.Theorem 13.4.1 (Shamma 1991). Let Ω ⊂ ¡ have nonzero measure <strong>and</strong>let D ⊂ L 2 be a bounded set of disturbances. Let {K i } be a sequence of I/Ooperators which stabilize the I/O operator G. Suppose thatThen µ S (G, K i , Ω, D) → 0 impliessupiD ⊄ wk−cl R(G).sup ‖S i d‖ L2 = ∞.d∈DS i <strong>in</strong> Theorem 13.4.1 is the sensitivity operator <strong>in</strong>duced by the controllerK = K i <strong>in</strong> Figure 13.1. The result shows that, if the nonl<strong>in</strong>ear plant is“nonm<strong>in</strong>imum phase”, then an arbitrarily small value of the frequencyweightedsensitivity results <strong>in</strong> an arbitrarily large response to some admissibledisturbance. The plant G <strong>in</strong> Theorem 13.4.1 is “nonm<strong>in</strong>imum phase”<strong>in</strong> the sense that there exists a ˜d ∈ D such that ˜d ∉ wk−cl R(G). This◦


13.4 The Water-Bed Effect 259condition may be <strong>in</strong>terpreted as an <strong>in</strong>ability to construct a “stable approximate<strong>in</strong>verse” of the plant (Shamma, 1991). Note that, s<strong>in</strong>ce cl R(G) ⊂wk−cl R(G) <strong>in</strong> general, a system may be nonm<strong>in</strong>imum phase accord<strong>in</strong>gto the def<strong>in</strong>ition given <strong>in</strong> §12.1.2, but the weak-closure of its range maybe the whole space. Thus, the nonm<strong>in</strong>imum phaseness concept given <strong>in</strong>§12.1.2 seems to be <strong>in</strong>sufficient to prove a water-bed effect for nonl<strong>in</strong>earfeedback systems.We consider next the correspond<strong>in</strong>g phenomenon for the complementarysensitivity operator T <strong>and</strong> unstable open-loop dynamics. Let W be anadmissible class of f<strong>in</strong>ite-energy sensor disturbances. Def<strong>in</strong>e the performancemeasureµ T (G, K i , Ω, W) sup ‖T i w‖ Ω , (13.6)w∈Wwhere T i is the complementary sensitivity operator <strong>in</strong>duced by the controllerK = K i <strong>in</strong> Figure 13.1. We then have the follow<strong>in</strong>g result.Theorem 13.4.2. Let Ω ∈ ¡ have nonzero measure <strong>and</strong> let W ⊂ L 2 be abounded set of sensor disturbances. Let {K i } be a sequence of I/O operatorswhich stabilize the I/O operator G. Suppose thatThen µ T (G, K i , Ω, W) → 0 impliessupiW ⊄ wk−cl D(GK i ).sup ‖T i w‖ L2 = ∞. (13.7)w∈WProof. We follow the same l<strong>in</strong>e of development as <strong>in</strong> Shamma (1991).Suppose, by contradiction, that {K i } is such thatsupisup ‖T i w‖ L2 ≤ α < ∞. (13.8)w∈WLet w = ˜w ∉ wk−cl D(GK i ) <strong>and</strong> def<strong>in</strong>e y i T i ˜w. From (13.8), the sequenceof outputs {y i } is bounded. Hence, it conta<strong>in</strong>s a weakly convergentsubsequence (Conway, 1990), which we rename {y i }. S<strong>in</strong>ceµ T (G, K i , Ω, W) = sup ‖T i w‖ Ω → 0,w∈Wthen ‖y i ‖ Ω → 0. It follows that {y i } → 0 weakly (Shamma, 1991).Now, s<strong>in</strong>ce each K i stabilizes G, we have that both y i <strong>and</strong> e i ∈ L 2 .Hence, s<strong>in</strong>ce y i = GK i e i , it follows that e i ∈ D(GK i ). Bute i = y i + ˜w. (13.9)S<strong>in</strong>ce {y i } → 0 weakly, it follows from (13.9) that ˜w ∈ wk−cl D(GK i ),which contradicts the open-loop <strong>in</strong>stability assumption. Hence (13.7) follows.□


260 13. Nonl<strong>in</strong>ear <strong>Control</strong>Theorem 13.4.2 states the trade-off imposed upon the complementarysensitivity operator by unstable open-loop dynamics of the composition ofthe plant <strong>and</strong> controller. Loosely speak<strong>in</strong>g, the <strong>in</strong>stability is <strong>in</strong>terpreted asthe existence of a bounded admissible <strong>in</strong>put that produces an unboundedoutput.13.5 Sensitivity <strong>and</strong> Stability RobustnessIn this section we consider sufficient conditions that guarantee robust Lipschitzstability of the nonl<strong>in</strong>ear sensitivities when the open-loop system isperturbed <strong>in</strong> different ways. Indeed, we consider additive, output-multiplicative<strong>and</strong> <strong>in</strong>put-divisive perturbations. Each case will be reconfigured <strong>in</strong> the basicperturbation model shown <strong>in</strong> Figure 12.1 of Chapter 12 for appropriatechoices of the nonl<strong>in</strong>ear operator H.We consider the feedback loop <strong>in</strong> Figure 13.1 to be the nom<strong>in</strong>al system.We denote the nom<strong>in</strong>al open-loop operator by L GK <strong>and</strong> the nom<strong>in</strong>alsensitivity <strong>and</strong> complementary sensitivity by S <strong>and</strong> T, respectively. We usethe symbols ˜S <strong>and</strong> ˜T for the actual sensitivity operators correspond<strong>in</strong>g tothe perturbed plant, which will be denoted by ˜L.We then have the follow<strong>in</strong>g result.Theorem 13.5.1. Let the nom<strong>in</strong>al open-loop system L : D(L) ⊂ X → X besuch that the nom<strong>in</strong>al sensitivity operator S is <strong>in</strong> Lip(X). Then:(i) Assume that the perturbed system is modeled as ˜L L − ∆, with∆ ∈ Lip(X) (additive perturbation model, Figure 13.2). If ‖∆S‖ L < 1then the actual sensitivity operators are <strong>in</strong> Lip(X) <strong>and</strong>‖ ˜S‖ L ≤ ‖S‖ L (1 − ‖∆S‖ L ) −1 . (13.10)(ii) Assume that the perturbed system is modeled as ˜L (I − ∆)L, with∆ ∈ Lip(X) (output-multiplicative perturbation model, Figure 13.3).If ‖∆T‖ L < 1 then the actual sensitivity operators are <strong>in</strong> Lip(X) <strong>and</strong>‖ ˜S‖ L ≤ ‖S‖ L (1 − ‖∆T‖ L ) −1 . (13.11)(iii) Assume that the perturbed system is modeled as ˜L L(I − ∆) −1 ,with ∆ ∈ Lip(X) (<strong>in</strong>put-divisive perturbation model, Figure 13.4). If‖∆S‖ L < 1 then the actual sensitivity operators are <strong>in</strong> Lip(X) <strong>and</strong>‖ ˜S‖ L ≤ 1 + ‖T‖ L (1 − ‖∆S‖ L ) −1 . (13.12)Proof. (i) Let ˜L L − ∆, ∆ ∈ Lip(X), as shown <strong>in</strong> the left scheme ofFigure 13.2.


13.5 Sensitivity <strong>and</strong> Stability Robustness 261a✲✲∆Lb❜−❄✲ ❄ + d+ ✐−✲✐yd + e❜ ✲ ✐✲+ ✻✲bS∆a✛FIGURE 13.2. Additive perturbation model (left) <strong>and</strong> equivalent configuration <strong>in</strong>terms of S (right).Solv<strong>in</strong>g for y = a, we have that y = a = S(d + b), thus the loop onthe left can be drawn as the basic perturbation model on the rightof Figure 13.2. If ‖∆S‖ L < 1 it follows from Lemma 12.1.1 that (I −∆S) −1 ∈ Lip(X) <strong>and</strong> hence ˜S = H yd | (n=0) = S(I − ∆S) −1 ∈ Lip(X)with the bound given <strong>in</strong> (13.10) for its Lipschitz constant (see (12.4)).(ii) Let ˜L (I − ∆)L, ∆ ∈ Lip(X), as depicted <strong>in</strong> the left scheme ofFigure 13.3.a✲b∆❜−❄✲ ✲ ❄ + d+ ✐✲−L✐yd + e❜ ✲ ✐✲+ ✻✲bT∆a✛FIGURE 13.3. Multiplicative perturbation model (left) <strong>and</strong> equivalent configuration<strong>in</strong> terms of T (right).Solv<strong>in</strong>g for y, we have that y = S(d+b) Se, <strong>and</strong> then a = Ly = Te.Thus the loop on the left can be drawn as the loop on the right of Figure13.3. If ‖∆T‖ L < 1 Lemma 12.1.1 shows that (I−∆T) −1 ∈ Lip(X)<strong>and</strong> hence ˜S = H yd | (n=0) = SH ed = S(I − ∆T) −1 ∈ Lip(X). Thebound given <strong>in</strong> (13.11) for its Lipschitz constant follows us<strong>in</strong>g (12.3).(iii) Let ˜L L(I − ∆) −1 , ∆ ∈ Lip(X), as depicted <strong>in</strong> the left scheme ofFigure 13.4.Solv<strong>in</strong>g for a, we have that a = S(d + b) = Se (<strong>and</strong> then y = d −La = d − Te). Thus the loop on the left can be drawn as the loop onthe right of Figure 13.4. If ‖∆S‖ L < 1 it follows from Lemma 12.1.1that (I − ∆S) −1 ∈ Lip(X) <strong>and</strong> hence ˜S = H yd | (n=0) = I − TH ed =


262 13. Nonl<strong>in</strong>ear <strong>Control</strong>b+❄✐+ ✻∆a✛✲L❜ d+− ❄✲ ✐yd + e❜ ✲ ✐✲+ ✻✲bS∆a✛FIGURE 13.4. Divisive perturbation model (left) <strong>and</strong> equivalent configuration <strong>in</strong>terms of S (right).I − T(I − ∆S) −1 ∈ Lip(X), with the bound given <strong>in</strong> (13.12) for itsLipschitz constant.□The additive perturbation model may result, for example, from an additiveperturbation of the plant, i.e., ˜G = G − (∆ G ), giv<strong>in</strong>g the valuesL = GK <strong>and</strong> ∆ = (∆ G )K <strong>in</strong> Figure 13.2. The output-multiplicative perturbationmodel comes directly from an output-multiplicative perturbationof the plant of the form ˜G = (I − ∆)G. The <strong>in</strong>put-divisive perturbationmodel may represent (orig<strong>in</strong>ally unmodeled) sensor dynamics.It can be concluded from Theorems 13.3.2 <strong>and</strong> 13.5.1 that nonm<strong>in</strong>imumphase <strong>and</strong> unstable systems allow only “small” perturbations if robuststability is to be achieved. Indeed, consider for example robust stabilityunder additive perturbations; it then follows from Theorem 13.5.1 <strong>and</strong> thesub-multiplicative property of the Lipschitz norm that‖∆‖ L ‖S‖ L < 1is also a sufficient condition for robust stability. This, together with Theorem13.3.2 thus imply that ‖∆‖ L < 1 is required for the sufficient conditionfor stability of the perturbed system to hold.13.6 SummaryThis chapter has extended the known concept of sensitivity operators, asused <strong>in</strong> the case of l<strong>in</strong>ear feedback systems, to the case of nonl<strong>in</strong>ear systems.The nonl<strong>in</strong>ear sensitivity operators correspond<strong>in</strong>g to the unity feedbackcontrol problem with additive disturbance <strong>in</strong>puts satisfy the structuralcomplementarity constra<strong>in</strong>t S+T = I <strong>and</strong> “<strong>in</strong>terpolation constra<strong>in</strong>ts”similar to their l<strong>in</strong>ear counterparts. Namely, if the open-loop system isnonm<strong>in</strong>imum phase, then T is nonm<strong>in</strong>imum phase, <strong>and</strong> if the open-loopsystem is unstable, then S is nonm<strong>in</strong>imum phase. The nonl<strong>in</strong>ear extensionsof the l<strong>in</strong>ear concepts of nonm<strong>in</strong>imum phase <strong>and</strong> unstable systemsare h<strong>and</strong>led as properties of the doma<strong>in</strong> <strong>and</strong> range of nonl<strong>in</strong>ear operators.


13.6 Summary 263For the case of Lipschitz sensitivities, the complementarity <strong>and</strong> <strong>in</strong>terpolationconstra<strong>in</strong>ts can be used to establish the fact that the Lipschitzconstant of sensitivity is bounded below by one for nonm<strong>in</strong>imum phaseopen-loop systems <strong>and</strong> that the Lipschitz constant of complementary sensitivityis bounded below by one for open-loop unstable systems. Theseresults parallel those known <strong>in</strong> l<strong>in</strong>ear control theory on the bound<strong>in</strong>g ofthe H ∞ norms of S <strong>and</strong> T.Probably the first result on performance limitations <strong>in</strong> general nonl<strong>in</strong>earfeedback control was given <strong>in</strong> Shamma (1991). Consider<strong>in</strong>g the nonl<strong>in</strong>earsensitivity as an operator <strong>in</strong> a Hilbert space, Shamma established that,for a nonm<strong>in</strong>imum phase system, the achievement of arbitrary sensitivityreduction over a frequency range necessarily implies an arbitrarily largeresponse to some admissible disturbance. This is the nonl<strong>in</strong>ear extensionof the water-bed phenomenon experienced by the l<strong>in</strong>ear sensitivity (see§3.2 <strong>in</strong> Chapter 3). As expected, a result parallel to that of Shamma canalso be derived on the trade-off <strong>in</strong>duced by unstable open-loop dynamicsfor the problem of complementary sensitivity reduction.The nonl<strong>in</strong>ear sensitivities developed here are the appropriate tool withwhich to state sufficient conditions for robust stability. This shows thatthe role of the l<strong>in</strong>ear sensitivities, S <strong>and</strong> T, <strong>in</strong> measur<strong>in</strong>g closed-loop robustnessaga<strong>in</strong>st unstructured perturbations can be extended to nonl<strong>in</strong>earsystems.Notes <strong>and</strong> ReferencesSensitivity <strong>Limitations</strong>The results on nonl<strong>in</strong>ear Lipschitz sensitivity limitations are based on Seron <strong>and</strong>Goodw<strong>in</strong> (1996), Seron (1995) <strong>and</strong> Goodw<strong>in</strong> <strong>and</strong> Seron (1995).The nonl<strong>in</strong>ear extension of the water-bed effect is due to Shamma (1991).Stability Robustness§13.5 is based on Seron <strong>and</strong> Goodw<strong>in</strong> (1996). Robustness tests similar to thosegiven <strong>in</strong> §13.5 were obta<strong>in</strong>ed by Astolfi <strong>and</strong> Guzzella (1993) us<strong>in</strong>g the L 2 -ga<strong>in</strong>(or “nonl<strong>in</strong>ear H ∞ norm”) <strong>in</strong> lieu of the Lipschitz constant. Astolfi <strong>and</strong> Guzzella,however, used a state-space approach. Other results on robustness for nonl<strong>in</strong>earfeedback systems are the works of Georgiou <strong>and</strong> Smith 1995a; 1995b; 1995c, whouse a nonl<strong>in</strong>ear generalization of the gap metric.


14Nonl<strong>in</strong>ear Filter<strong>in</strong>gThis chapter represents a prelim<strong>in</strong>ary extension to the nonl<strong>in</strong>ear case ofthe sensitivity approach to filter<strong>in</strong>g of Chapters 8 <strong>and</strong> 9. We use the backgroundon nonl<strong>in</strong>ear operators given <strong>in</strong> Chapter 12. Indeed, we first studythe complementarity of the filter<strong>in</strong>g sensitivities <strong>in</strong> the framework of §12.1<strong>and</strong> then use the concept of nonl<strong>in</strong>ear cancelations developed <strong>in</strong> §12.2 toaddress the issue of stability of the estimation error.14.1 A Complementarity Constra<strong>in</strong>tConsider the filter<strong>in</strong>g problem shown schematically <strong>in</strong> Figure 14.1.❜ v ✲❜ w ✲Gzy✲F+✲ ❥✲˜z−✻^zFIGURE 14.1. General filter<strong>in</strong>g configuration.For the purposes of this chapter, the “plant”, depicted <strong>in</strong> Figure 14.2,is characterized by two nonl<strong>in</strong>ear operators. The “output” operator, G y , isassumed to be a nonl<strong>in</strong>ear operator on a l<strong>in</strong>ear space, X, i.e., G y : D(G y ) ⊂X → X. It maps the process <strong>in</strong>put, v, <strong>in</strong>to the noise-free output, y − w. The“state” operator, G z , is the nonl<strong>in</strong>ear operator G z : D(G z ) ⊂ X → X, which


266 14. Nonl<strong>in</strong>ear Filter<strong>in</strong>gmaps v <strong>in</strong>to the signal (or <strong>in</strong>ternal state) to be estimated, z. Specifically,y = G y v + w,z = G z v,(14.1)where w is measurement noise corrupt<strong>in</strong>g the system output. We will furtherassume that the operator G y is “unbiased”, i.e., G y 0 = 0.❜ v ✲❜ w ✲✲✲G zG y✲+❣+✻✲zy✲FIGURE 14.2. Structure of the plant.A nonl<strong>in</strong>ear state estimator for (14.1) is chosen as^z = Fy, (14.2)where ^z is the estimate of z <strong>and</strong> the filter, F, is a nonl<strong>in</strong>ear operator F :D(F) ⊂ X → X. F is assumed to be a stable operator, i.e., D(F) = X.Let the estimation error be, as <strong>in</strong> Chapter 9,˜z = z − ^z. (14.3)We will assume that the operator G z is right <strong>in</strong>vertible. By this, we meanthat there exists a right <strong>in</strong>verse G −1z such that G z G −1z = I R(Gz), whereI R(Gz) is the restriction of the identity operator to R(G z ). Also, we willuse the setD z {x ∈ R(G z ) : G −1z x ∈ D(G y )}. (14.4)Observe from Figure 14.2, that D z is the set of signals z’s such that G −1z zgives a signal v <strong>in</strong> the doma<strong>in</strong> of the output operator G y .We def<strong>in</strong>e the nonl<strong>in</strong>ear filter<strong>in</strong>g sensitivity, P, <strong>and</strong> the nonl<strong>in</strong>ear filter<strong>in</strong>gcomplementary sensitivity, M, asP H ˜zv | (w=0) G −1z ,M −H ˜zw | (v=0) G y G −1z .(14.5)We then have the follow<strong>in</strong>g complementarity constra<strong>in</strong>t for the filter<strong>in</strong>gloop <strong>in</strong> Figure 14.1.Theorem 14.1.1. Consider the plant (14.1) <strong>and</strong> the filter (14.2) with estimationerror def<strong>in</strong>ed <strong>in</strong> (14.3). Assume that G y is an unbiased operator.Then, the follow<strong>in</strong>g complementarity constra<strong>in</strong>t holds:P + M = I Dz , (14.6)


14.1 A Complementarity Constra<strong>in</strong>t 267where I Dz<strong>in</strong> (14.4).is the restriction of the identity operator to the set D z givenProof. From (14.3), (14.1) <strong>and</strong> (14.2), we haveH ˜zv | (w=0) = G z − FG y . (14.7)Next, we note that, ^z = Fw whenever v = 0, s<strong>in</strong>ce G y is unbiased. ThenH ˜zw | (v=0) = −H^zw | (v=0) = F, (14.8)where the first equality follows us<strong>in</strong>g the fact that H zw = 0. From (14.7)<strong>and</strong> (14.8), we haveMultiply<strong>in</strong>g from the right by G −1zH ˜zv | (w=0) = G z − ( −H ˜zw | (v=0))Gy .<strong>and</strong> us<strong>in</strong>g (14.5) yieldsP = I R(Gz) − M.The doma<strong>in</strong> where the above relation holds, however, is not all of R(G z ),but the <strong>in</strong>tersection of R(G z ) with the doma<strong>in</strong> of M. To compute D(M),note thatM = F(G y G −1z ).S<strong>in</strong>ce F is a stable operator, then D(M) is given byD(M) = ( G −1 ) −1z (D(Gy ))= {x ∈ D(G −1z ) : G −1z x ∈ D(G y )}= {x ∈ R(G z ) : G −1z x ∈ D(G y )}= D z .Hence (14.6) follows.□Note that, if D(G y ) = D(G z ), then D z = R(G z ). This is the case whenthe operators G y <strong>and</strong> G z share the same “<strong>in</strong>stabilities”. In the l<strong>in</strong>ear case,it would correspond to the situation where all the unstable modes of theplant, assumed observable from y, are also observable from the signal tobe estimated, z.Theorem 14.1.1 proves that the complementarity constra<strong>in</strong>t shown tohold for l<strong>in</strong>ear filter<strong>in</strong>g <strong>in</strong> Chapter 8, extends to the problem of nonl<strong>in</strong>earfilter<strong>in</strong>g under process noise <strong>and</strong> additive measurement noise.


268 14. Nonl<strong>in</strong>ear Filter<strong>in</strong>g14.2 Bounded Error Nonl<strong>in</strong>ear EstimationIn this section, we will extend the concept of BEE, def<strong>in</strong>ed <strong>in</strong> Chapter 8,to nonl<strong>in</strong>ear filters. First, we will ref<strong>in</strong>e the structure of the plant <strong>in</strong> Figure14.2. S<strong>in</strong>ce z is an <strong>in</strong>ternal variable, it is reasonable to assume that theoperators G y <strong>and</strong> G z have some common dynamics. We will model thisasG y = G yy G c ,G z = G zz G c .(14.9)For convenience, we redraw the filter<strong>in</strong>g loop as <strong>in</strong> Figure 14.3, <strong>in</strong>dicat<strong>in</strong>gthe special structure that we assume for the plant.v ✲w✲G c✲✲G zzG yy+✲ ❤+✻zy✲F✲+❥✲˜z−✻^zFIGURE 14.3. Filter<strong>in</strong>g loop with special structure for the plantFrom (14.7), (14.9) <strong>and</strong> (14.8), the filter<strong>in</strong>g error operators have the formH ˜zv | (w=0) = ˜GG c ,H ˜zw | (v=0) = F,(14.10)where ˜G G zz − FG yy . We <strong>in</strong>troduce the follow<strong>in</strong>g def<strong>in</strong>ition.Def<strong>in</strong>ition 14.2.1 (Bounded Error Nonl<strong>in</strong>ear Estimator). A stable nonl<strong>in</strong>earoperator F is said to be a bounded error nonl<strong>in</strong>ear estimator (BENE)for the system (14.9), if the error operators (14.10) are stable nonl<strong>in</strong>ear operators.◦Observe that, contrary to the l<strong>in</strong>ear case, the BENE concept applies onlyto the particular <strong>in</strong>puts v <strong>and</strong> w. Another <strong>in</strong>terest<strong>in</strong>g dist<strong>in</strong>ction is thatthere is no guarantee that the estimation error will be bounded when both<strong>in</strong>puts act simultaneously, s<strong>in</strong>ce superposition is not valid for nonl<strong>in</strong>earsystems.In order to derive conditions for a nonl<strong>in</strong>ear estimator to be a BENE, weneed to work with the framework of §12.2 <strong>in</strong> Chapter 12. Us<strong>in</strong>g def<strong>in</strong>ition(12.5) to compute the doma<strong>in</strong> of H ˜zv | (w=0) given <strong>in</strong> (14.10), we haveD(H ˜zv | (w=0) ) = D( ˜GG c )= G −1c (D( ˜G)) ∪ A( ˜GG c ),(14.11)


14.3 Sensitivity <strong>Limitations</strong> 269whereG −1c (D( ˜G)) = {x ∈ D(G c ) : G c x ∈ D( ˜G)},A( ˜GG c ) = {x ∈ D(G c ) : G c x ∈ E( ˜G)}.(14.12)We can now state a necessary <strong>and</strong> sufficient condition for a nonl<strong>in</strong>ear operatorto be a BENE.Theorem 14.2.1. A stable nonl<strong>in</strong>ear operator, F, is a BENE for the system(14.9) if <strong>and</strong> only if the follow<strong>in</strong>g two conditions hold:(i) G −1c (D( ˜G)) = D(G c ),(ii) A( ˜GG c ) = D(G c ),where ˜G is def<strong>in</strong>ed <strong>in</strong> (14.10), <strong>and</strong> where G −1c (D( ˜G)), A( ˜GG c ) are given<strong>in</strong> (14.12).Proof. S<strong>in</strong>ce F is stable, it follows from Def<strong>in</strong>ition 14.2.1 that F is a BENEfor (14.9) if <strong>and</strong> only if H ˜zv | (w=0) <strong>in</strong> (14.10) is a stable nonl<strong>in</strong>ear operator.This, <strong>in</strong> turn, holds if <strong>and</strong> only if D( ˜GG c ) = X <strong>in</strong> (14.11). S<strong>in</strong>ce G −1c (D( ˜G))<strong>and</strong> the added doma<strong>in</strong> A( ˜GG c ) <strong>in</strong> (14.12) have no <strong>in</strong>tersection, the resultthen follows.□Assum<strong>in</strong>g that ˜G is stable <strong>and</strong> G c is unstable, then the stability of H ˜zv | (w=0)<strong>in</strong> (14.10) must be achieved by an unstable nonl<strong>in</strong>ear zero-pole cancelation.It is not difficult to see that condition (i) <strong>in</strong> Theorem 14.2.1 is equivalentto the follow<strong>in</strong>g:R(G c ) ⊂ D( ˜G) .Similarly, condition (ii) can be written as:G c (D(G c )) ⊂ E( ˜G) ,where G c (D(G c )) is the obvious notation for the image of the set D(G c )through G c . Hence, we can alternatively say that F is a BENE if <strong>and</strong> onlyif: (i’) G c maps every x <strong>in</strong> its doma<strong>in</strong> to the doma<strong>in</strong> of ˜G, <strong>and</strong> (ii’) G cmaps every x outside its doma<strong>in</strong> to the extended doma<strong>in</strong> of ˜G. Figure 14.4illustrates this <strong>in</strong>terpretation.14.3 Sensitivity <strong>Limitations</strong>Lemma 13.3.1 <strong>in</strong> Chapter 13 established that nonm<strong>in</strong>imum phase <strong>and</strong> unstablecharacteristics of the open-loop system were <strong>in</strong>herited by the closedloopoperators S <strong>and</strong> T. These “nonl<strong>in</strong>ear <strong>in</strong>terpolation constra<strong>in</strong>ts” extendedthe <strong>in</strong>terpolation constra<strong>in</strong>ts for S <strong>and</strong> T <strong>in</strong> l<strong>in</strong>ear feedback control.


270 14. Nonl<strong>in</strong>ear Filter<strong>in</strong>gG cE( ˜G)˜GD(G c)D( ˜G)D(G c) G cR(G c)˜GXXXX eFIGURE 14.4. Set <strong>in</strong>terpretation of a BENE.In Chapters 8 <strong>and</strong> 9, we showed that similar <strong>in</strong>terpolation constra<strong>in</strong>ts canbe derived for the l<strong>in</strong>ear filter<strong>in</strong>g sensitivities, P <strong>and</strong> M, achieved by BEEs.By way of contrast, nonm<strong>in</strong>imum phase <strong>and</strong> unstable characteristicsof the system to be estimated, do not seem to necessarily constra<strong>in</strong>, atleast with<strong>in</strong> our framework, properties of the nonl<strong>in</strong>ear filter<strong>in</strong>g sensitivitiesachieved by a BENE. To see this, we will discuss the case where G zz<strong>in</strong> (14.9) is the identity operator. This will simplify the analysis to the compositionof two operators rather than three, <strong>and</strong> the conclusions are stillsimilar.Us<strong>in</strong>g G zz = I <strong>and</strong> (14.10) <strong>in</strong> the def<strong>in</strong>itions (14.5), we obta<strong>in</strong>P = ˜G = I − FG yy ,M = FG yy .(14.13)Next, assume that the output operator G y <strong>in</strong> (14.9) is such that G c is unstable<strong>and</strong> G yy is D-NMP. If the filter, F, is a BENE, it follows from Theorem14.1.1 that there exists an unstable nonl<strong>in</strong>ear zero-pole cancelation<strong>in</strong> the composition ˜GG c . This means that ˜G is E-NMP. It does not follow,however, that P has a defect <strong>in</strong> the closure of its range unless ˜G is alsoD-NMP.Turn<strong>in</strong>g to M, the fact that G yy is D-NMP implies that the nonl<strong>in</strong>earoperator F is act<strong>in</strong>g on a set, R(G yy ), which is a strict subset of X. This yetdoes not necessarily lead to a defective range for the composition FG yy .We formalize the above discussion <strong>in</strong> the follow<strong>in</strong>g result.Theorem 14.3.1. Let the plant be given by (14.9) with G zz = I. Supposethat P <strong>in</strong> (14.13) is a Lipschitz operator on X. Then(i) If G yy is D-NMP, <strong>and</strong> the filter, F, maps sets whose closure are strictsubsets of X <strong>in</strong>to sets whose closure are also strict subsets of X, then‖P‖ L ≥ 1.(ii) If G c is unstable, <strong>and</strong> ˜G is NMP, then ‖M‖ L ≥ 1.


14.4 Summary 271Proof. Note that the conditions <strong>in</strong> (i) imply that M is D-NMP, <strong>and</strong> the conditions<strong>in</strong> (ii) imply that P is NMP. The proof then follows similarly to theproof of Theorem 13.3.2 <strong>in</strong> Chapter 13.□From Theorem 14.3.1 <strong>and</strong> the previous discussion, it seems plausiblethat the constra<strong>in</strong>ts on the filter<strong>in</strong>g sensitivities, due to nonm<strong>in</strong>imum phase<strong>and</strong> unstable plant dynamics, could be avoided by a proper selection ofthe filter operator, F.14.4 SummaryFor the problem of nonl<strong>in</strong>ear filter<strong>in</strong>g <strong>in</strong> the presence of process noise<strong>and</strong> additive measurement noise, we have def<strong>in</strong>ed <strong>in</strong> this chapter appropriateextensions of the filter<strong>in</strong>g sensitivities considered <strong>in</strong> Chapters 8<strong>and</strong> 9. These nonl<strong>in</strong>ear filter<strong>in</strong>g sensitivities are complementary operatorswhen the external signals affect<strong>in</strong>g the loop belong to a l<strong>in</strong>ear space. Forbounded error nonl<strong>in</strong>ear estimators, we established sensitivity limitationsunder conditions expressed <strong>in</strong> terms of nonm<strong>in</strong>imum phase <strong>and</strong> unstabledynamics of the plant to be estimated. These limitations, however, donot seem to be unavoidable. Indeed, it appears possible to design nonl<strong>in</strong>earbounded error filters such that the sensitivities are not constra<strong>in</strong>ed bynonm<strong>in</strong>imum phase <strong>and</strong> unstable dynamics of the plant.Chapter 13 has dealt with the extension to nonl<strong>in</strong>ear systems of limitationsknown to hold for LTI systems. It is possible that relaxation of l<strong>in</strong>earitycould circumvent some trade-offs that are unavoidable <strong>in</strong> the specialcase of LTI systems. Indeed, the results for nonl<strong>in</strong>ear filter<strong>in</strong>g of Chapter14 h<strong>in</strong>t at a positive answer to this question. This conjecture, however,still rema<strong>in</strong>s an open research question.Notes <strong>and</strong> ReferencesThe results of this chapter are based on Seron (1995).


Part VAppendices


Appendix AReview of Complex VariableTheoryA central result of Complex Variable Theory is the Cauchy Integral Theorem<strong>and</strong> its applications. In this appendix we will state <strong>and</strong> prove thisfundamental result, which is the core of Cauchy’s theory of complex <strong>in</strong>tegration.The theory of complex <strong>in</strong>tegration is one of the three avenues ofapproach to complex analysis, the other two be<strong>in</strong>g the theory of complexderivatives <strong>and</strong> the theory of power series. These three approaches are associatedwith the names of Cauchy, Riemann <strong>and</strong> Weierstrass respectively.The review given <strong>in</strong> this appendix summarizes the essential backgroundon analytic functions <strong>and</strong> complex variable theory. The emphasis is oncomplex <strong>in</strong>tegration <strong>and</strong> complex derivatives, but a brief <strong>in</strong>troduction topower series is also provided.A.1 Functions, Doma<strong>in</strong>s <strong>and</strong> RegionsWe will be primarily <strong>in</strong>terested <strong>in</strong> complex-valued functions def<strong>in</strong>ed onregions. Before def<strong>in</strong><strong>in</strong>g regions, we need some concepts related to sets.An open set is (pathwise) connected if each pair of po<strong>in</strong>ts <strong>in</strong> it can be jo<strong>in</strong>edby a polygonal path (consist<strong>in</strong>g of a f<strong>in</strong>ite number of l<strong>in</strong>e segments jo<strong>in</strong>edend to end) that lies entirely <strong>in</strong> the set (see Figure A.1). An open connectedset of po<strong>in</strong>ts is called a doma<strong>in</strong>. A po<strong>in</strong>t s 0 is called a boundary po<strong>in</strong>t of aset U if every neighborhood of the po<strong>in</strong>t s 0 conta<strong>in</strong>s at least one po<strong>in</strong>t<strong>in</strong> the set U <strong>and</strong> at least one po<strong>in</strong>t not <strong>in</strong> the set U. A region is a doma<strong>in</strong>together with none, some, or all of its boundary po<strong>in</strong>ts. A region is said


276 Appendix A. Review of Complex Variable Theoryto be closed if it conta<strong>in</strong>s all its boundary po<strong>in</strong>ts, <strong>and</strong> bounded if it can beenclosed <strong>in</strong> a circle of sufficiently large radius. If Ω is a region, the notationΩ denotes the closed region formed by Ω together with all its boundarypo<strong>in</strong>ts. Doma<strong>in</strong>s <strong>and</strong> regions are subsets of the extended complex plane, i.e.,the set of all f<strong>in</strong>ite complex numbers (the complex plane ¢ ) <strong>and</strong> the po<strong>in</strong>tat <strong>in</strong>f<strong>in</strong>ity, ∞. We denote the extended complex plane by ¢ e = ¢ ∪ {∞}.s 1s 2FIGURE A.1. Open connected set.A function f is cont<strong>in</strong>uous at a po<strong>in</strong>t s 0 if the follow<strong>in</strong>g condition is satisfied:lims→s 0f(s) = f(s 0 ),which implies that for each positive number ε there exists a positive numberδ such that|f(s) − f(s 0 )| < ε whenever |s − s 0 | < δ. (A.1)A function of a complex variable is said to be cont<strong>in</strong>uous <strong>in</strong> a region Ω if itis cont<strong>in</strong>uous at each po<strong>in</strong>t <strong>in</strong> Ω.A.2 Complex DifferentiationThere is an important difference between real <strong>and</strong> complex derivatives.We recall that the existence of the derivative of a function of a real variableis essentially a mild smoothness condition. On the other h<strong>and</strong>, theexistence of the derivative of a function of an <strong>in</strong>dependent variable hav<strong>in</strong>gtwo “degrees of freedom” implies a great deal about the function. Aswe will see <strong>in</strong> this section, complex differentiability leads to a pair of differentialequations — the Cauchy-Riemann equations — which must besatisfied by the function’s real <strong>and</strong> imag<strong>in</strong>ary parts.Let w = f(s) be a given complex function of the complex variable s,def<strong>in</strong>ed on a doma<strong>in</strong>, D. Then w is said to have a derivative at s 0 ∈ D iff(s) − f(s 0 )lims→s 0 s − s 0(A.2)


A.2 Complex Differentiation 277exists, <strong>and</strong> is <strong>in</strong>dependent of the direction of the <strong>in</strong>crement s − s 0 . Wedenote this limit as f ′ (s 0 ) or df/ds| s0 . When the derivative of f at s 0 ∈ Dexists, f is said to be differentiable at s 0 .We then have the follow<strong>in</strong>g necessary <strong>and</strong> sufficient condition for differentiability.Theorem A.2.1. The function f(s) is differentiable at the po<strong>in</strong>t s 0 ∈ D if<strong>and</strong> only if the function <strong>in</strong>crement f(s) − f(s 0 ) can be written <strong>in</strong> the formf(s) − f(s 0 ) = k(s − s 0 ) + ε(s, s 0 )(s − s 0 ),(A.3)where ε(s, s 0 ) → 0 as s → s 0 , <strong>and</strong> k is a constant <strong>in</strong>dependent of s − s 0<strong>and</strong> ε.Proof. If f has a derivative f ′ (s 0 ) at s 0 then, by def<strong>in</strong>itionf(s) − f(s 0 )s − s 0= f ′ (s 0 ) + ε(s, s 0 ), (A.4)where ε(s, s 0 ) → 0 as s → s 0 . Multiply<strong>in</strong>g (A.4) by (s − s 0 ), we f<strong>in</strong>d thatf(s) − f(s 0 ) can be written <strong>in</strong> the form (A.3) with k = f ′ (s 0 ). Conversely,if (A.3) holds, then divid<strong>in</strong>g by (s − s 0 ) <strong>and</strong> tak<strong>in</strong>g the limit as s → s 0 , wef<strong>in</strong>d that f ′ (s 0 ) exists <strong>and</strong> equals k.□The fact that the <strong>in</strong>crement s − s 0 <strong>in</strong> (A.2) is complex, sets a very severerestriction on the class of functions that have a complex derivative, as wesee from the follow<strong>in</strong>g example.Example A.2.1. Consider the function f(s) = s = σ − jω, i.e., the complexconjugate of s. Then,f(s) − f(s 0 )= s − s 0.s − s 0 s − s 0If s−s 0 is real, then s − s 0 = s−s 0 <strong>and</strong> the limit as s → s 0 is 1. But if s−s 0is purely imag<strong>in</strong>ary, then s − s 0 = −(s − s 0 ) <strong>and</strong> the limit is −1. Thus, thisfunction is not differentiable anywhere, although its real <strong>and</strong> imag<strong>in</strong>aryparts, σ <strong>and</strong> −ω, are well behaved.◦In the above example, the nonexistence of f ′ (s) was proven by tak<strong>in</strong>glimits first through real values <strong>and</strong> then through imag<strong>in</strong>ary values. Thesetwo possibilities lead to the ma<strong>in</strong> condition that a complex function mustsatisfy <strong>in</strong> order that it have a derivative. These conditions are stated <strong>in</strong> thefollow<strong>in</strong>g result.Theorem A.2.2 (Cauchy-Riemann equations). Let the function f(s) =u(σ, ω) + jv(σ, ω), where s = σ + jω, be def<strong>in</strong>ed on a doma<strong>in</strong> D. Thena necessary <strong>and</strong> sufficient condition for f to be differentiable at the po<strong>in</strong>t


278 Appendix A. Review of Complex Variable Theorys 0 = σ 0 + jω 0 ∈ D is that the functions u <strong>and</strong> v be differentiable (as functionsof the two real variables σ <strong>and</strong> ω) at the po<strong>in</strong>t (σ 0 , ω 0 ) <strong>and</strong> satisfythe Cauchy-Riemann equations,at (σ 0 , ω 0 ). Furthermore∂u∂σ = ∂v∂ω <strong>and</strong> ∂u∂ω = − ∂v∂σ ,f ′ (s 0 ) = ∂u∂σ + j ∂v∂σ = ∂v∂ω + j ∂v∂σ = ∂u∂σ − j ∂u∂ω = ∂v∂ω − j ∂u∂ω ,where the partial derivatives are evaluated at (σ 0 , ω 0 ).(A.5)(A.6)Proof. We prove that the conditions are necessary. Let ∆ w = f(s 0 + ∆ s ) −f(s 0 ) = ∆ u + j∆ v . S<strong>in</strong>ce f is differentiable at s 0 , we have from TheoremA.2.1∆ w = k∆ s + ε∆ s , k = f ′ (s 0 ), (A.7)where ε goes to zero as ∆ s goes to zero. Then, writ<strong>in</strong>g ∆ s = ∆ σ + j∆ ω ,k = a + jb <strong>and</strong> ε = ε 1 + jε 2 , we haveSo∆ u + j∆ v = (a + jb)(∆ σ + j∆ ω ) + (ε 1 + jε 2 )(∆ σ + j∆ ω ).∆ u = a∆ σ − b∆ ω + ε 1 ∆ σ − ε 2 ∆ ω ,∆ v = b∆ σ + a∆ ω + ε 2 ∆ σ + ε 1 ∆ ω ,where ε 1 , ε 2 → 0 as ∆ σ , ∆ ω → 0, s<strong>in</strong>ce√|∆ s | = (∆ σ ) 2 + (∆ ω ) 2 , |ε 1 | ≤ |ε|, |ε 2 | ≤ |ε|.It follows that the functions u <strong>and</strong> v are differentiable at (σ 0 , ω 0 ) <strong>and</strong>∂u∂σ = a = ∂v∂ω ,∂u∂v= −b = −∂ω ∂σ ,(A.8)which immediately imply (A.5) <strong>and</strong> (A.6).The proof of sufficiency follows by revers<strong>in</strong>g the preced<strong>in</strong>g argument.□A.3 Analytic functionsThe Cauchy-Riemann equations represent a necessary <strong>and</strong> sufficient conditionfor po<strong>in</strong>twise differentiability of a complex function. We will begenerally <strong>in</strong>terested, however, <strong>in</strong> differentiability throughout a region. Thismotivates the follow<strong>in</strong>g def<strong>in</strong>ition.


A.3 Analytic functions 279Def<strong>in</strong>ition A.3.1 (Analytic Function). A function f is said to be analytic 1at a po<strong>in</strong>t s 0 if f is differentiable throughout some neighborhood of s 0 . Afunction is analytic <strong>in</strong> a region, if it is analytic at every po<strong>in</strong>t of the region.◦In the case of a doma<strong>in</strong> (open, connected, nonempty set), differentiability<strong>and</strong> analyticity are equivalent, but <strong>in</strong> other cases, differentiability isrequired <strong>in</strong> a larger open set (for example, f is analytic for |s| ≤ 1 if f isdifferentiable for |s| < 1 + δ, where δ > 0).A remarkable result from the theory of functions of a complex variableis that every analytic function is <strong>in</strong>f<strong>in</strong>itely differentiable <strong>and</strong>, furthermore,has a power series expansion about each po<strong>in</strong>t of its doma<strong>in</strong>. We will showthis <strong>in</strong> §A.7.1 <strong>and</strong> §A.7.2.As we know from calculus, a sufficient condition for the differentiabilityof the functions u(σ, ω) <strong>and</strong> v(σ, ω) on a doma<strong>in</strong> D is that the partialderivatives∂u∂σ , ∂u∂ω , ∂v∂σ , ∂v(A.9)∂ωexist <strong>and</strong> are cont<strong>in</strong>uous on D. Therefore, a sufficient condition for thefunction f = u + jv to be analytic on D is that the partial derivatives exist,are cont<strong>in</strong>uous <strong>and</strong> satisfy the Cauchy-Riemann equations on D. Conversely,it follows from Theorem A.2.2 that, if f = u + jv is analytic ona doma<strong>in</strong> D, then u <strong>and</strong> v have partial derivatives (A.9) satisfy<strong>in</strong>g theCauchy-Riemann conditions (A.5). 2We illustrate by some examples.Example A.3.1. Consider f(s) = s 2 u(σ, ω) + jv(σ, ω). Then u(σ, ω) =σ 2 − ω 2 , v(σ, ω) = 2σω, <strong>and</strong> thus∂u∂σ = 2σ ,∂u∂ω = −2ω ,∂v∂σ = 2ω ,∂v∂ω = 2σ .Hence the Cauchy-Riemann equations (A.5) hold for all σ <strong>and</strong> ω, <strong>and</strong> thefunction is clearly analytic <strong>in</strong> the complex plane.◦Example A.3.2. Consider f(s) = |s| 2 . We then have that u(σ, ω) = σ 2 +ω 2 <strong>and</strong> v(σ, ω) = 0. Here the Cauchy-Riemann equations require 2σ =0, 2ω = 0, <strong>and</strong> hence they require s = 0. S<strong>in</strong>ce f ′ (s) does not exist throughouta neighborhood of s = 0, this function is not analytic. Nevertheless1 Also known as holomorphic or regular.2 In fact, these partial derivatives are cont<strong>in</strong>uous, but this does not follow from TheoremA.2.2, unless analyticity is def<strong>in</strong>ed as cont<strong>in</strong>uous-differentiability. We will use this restricteddef<strong>in</strong>ition of analyticity to give a compact proof of Cauchy’s Integral Theorem <strong>in</strong>§A.5.2.


280 Appendix A. Review of Complex Variable Theoryf ′ (0) exists, as shown by the equationf(s) − f(0) |s| 2lim= lims→0 s s→0 s = 0. ◦Example A.3.3. Consider a rational function of the form:H(s) = K(s − q 1) · · · (s − q m )(s − p 1 ) · · · (s − p n ) N(s)D(s) .(A.10)ThendH D dNds = ds − NdD dsD 2 .These derivatives clearly exist save when D = 0. Hence H is analytic saveat the po<strong>in</strong>ts where D has zeros.◦Example A.3.4. Consider H as <strong>in</strong> (A.10). Thend log Hds=[ DN] ⎧ ⎪ ⎨ D dN ⎫ds − NdD ⎪⎬ds⎪ ⎩ D 2 ⎪ ⎭= 1 dNN ds − 1 dDD ds .(A.11)Hence log H is analytic save at the po<strong>in</strong>ts where N <strong>and</strong> D have zeros.◦The two previous examples were concerned with functions that are analyticon a region except for some po<strong>in</strong>ts. Those po<strong>in</strong>ts where the functionis not analytic are called s<strong>in</strong>gular po<strong>in</strong>ts or s<strong>in</strong>gularities, <strong>and</strong> are analyzed <strong>in</strong>more detail <strong>in</strong> §A.8.A.3.1Harmonic FunctionsLet f = u + jv be analytic on a doma<strong>in</strong> D. Then it satisfies the Cauchy-Riemann equations (A.5) on D. Suppose further that u <strong>and</strong> v have cont<strong>in</strong>uoussecond partial derivatives (we will show <strong>in</strong> §A.7.1 that they are <strong>in</strong>fact <strong>in</strong>f<strong>in</strong>itely differentiable). Differentiat<strong>in</strong>g the Cauchy-Riemann equationsaga<strong>in</strong> we get∂ 2 u∂σ 2 =∂2 v∂σ ∂ω ,∂ 2 u∂ω 2 = − ∂2 v∂ω ∂σ .Hence,∂ 2 u∂σ 2 + ∂2 u∂ω 2 = 0.(A.12)


A.4 Complex Integration 281A similar relation holds for v. Equation (A.12) is known as Laplace’s equation<strong>and</strong> the LHS the Laplacian of u, usually denoted by ∇ 2 u. We then havethe follow<strong>in</strong>g def<strong>in</strong>ition.Def<strong>in</strong>ition A.3.2 (Harmonic Function). Let u be a real-valued functiondef<strong>in</strong>ed on a doma<strong>in</strong> D. Then u is said to be harmonic if u has cont<strong>in</strong>uoussecond partial derivatives that satisfy Laplace’s equation (A.12). ◦It follows that the real <strong>and</strong> imag<strong>in</strong>ary parts of an analytic function areharmonic functions. A harmonic function v related to u by the Cauchy-Riemann equations is said to be the harmonic conjugate of u.The follow<strong>in</strong>g example of a harmonic function is of particular <strong>in</strong>terest<strong>in</strong> the sequel.Example A.3.5. Consider aga<strong>in</strong> log H, where H is the rational function ofExample A.3.3. We have seen that log H is analytic on any complex doma<strong>in</strong>that does not conta<strong>in</strong> zeros of either the numerator or denom<strong>in</strong>ator of H. Itfollows that, on such doma<strong>in</strong>s, the function log |H| = Re log H is harmonic.◦F<strong>in</strong>ally, a cont<strong>in</strong>uous real-valued function u(σ, ω), def<strong>in</strong>ed <strong>in</strong> some region,is said to be subharmonic if it satisfies∇ 2 u ∂2 u∂σ 2 + ∂2 u∂ω 2 ≥ 0.A.4 Complex IntegrationIn contrast to the real case, complex <strong>in</strong>tegration requires the specificationof not only the limits of <strong>in</strong>tegration but also the particular curve that connectsthese po<strong>in</strong>ts. We thus start this section with a review of curves.A.4.1CurvesA curve 3 <strong>in</strong> the extended complex plane is a set of po<strong>in</strong>ts{σ(ζ) + jω(ζ),ζ ∈ [a, b]}where σ <strong>and</strong> ω are cont<strong>in</strong>uous real-valued functions of the parameter ζ <strong>in</strong>some <strong>in</strong>terval [a, b]. A concise way of denot<strong>in</strong>g a curve is by means of acomplex-valued function s = s(ζ), wheres(ζ) = σ(ζ) + jω(ζ).(A.13)3 Note that there are slight differences <strong>in</strong> the literature on various def<strong>in</strong>itions used <strong>in</strong> thissection.


282 Appendix A. Review of Complex Variable TheoryWe will then say that the curve is def<strong>in</strong>ed by the function s(ζ).A curve def<strong>in</strong>ed by s = s(ζ), with ζ <strong>in</strong> [a, b], is a closed curve if its extremepo<strong>in</strong>ts co<strong>in</strong>cide, i.e., s(a) = s(b). A curve is simple if it does not <strong>in</strong>tersectitself, with the possible exception of the extreme po<strong>in</strong>ts if the curve isalso closed. A closed simple curve is sometimes called a Jordan curve. Accord<strong>in</strong>gto the Jordan curve theorem (e.g., Churchill <strong>and</strong> Brown, 1984, p. 83),a closed simple curve divides ¢ e <strong>in</strong>to two doma<strong>in</strong>s: an <strong>in</strong>terior doma<strong>in</strong>,the <strong>in</strong>terior, which is bounded, <strong>and</strong> an exterior doma<strong>in</strong>, the exterior. Po<strong>in</strong>tsof the <strong>in</strong>terior of a Jordan curve are called <strong>in</strong>terior po<strong>in</strong>ts or po<strong>in</strong>ts with<strong>in</strong>the curve.The derivative s ′ (ζ), or d[s(ζ)]/dζ, of s with respect to the parameter ζis def<strong>in</strong>ed ass ′ (ζ) = σ ′ (ζ) + j ω ′ (ζ) = dσ/dζ + j dω/dζ,provided the derivatives σ ′ (ζ) <strong>and</strong> ω ′ (ζ) both exist.A curve is said to be differentiable if σ <strong>and</strong> ω are differentiable functionsof the parameter ζ. A cont<strong>in</strong>uously differentiable curve is called an arc.Let C, given by s = s(ζ), ζ ∈ [a, b], be an arc. Then the real valuedfunction√|s ′ (ζ)| = [σ ′ (ζ)] 2 + [ω ′ (ζ)] 2is <strong>in</strong>tegrable over the <strong>in</strong>terval [a, b], <strong>and</strong> the arc C is said to have length 4l =∫ ba|s ′ (ζ)| dζ. (A.14)An arc is smooth if among its various parametric representations, there isat least one representation (A.13) such that the cont<strong>in</strong>uous derivative s ′ (ζ)is never zero on the <strong>in</strong>terval [a, b]. The geometric mean<strong>in</strong>g of smoothnessis clear from the fact that if s ′ (ζ) ≠ 0 for all ζ <strong>in</strong> [a, b], then the curve has atangent at every po<strong>in</strong>t, whose angle of <strong>in</strong>cl<strong>in</strong>ation is arg s ′ (ζ).A contour is a curve consist<strong>in</strong>g of a f<strong>in</strong>ite number of arcs jo<strong>in</strong>ed end toend. The length of a contour is the sum of the lengths of the arcs that constitutethe contour. If each of the arcs that form the contour is smooth, thenthe contour is piecewise smooth. A piecewise smooth contour may not havea tangent at the po<strong>in</strong>ts where the arcs are jo<strong>in</strong>ed to each other, <strong>in</strong> whichcase it has a corner. For example, a rectangle on the plane is a piecewisesmooth contour. Figure A.2 illustrates the curves just def<strong>in</strong>ed.For the purposes of this book, it suffices to assume that all the curvesconsidered are piecewise smooth contours. Most of the contours that weuse are <strong>in</strong> fact piecewise smooth <strong>and</strong> closed.4 This expression arises from the def<strong>in</strong>ition of arc length <strong>in</strong> calculus.


A.4 Complex Integration 283ω(ζ)σ(ζ)A.4.2FIGURE A.2. Simple curve, closed but not simple curve, <strong>and</strong> a contour.IntegralsIn this subsection we def<strong>in</strong>e complex <strong>in</strong>tegration <strong>and</strong> relate it to l<strong>in</strong>e <strong>and</strong>area <strong>in</strong>tegrals of functions of two real variables.As a prelim<strong>in</strong>ary step, we def<strong>in</strong>e the <strong>in</strong>tegral of a complex-valued functionof a real variable ζ over a given <strong>in</strong>terval a ≤ ζ ≤ b. Let s(ζ) =σ(ζ) + jω(ζ), ζ ∈ [a, b], be piecewise cont<strong>in</strong>uous on [a, b]. The <strong>in</strong>tegralof s from a to b is def<strong>in</strong>ed as∫ b ∫ b ∫ bs(ζ) dζ = σ(ζ) dζ + j ω(ζ) dζ.(A.15)aaaIt is easy to show (e.g., Churchill <strong>and</strong> Brown, 1984, p. 80) that the <strong>in</strong>tegral(A.15) satisfies the <strong>in</strong>equality∫ b ∣ s(ζ) dζ∣ ≤ |s(ζ)| dζ,(A.16)a∣∫ bawhich also holds <strong>in</strong> cases when the <strong>in</strong>tegrals are improper, provided bothsides of (A.16) exist.By complex <strong>in</strong>tegration we refer to the <strong>in</strong>tegration of complex-valuedfunctions of the complex variable s. Recall that a complex-valued functionf(s) is said to be cont<strong>in</strong>uous on the arc C, def<strong>in</strong>ed by s = s(ζ), if thefunction f(s(ζ)) is cont<strong>in</strong>uous for ζ <strong>in</strong> [a, b] 5 . Then for a function f cont<strong>in</strong>uous6 on C, the <strong>in</strong>tegral of f on C is def<strong>in</strong>ed by∫f(s) ds = f(s(ζ)) s ′ (ζ) dζ.(A.17)C∫ baWhen f is cont<strong>in</strong>uous on a contour C, i.e., cont<strong>in</strong>uous on each arc C i , i =1, . . . n, that forms the contour, then we def<strong>in</strong>e the <strong>in</strong>tegral by∫∫∫∫f(s) ds = f(s) ds + f(s) ds + · · · + f(s) ds.CC 1 C 2 C n5 Note that f is cont<strong>in</strong>uous on C if it is cont<strong>in</strong>uous <strong>in</strong> a region of the extended complexplane conta<strong>in</strong><strong>in</strong>g C.6 Piecewise cont<strong>in</strong>uity is, <strong>in</strong> fact, sufficient.


284 Appendix A. Review of Complex Variable TheoryComplex <strong>in</strong>tegration is a l<strong>in</strong>ear operation, that is, the follow<strong>in</strong>g propertyholds ∫∫∫[α f(s) + β g(s)] ds = α f(s) ds + β g(s) ds,Cwhenever α <strong>and</strong> β are complex constants <strong>and</strong> f <strong>and</strong> g are (piecewise) cont<strong>in</strong>uouson the arc or contour C. Also, if F is a complex function with cont<strong>in</strong>uousderivative 7 f <strong>in</strong> a doma<strong>in</strong> conta<strong>in</strong><strong>in</strong>g the arc C, then the fundamentaltheorem of calculus holds <strong>in</strong> the form∫f(s) ds = F(A) − F(B),(A.18)Cwhere A <strong>and</strong> B are the <strong>in</strong>itial <strong>and</strong> term<strong>in</strong>al po<strong>in</strong>ts of C respectively.Complex <strong>in</strong>tegrals can also be represented as <strong>in</strong>tegrals of real-valuedfunctions of two real variables. Indeed, by separat<strong>in</strong>g <strong>in</strong>to real <strong>and</strong> imag<strong>in</strong>aryparts, any function f can be expressed <strong>in</strong> terms of two real functions,i.e., f = u + jv. Thus, if s = σ + jω, we can writeCf(s) = u(σ, ω) + jv(σ, ω).We can then formally replace f = u + jv <strong>and</strong> ds = dσ + j dω <strong>in</strong> the expression∫ f(s) ds, to obta<strong>in</strong>C∫∫[ ]f(s) ds = u(σ, ω)+jv(σ, ω) (dσ + j dω)CC∫∫[ ] [ ]= u(σ, ω) dσ−v(σ, ω) dω +j u(σ, ω) dω+v(σ, ω) dσ .CCC(A.19)Hence, we can represent a complex <strong>in</strong>tegral us<strong>in</strong>g two l<strong>in</strong>e <strong>in</strong>tegrals of thek<strong>in</strong>d∫p(σ, ω) dσ + q(σ, ω) dω,(A.20)Cwhere p <strong>and</strong> q are real-valued functions of the real variables σ <strong>and</strong> ω. That(A.19) is equivalent to (A.17) follows immediately from def<strong>in</strong>ition (A.15).Now, assum<strong>in</strong>g that p <strong>and</strong> q are cont<strong>in</strong>uous <strong>in</strong> some doma<strong>in</strong> D conta<strong>in</strong><strong>in</strong>gthe arc C given by s = s(ζ), ζ ∈ [a, b], then the l<strong>in</strong>e <strong>in</strong>tegrals along C frompo<strong>in</strong>t A = s(a) to po<strong>in</strong>t B = s(b) are def<strong>in</strong>ed as∫ B ∫ bp(σ, ω) dσ = p (σ(ζ), ω(ζ)) σ ′ (ζ) dζ,Aa∫ B ∫ bq(σ, ω) dω = q (σ(ζ), ω (ζ)) ω ′ (ζ) dζ.Aa(A.21)7 See def<strong>in</strong>ition of complex derivative <strong>in</strong> §A.2.


A.4 Complex Integration 285The l<strong>in</strong>e <strong>in</strong>tegral (A.20) can be <strong>in</strong>terpreted as a vector <strong>in</strong>tegral (e.g.,Widder, 1961). Indeed, consider the unit tangent vector, ⃗t, <strong>and</strong> the unitouter normal, ⃗n, to the curve C, as shown <strong>in</strong> Figure A.3. Then ⃗t has com-jω⃗tCdσds− dσds ⃗ndωdsdωdsσFIGURE A.3. Tangent <strong>and</strong> outer normal vectors.ponents dσ/ds, dω/ds, <strong>and</strong> ⃗n has components dω/ds, −dσ/ds (Kaplan,1973, p. 237). Consider next the vector (q, −p), <strong>and</strong> note that(q, −p) · ⃗n = qdω/ds + pdσ/ds,where “·” denotes scalar product of vectors. It is then easy to see that∫∫p dσ + q dω = (q, −p) · ⃗n ds.(A.22)CWe next turn to the def<strong>in</strong>ition of area (or double) <strong>in</strong>tegral of a functionof two real variables. Let p(σ, ω) be a function def<strong>in</strong>ed over a closed <strong>and</strong>bounded region Ω of the extended complex plane. Suppose we subdivideΩ by draw<strong>in</strong>g a grid parallel to the real <strong>and</strong> imag<strong>in</strong>ary axes as shown<strong>in</strong> Figure A.4. We thus form a f<strong>in</strong>ite number of closed square subregionsjωCjω iσ iΩ iCσFIGURE A.4. Subdivision of the region Ω.where each po<strong>in</strong>t of Ω lies <strong>in</strong> at least one such subregion <strong>and</strong> each subregionconta<strong>in</strong>s po<strong>in</strong>ts of Ω. Consider those square regions that conta<strong>in</strong>


286 Appendix A. Review of Complex Variable Theoryonly po<strong>in</strong>ts <strong>in</strong>side Ω (i.e., only “full” squares). Number them from 1 tom <strong>and</strong> denote by Ω i the area of the i-th square. Choos<strong>in</strong>g an arbitrarypo<strong>in</strong>t (σ i , ω i ) <strong>in</strong> the i-th square, then the double <strong>in</strong>tegral of p(σ, ω) over theregion Ω is def<strong>in</strong>ed as∫∫m∑p(σ, ω) dσ dω = lim p(σ i , ω i )Ω i ,Ωi=1(A.23)where the limit is taken as the grid becomes <strong>in</strong>f<strong>in</strong>itely f<strong>in</strong>e, i.e., m goes to<strong>in</strong>f<strong>in</strong>ite <strong>and</strong> the maximum diagonal of all squares approaches 0.The existence of the limit <strong>in</strong> (A.23) can be demonstrated (see e.g., Widder,1961) when p is cont<strong>in</strong>uous <strong>and</strong> Ω satisfies simple conditions, e.g.,when Ω can be split <strong>in</strong>to a f<strong>in</strong>ite number of square subregions. In fact, theresult also holds when the subregions are not necessarily square, but eachof them can be described by <strong>in</strong>equalities of any of the follow<strong>in</strong>g formsa ≤ σ ≤ b, g 1 (σ) ≤ ω ≤ g 2 (σ), (A.24)c ≤ ω ≤ d, g 3 (ω) ≤ σ ≤ g 4 (ω), (A.25)where g 1 , g 2 , g 3 <strong>and</strong> g 4 are cont<strong>in</strong>uous functions. The first of these twosituations is represented <strong>in</strong> Figure A.5.jωg 2 (σ)Ωg 1 (σ)CabσFIGURE A.5. Simple region.In both cases, the double <strong>in</strong>tegral (A.23) can be computed as an iterated<strong>in</strong>tegral, i.e., if Ω is given by (A.24), we have∫∫p(σ, ω) dσ dω =Ω∫ baor alternatively, if Ω is given by (A.25),∫∫p(σ, ω) dσ dω =Ω∫ dc∫ g2 (σ)g 1 (σ)∫ g4 (ω)g 3 (ω)p(σ, ω) dω dσ,p(σ, ω) dσ dω.


A.4 Complex Integration 287Integrals <strong>in</strong> Limit<strong>in</strong>g CasesIn many of the applications to control <strong>and</strong> filter<strong>in</strong>g theory discussed <strong>in</strong>the ma<strong>in</strong> body of the book, we consider <strong>in</strong>tegrals over the semicircularcontour shown <strong>in</strong> Figure A.6, where the radius R can be made either <strong>in</strong>def<strong>in</strong>itelylarge or <strong>in</strong>def<strong>in</strong>itely small. This requires evaluation of <strong>in</strong>tegralsjωRCσFIGURE A.6. Typical path of <strong>in</strong>tegration.<strong>in</strong> limit<strong>in</strong>g cases, a task that is circumvented by appropriate estimates orbounds on the <strong>in</strong>tegrals <strong>in</strong> question, as we show next.Let f be any function def<strong>in</strong>ed on a doma<strong>in</strong> D, <strong>and</strong> consider a contour, C,<strong>in</strong> D, def<strong>in</strong>ed by s = s(ζ), ζ ∈ [a, b]. Suppose that f is piecewise cont<strong>in</strong>uouson C <strong>and</strong> let f m be the largest value of |f(s)| over C 8 . It is easy to see 9 that∫∣C∫ bf(s) ds∣ ≤ f m |s ′ (ζ)| dζ = f m l,a(A.26)where l is the length of the contour C.The bound for the <strong>in</strong>tegral given <strong>in</strong> (A.26) can be used to obta<strong>in</strong> estimatesof the <strong>in</strong>tegral <strong>in</strong> various limit<strong>in</strong>g situations, as the follow<strong>in</strong>g examplesshows.Example A.4.1. Consider the <strong>in</strong>tegral of powers of s over the contour Cshown <strong>in</strong> Figure A.6. The path length of this contour is πR. Hence if f(s)varies as s −2 , then the magnitude of f(s) on C, <strong>and</strong> hence f m <strong>in</strong> (A.26),must vary as R −2 . Thus the <strong>in</strong>tegral on C vanishes as R goes to ∞. Thesame result holds, of course, if f(s) varies as any larger negative power of s.◦8 Such a value f m will always exist. Indeed, if f is cont<strong>in</strong>uous on the arc C, then the realvaluedfunction |f(s(ζ))| is cont<strong>in</strong>uous on the closed bounded <strong>in</strong>terval [a, b], <strong>and</strong> hence italways reaches a maximum value f m on that <strong>in</strong>terval. Hence |f(s)| has a maximum value onC when f is cont<strong>in</strong>uous. It is immediate that the same is true when f is piecewise cont<strong>in</strong>uouson C.9 Us<strong>in</strong>g def<strong>in</strong>ition (A.17), property (A.16) <strong>and</strong> the def<strong>in</strong>ition of the length of a contourgiven <strong>in</strong> §A.4.1.


288 Appendix A. Review of Complex Variable TheoryExample A.4.2. If the semicircle of Figure A.6 has very small, rather thanvery large radius then the path length vanishes <strong>in</strong> the limit. It is then clearfrom (A.26) that the <strong>in</strong>tegral also vanishes if f(s) either approaches a constantvalue or behaves <strong>in</strong> the vic<strong>in</strong>ity of the orig<strong>in</strong> as any positive power ofs. On the other h<strong>and</strong>, if f(s) behaves as s −2 or any larger negative powerof s, then f m will <strong>in</strong>crease rapidly as R dim<strong>in</strong>ishes, <strong>and</strong> thus we can saynoth<strong>in</strong>g about the <strong>in</strong>tegral on the basis of (A.26).◦When not only powers of s are <strong>in</strong>volved but also exponential functions,a h<strong>and</strong>y result is found <strong>in</strong> Jordan’s Lemma, which is reviewed next.Lemma A.4.1 (Jordan’s Lemma). Let C be the contour of Figure A.6 <strong>and</strong>let τ be a positive real number. Then∫C∣ ∣ e−sτ π |ds| 0.<strong>and</strong> let C be the semicircle of the Jordan’s lemma. Then∫e −sτ ∫∣ dss ∣ ≤ |e −sτ ||ds| < π |s| Rτ ,CC<strong>and</strong> hence∫e −sτlim ds = 0.R→∞ C sObviously the same result is true for f(s) = e −sτ /s k , k ≥ 1.◦


A.5 Ma<strong>in</strong> Integral Theorems 289A.5 Ma<strong>in</strong> Integral TheoremsIn complex <strong>in</strong>tegration, a natural question arises whether the value of the<strong>in</strong>tegral is affected by the choice of the path between <strong>in</strong>tegration limits, orwhether the <strong>in</strong>tegral is <strong>in</strong>variant with respect to the path. We will see <strong>in</strong>§A.5.2 that Cauchy’s Integral Theorem connects this question to importantproperties of the function to be <strong>in</strong>tegrated. As a prelude, we next giveGreen’s Theorem, which is part of the necessary background to prove theCauchy Integral Theorem.A.5.1Green’s TheoremWe prove <strong>in</strong> this section Green’s Theorem, which applies to real-valuedfunctions of two real variables def<strong>in</strong>ed on a simply connected doma<strong>in</strong>.Informally, a doma<strong>in</strong> D is said to be simply connected if it has no holes.More precisely, D is simply connected if for every simple closed curveC ⊂ D, the region Ω formed from C plus its <strong>in</strong>terior lies wholly <strong>in</strong> D.We then have the follow<strong>in</strong>g theorem for functions def<strong>in</strong>ed over a simplyconnected doma<strong>in</strong>.Theorem A.5.1 (Green’s Theorem). Let D be a simply connected doma<strong>in</strong><strong>and</strong> let C be a piecewise smooth simple closed contour <strong>in</strong> D. Let p(σ, ω),q(σ, ω) be functions that are cont<strong>in</strong>uous <strong>and</strong> have cont<strong>in</strong>uous first partialderivatives <strong>in</strong> D. Then∮C∫∫p dσ + q dω =where Ω is the closed region bounded by C.Ω( ∂q∂σ − ∂p )dσ dω,∂ω(A.29)Proof. We first consider a simple case <strong>in</strong> which Ω has the form shown <strong>in</strong>Figure A.5, i.e., it is representable <strong>in</strong> both of the forms (A.24) <strong>and</strong> (A.25).Then the double <strong>in</strong>tegral∫∫∂pdω dσ∂ωcan be written as the iterated <strong>in</strong>tegral∫∫ΩΩ∫∂pb ∫ g2 (σ)∂ω dσ dω = ∂pdω dσ.a g 1 (σ) ∂ω


290 Appendix A. Review of Complex Variable TheoryOne can now <strong>in</strong>tegrate to achieve∫∫Ω∫∂pb∂ω dσ dω = [p (σ, g 2 (σ)) − p (σ, g 1 (σ))] dσa∫ b ∫ a= p (σ, g 2 (σ)) dσ + p (σ, g 1 (σ)) dσab∮= − p(σ, ω) dσ.By a similar argument we obta<strong>in</strong>∫∫∂q∂σ∮Cdσ dω = q dω.ΩCThe result follows on add<strong>in</strong>g the two double <strong>in</strong>tegrals for the simple typeof region Ω represented as <strong>in</strong> (A.24) <strong>and</strong> (A.25).It is easy to prove the result for more complex regions that can be decomposed<strong>in</strong>to a f<strong>in</strong>ite number of simple regions. For the most generalcase, it is necessary to approximate the region by the latter <strong>and</strong> then performa limit<strong>in</strong>g process.□Green’s formula (A.29) connects a double <strong>in</strong>tegral over a region witha l<strong>in</strong>e <strong>in</strong>tegral over its boundary, <strong>and</strong> it is at the basis of the proof ofCauchy’s Integral Theorem. We next give a version of this formula thatis used <strong>in</strong> Chapter 4 to obta<strong>in</strong> <strong>in</strong>tegral constra<strong>in</strong>ts for multivariable systems.For a real-valued function f(σ, ω) with cont<strong>in</strong>uous second partial derivatives<strong>in</strong> some doma<strong>in</strong>, let ∇f <strong>and</strong> ∇ 2 f be the gradient <strong>and</strong> Laplacian of f,respectively, given by∇f =( ∂f∂σ , ∂f ), <strong>and</strong> ∇ 2 f = ∂2 f∂ω∂σ 2 + ∂2 f∂ω 2 .We then have the follow<strong>in</strong>g corollary.Corollary A.5.2. Let C be a piecewise smooth, simple closed contour <strong>in</strong> asimply connected doma<strong>in</strong>, D, <strong>and</strong> let Ω be the closed region bounded byC. Let f(σ, ω), g(σ, ω) be functions that are cont<strong>in</strong>uous <strong>and</strong> have cont<strong>in</strong>uoussecond partial derivatives <strong>in</strong> D. Then∮ (f ∂g∂⃗n − g ∂f ) ∫∫ds = (f ∇ 2 g − g ∇ 2 f) dσ dω, (A.30)∂⃗nCΩwhere ∂f/∂⃗n = ∇f·⃗n is the directional derivative of f along ⃗n, the exteriornormal of C.


A.5 Ma<strong>in</strong> Integral Theorems 291Proof. We use the identitydiv(f ∇g) = f ∇ 2 g + ∇f · ∇g,where the symbol div denotes the divergence of a vector field (see e.g.,Kaplan, 1973). Integrat<strong>in</strong>g the above equation over Ω, we have∫∫∫∫∫∫div(f ∇g) dσ dω = f ∇ 2 g dσ dω + ∇f · ∇g dσ dω. (A.31)ΩΩUs<strong>in</strong>g the def<strong>in</strong>itions of divergence <strong>and</strong> gradient, <strong>and</strong> equations (A.29)<strong>and</strong> (A.22), the LHS above can be written as∫∫∫∫ [ ( ∂div(f ∇g) dσ dω = f ∂g )+ ∂ (f ∂g )]dσ dω∂σ ∂σ ∂ω ∂ωΩΩ∮= f ∇g · ⃗n dsC∮= f ∂f∂⃗n ds.CHence, (A.31) can alternatively be expressed as∮f ∂f ∫∫∫∫∂⃗n ds = f ∇ 2 g dσ dω + ∇f · ∇g dσ dω.CΩUs<strong>in</strong>g the above identity with f <strong>and</strong> g <strong>in</strong>terchanged, <strong>and</strong> subtract<strong>in</strong>g, leadsto (A.30).□ΩΩA.5.2The Cauchy Integral TheoremIn §A.3, we <strong>in</strong>troduced the class of analytic functions, which are at the coreof the results presented <strong>in</strong> the book. For these functions, a fundamentalresult — Cauchy’s Integral Theorem — can be stated which allows one toevaluate the <strong>in</strong>tegral of such functions on closed contours.Before address<strong>in</strong>g this result, let us motivate it with a simple exampleof <strong>in</strong>tegration of powers of s over a circle around the orig<strong>in</strong>.Example A.5.1. Let f(s) = s −1 <strong>and</strong> consider the contour C shown <strong>in</strong> FigureA.7. The <strong>in</strong>tegral over C is computed <strong>in</strong> the clockwise direction, i.e.,from s 1 to s 2 . On C, we have that s = Re jθ , <strong>and</strong> so ds = jRe jθ dθ. Hence∫Cdss = ∫ θ2θ 1jRe jθ dθRe jθ= −j (θ 1 − θ 2 ) . (A.32)


292 Appendix A. Review of Complex Variable Theoryjωs 1Rθ 1s 2θ 2CσFIGURE A.7. Path of <strong>in</strong>tegration used <strong>in</strong> Example A.5.1.Note that this result is <strong>in</strong>dependent of the value of R. We will next use(A.32) to evaluate the <strong>in</strong>tegral of powers of s around a full circle of radiusR centered at the orig<strong>in</strong> of the complex plane. In this case,∮s m (ds = R m e jmθ) jRe jθ dθ∫ π−π∫ π= jR m+1 + 1)θ + j s<strong>in</strong>(m + 1)θ] dθ{−π[cos(m0 for m ≠ −1,=−j2π for m = −1, (<strong>in</strong>tegration clockwise).(A.33)In particular, note that the <strong>in</strong>tegral of positive power of s over the circlevanishes. This is not casual, as we see next.◦Theorem A.5.3 (Cauchy Integral Theorem). If f is analytic on some simplyconnected doma<strong>in</strong> D, then∮f(s) ds = 0,Cwhere C is any piecewise smooth simple closed contour <strong>in</strong> D.Proof. We will prove this theorem assum<strong>in</strong>g the restricted def<strong>in</strong>ition of analyticityreferred to <strong>in</strong> the footnote on page 279, i.e., a function is analyticif it is cont<strong>in</strong>uously differentiable. 10Let f = u + jv. From (A.19), we have∮ ∮∮f ds = (u dσ − v dω) + j (v dσ + u dω).CCC10 This restricted def<strong>in</strong>ition allows us to obta<strong>in</strong> a simple proof of the theorem. Moreover,there is no loss of generality <strong>in</strong> this assumption s<strong>in</strong>ce the orig<strong>in</strong>al def<strong>in</strong>ition not requir<strong>in</strong>gcont<strong>in</strong>uity ultimately leads to the conclusion that the derivative of the function is, <strong>in</strong> fact,cont<strong>in</strong>uous.


A.5 Ma<strong>in</strong> Integral Theorems 293S<strong>in</strong>ce f is analytic (<strong>and</strong> hence f ′ is cont<strong>in</strong>uous) on a simply connected doma<strong>in</strong>D, it follows from Theorem A.2.2 that u <strong>and</strong> v have cont<strong>in</strong>uous partialderivatives (A.9). S<strong>in</strong>ce D is simply connected, we can apply Green’stheorem (Theorem A.5.1) to the real <strong>and</strong> imag<strong>in</strong>ary parts of the <strong>in</strong>tegral <strong>in</strong>the above equation to obta<strong>in</strong>∮ ∫∫ (f ds = − ∂vC∂σ − ∂u ) ∫∫ ( ∂udσ dω + j∂ω∂σ − ∂v )dσ dω,∂ωΩΩwhere Ω is the region bounded by C. S<strong>in</strong>ce the partial derivatives of u<strong>and</strong> v satisfy the Cauchy-Riemann conditions (A.5) (Theorem A.2.2), the<strong>in</strong>tegr<strong>and</strong>s of these two double <strong>in</strong>tegrals are zero throughout D <strong>and</strong> hencethe result follows.□The above proof follows the orig<strong>in</strong>al sett<strong>in</strong>g used by Cauchy <strong>in</strong> his proof<strong>in</strong> the early part of the last century. Several decades later, it was discoveredby Edouard Goursat that the hypothesis of cont<strong>in</strong>uity of the derivative <strong>in</strong>Cauchy’s theorem can be dispensed with, which led to the modern, moregeneral version of this important result 11 . The relaxation of this assumptionallows us to use Cauchy’s Integral Theorem to show that, <strong>in</strong> fact, cont<strong>in</strong>uityof the derivative follows from analyticity of the function.Theorem A.5.3 is readily illustrated us<strong>in</strong>g Example A.5.1, as seen below.Example A.5.2. Suppose that f(s) is the polynomialf(s) = a 0 + a 1 s + · · · + a n s n .Then f(s) is analytic on <strong>and</strong> with<strong>in</strong> a circle about the orig<strong>in</strong>, <strong>and</strong>, accord<strong>in</strong>gto the Cauchy <strong>in</strong>tegral theorem, the <strong>in</strong>tegral on the complete circle mustvanish. But this was <strong>in</strong>deed the result of Example A.5.1, which showedthat the <strong>in</strong>tegral of each term of the polynomial vanishes.◦The simple piecewise smooth closed contour <strong>in</strong> Theorem A.5.3 can bereplaced by less trivial contours without compromis<strong>in</strong>g the result. Theseextensions are discussed <strong>in</strong> the follow<strong>in</strong>g subsection.A.5.3Extensions of Cauchy’s Integral TheoremAs a first extension, we note that the contour <strong>in</strong> Theorem A.5.3 can bereplaced by a piecewise smooth closed contour C that is not necessarilysimple. For if C <strong>in</strong>tersects itself a f<strong>in</strong>ite number of times, it consists of af<strong>in</strong>ite number of simple closed contours, as illustrated <strong>in</strong> Figure A.2 <strong>in</strong>§A.4.1. By apply<strong>in</strong>g the Cauchy Integral Theorem to each of those simple11 Some authors even refer to this theorem as the Cauchy-Goursat Theorem (e.g., Lev<strong>in</strong>son<strong>and</strong> Redheffer, 1970; Churchill <strong>and</strong> Brown, 1984).


294 Appendix A. Review of Complex Variable Theoryclosed contours, the desired result for C is obta<strong>in</strong>ed. Also, a portion ofC may be traversed twice <strong>in</strong> opposite directions s<strong>in</strong>ce the <strong>in</strong>tegrals alongthese portions <strong>in</strong> the two directions cancel each other.Another useful extension of Cauchy Integral Theorem covers the casewhere the contour C is the oriented boundary of a multiply connected doma<strong>in</strong>.To see how this extension can be established, let C 0 be a simple closedcontour <strong>and</strong> let C i , i = 1, 2, · · · , n, be a f<strong>in</strong>ite number of simple closedcontours <strong>in</strong>side C 0 such that the <strong>in</strong>teriors of each C i have no po<strong>in</strong>ts <strong>in</strong>common. Let Ω be the closed region consist<strong>in</strong>g of all po<strong>in</strong>ts with<strong>in</strong> <strong>and</strong>on C 0 except for po<strong>in</strong>ts <strong>in</strong>terior to each C i (Figure A.8). Let C denote thejωC 0C 1Γ 2Γ 1 C 2l 3l 2l 1ΩσFIGURE A.8. Multiply connected doma<strong>in</strong>.entire connected boundary of Ω consist<strong>in</strong>g of C 0 <strong>and</strong> all the contours C i ,described <strong>in</strong> a direction such that the <strong>in</strong>terior po<strong>in</strong>ts of Ω lie to the left ofC. Next, we <strong>in</strong>troduce a polygonal path l 1 , consist<strong>in</strong>g of a f<strong>in</strong>ite numberof l<strong>in</strong>e segments jo<strong>in</strong>ed end to end, to connect the outer contour C 0 to the<strong>in</strong>ner contour C 1 . We <strong>in</strong>troduce another polygonal path l 2 that connectsC 1 to C 2 ; <strong>and</strong> we cont<strong>in</strong>ue <strong>in</strong> this manner, with l n+1 connect<strong>in</strong>g C n to C 0 .As <strong>in</strong>dicated <strong>in</strong> Figure A.8, two simple closed contours Γ 1 <strong>and</strong> Γ 2 can beformed, each consist<strong>in</strong>g of polygonal paths l i or −l i <strong>and</strong> pieces of C 0 <strong>and</strong>C i . Then, if f is analytic throughout Ω, the Cauchy Integral Theorem canbe applied to f on Γ 1 <strong>and</strong> Γ 2 , <strong>and</strong> the sum of the <strong>in</strong>tegrals over those contoursis found to be zero. S<strong>in</strong>ce the <strong>in</strong>tegrals <strong>in</strong> opposite directions alongeach path l i cancel, only the <strong>in</strong>tegral along C rema<strong>in</strong>s. Thus,∫f(s) ds = 0.CThe follow<strong>in</strong>g examples show how the extension of Theorem A.5.3 tothe boundary of a multiply connected doma<strong>in</strong> can be used.Example A.5.3. Let C 0 be a simple closed piecewise smooth contour ly<strong>in</strong>gon the <strong>in</strong>terior of a simple closed piecewise smooth contour C, whereboth C <strong>and</strong> C 0 are equally oriented, e.g., <strong>in</strong> the counter-clockwise direction(Figure A.9). Let f be analytic <strong>in</strong> the closed region bounded by C <strong>and</strong>


A.5 Ma<strong>in</strong> Integral Theorems 295C 0 . Then, the extension of the Cauchy Integral Theorem to its boundarygives that the <strong>in</strong>tegral of f around the outer contour, C, m<strong>in</strong>us the <strong>in</strong>tegralaround the <strong>in</strong>terior contour, C 0 , must equal zero, i.e.,∮C∮f(s) ds = f(s) ds.C 0Example A.5.4. When <strong>in</strong>tegration on closed contours is extended to functionshav<strong>in</strong>g isolated s<strong>in</strong>gularities, the value of the <strong>in</strong>tegral is not zero, <strong>in</strong>general, but each s<strong>in</strong>gularity contributes a term called the residue. 12Say that f(s) can be exp<strong>and</strong>ed as◦f(s) =c −1s − s 0+ c 0 + c 1 (s − s 0 ) + c 2 (s − s 0 ) 2 + · · · .(A.34)The number c −1 is called the residue of f at s 0 .jωCC 0s 0ρΩσFIGURE A.9. Region for Example A.5.4.Now consider the region Ω with boundary consist<strong>in</strong>g of the piecewisesmooth contour C <strong>and</strong> the smaller circle C 0 centered at s 0 , as shown <strong>in</strong>Figure A.9. S<strong>in</strong>ce f(s) is analytic on Ω, Example A.5.3 shows that the <strong>in</strong>tegralaround the outer curve, C <strong>in</strong> Figure A.9, equals the <strong>in</strong>tegral aroundthe <strong>in</strong>ner circle, C 0 . The counter-clockwise circular <strong>in</strong>tegral around s 0 isj2πc −1 us<strong>in</strong>g (A.33). Thus, we may conclude that∮Cf(s) ds = j2πc −1 .(A.35)◦12 The residue is formally def<strong>in</strong>ed <strong>in</strong> §A.9.1; here we give only a preview that will motivateCauchy’s Integral Formula.


296 Appendix A. Review of Complex Variable TheoryA.5.4The Cauchy Integral FormulaA general problem of central importance <strong>in</strong> this book is that of relat<strong>in</strong>gthe values assumed by an analytic function with<strong>in</strong> a given region, to itsvalues on the boundary of the region. A remarkable tool for deal<strong>in</strong>g withthis problem is found <strong>in</strong> Cauchy’s Integral Formula.Theorem A.5.4 (Cauchy’s Integral Formula). Let f be analytic with<strong>in</strong> <strong>and</strong>on a simple closed piecewise smooth contour, C. If s 0 is any po<strong>in</strong>t <strong>in</strong>teriorto C, then∮f(s)ds = j2π f(s 0 ),(A.36)s − s 0Cwhere the <strong>in</strong>tegral is computed <strong>in</strong> the counter-clockwise direction.Proof. S<strong>in</strong>ce f is cont<strong>in</strong>uous at s 0 , given ε > 0 there is a δ > 0 such that|f(s) − f(s 0 )| < ε whenever |s − s 0 | < δ.Choose 0 < ρ < δ such that the counter-clockwise oriented circle |s − s 0 | =ρ, denoted by C 0 <strong>in</strong> Figure A.9, is <strong>in</strong>terior to C. Then|f(s) − f(s 0 )| < ε whenever |s − s 0 | = ρ. (A.37)Next, observe that the function f(s)/(s − s 0 ) is analytic at all po<strong>in</strong>ts with<strong>in</strong><strong>and</strong> on C except at s 0 . Hence, by the Cauchy Integral Theorem for multiplyconnected doma<strong>in</strong>s, its <strong>in</strong>tegral around the oriented boundary of theregion between C <strong>and</strong> C 0 has value zero, i.e.,∮Cf(s)s − s 0ds =∮C 0f(s)s − s 0ds.Subtract<strong>in</strong>g the constant term f(s 0 ) ∮ C 0ds/(s − s 0 ), which equals j2π f(s 0 )by (A.33), from both sides of the above equation yields∮∮f(s)f(s) − f(s 0 )ds − j2π f(s 0 ) =ds. (A.38)s − s 0 C 0s − s 0CReferr<strong>in</strong>g to (A.37) <strong>and</strong> not<strong>in</strong>g that the length of C 0 is 2πρ, we may applyproperty (A.16) to the RHS of (A.38), to obta<strong>in</strong>∮f(s) − f(s 0 )∣dsC 0s − s 0∣ < ε 2πρ = 2πε.ρFrom (A.38), then∮∣Cf(s)ds − j2π f(s 0 )s − s 0∣ < 2πε.S<strong>in</strong>ce the LHS of this <strong>in</strong>equality is a nonnegative number that is less thanan arbitrarily small positive number, it must be equal to zero, giv<strong>in</strong>g thedesired result.□


A.5 Ma<strong>in</strong> Integral Theorems 297We note from Theorem A.5.4 that the value f(s 0 ) can be obta<strong>in</strong>ed by <strong>in</strong>tegrat<strong>in</strong>gf(s)/(s − s 0 ) on a contour encircl<strong>in</strong>g s 0 . Hence we can determ<strong>in</strong>ethe value of an analytic function <strong>in</strong>side a region by its behavior on theboundary. We extensively exploit this result <strong>in</strong> the book to exam<strong>in</strong>e thecharacteristics that a function must have on the boundary (typically theimag<strong>in</strong>ary axis) when it is known to achieve certa<strong>in</strong> values <strong>in</strong> the <strong>in</strong>terior.The Cauchy <strong>in</strong>tegral formula can be extended to cases <strong>in</strong> which the simpleclosed contour C is replaced by the oriented boundary of a multiplyconnected doma<strong>in</strong>, as described <strong>in</strong> §A.5.3. The follow<strong>in</strong>g example showshow this can be accomplished.Example A.5.5. Let C 0 <strong>and</strong> C 1 be two counter-clockwise oriented, concentriccircles, where C 1 is smaller than C 0 (Figure A.10). Assume that f is an-C 0C 1γs 0FIGURE A.10. Annular doma<strong>in</strong> for Example A.5.5.alytic on both circles <strong>and</strong> throughout the annular doma<strong>in</strong> between them.Let s 0 be a po<strong>in</strong>t <strong>in</strong>side the annulus <strong>and</strong> construct a counter-clockwiseoriented circle γ about s 0 , small enough to be completely conta<strong>in</strong>ed <strong>in</strong>the annular doma<strong>in</strong>, as shown <strong>in</strong> Figure A.10. It then follows from theadaptation of the Cauchy <strong>in</strong>tegral theorem to the boundary of a multiplyconnected doma<strong>in</strong> that∮C 0f(s)s − s 0ds −∮C 1∮f(s) f(s)ds − ds = 0.s − s 0 γ s − s 0But, accord<strong>in</strong>g to Cauchy’s <strong>in</strong>tegral formula, the value of the third <strong>in</strong>tegralabove is 2πj f(s 0 ). Hence,f(s 0 ) = 1 ∮2πj C 0f(s)ds − 1 ∮s − s 0 2πj C 1f(s)s − s 0ds.(A.39)◦In the follow<strong>in</strong>g section we discuss an important application of the Cauchy’s<strong>in</strong>tegral theorem <strong>and</strong> formula.


298 Appendix A. Review of Complex Variable TheoryA.6 The Poisson Integral FormulaAn application of the <strong>in</strong>tegral theorems of the previous section, central tothe purposes of this book, is found <strong>in</strong> the Poisson <strong>in</strong>tegral formula for boththe half plane <strong>and</strong> the unit disk.A.6.1Formula for the Half PlaneIn the case of the right half plane, we are faced with <strong>in</strong>tegration over acontour that becomes arbitrarily long. To deal with this, we will considera class of functions with restricted behavior at <strong>in</strong>f<strong>in</strong>ity, <strong>and</strong> then use thebound<strong>in</strong>g technique of (A.26) to estimate the <strong>in</strong>tegral over the <strong>in</strong>f<strong>in</strong>ite contour.In particular, given a function f, def<strong>in</strong>em(R) = sup |f(Re jθ )|, θ ∈ [−π/2, π/2]. (A.40)θThen f(s) is said to be of class R ifm(R)lim = 0. (A.41)R→∞ RFor example, the functions considered <strong>in</strong> Examples A.4.1 <strong>and</strong> A.4.3 are <strong>in</strong>this class. More generally, if f is analytic <strong>and</strong> of bounded magnitude <strong>in</strong> theCRHP, then f is of class R.The Poisson <strong>in</strong>tegral formula for the half plane is given <strong>in</strong> the follow<strong>in</strong>gresult.Theorem A.6.1 (Poisson Integral Formula for the Half Plane). Let f beanalytic <strong>in</strong> the CRHP <strong>and</strong> suppose that f is of class R. Let s 0 = σ 0 + jω 0be a po<strong>in</strong>t <strong>in</strong> the complex plane with σ 0 > 0. Thenf(s 0 ) = 1 π∫ ∞−∞σ 0f(jω)σ 2 0 + (ω 0 − ω) dω. 2(A.42)Proof. Let f be as <strong>in</strong> the statement of the theorem <strong>and</strong> let s 0 = σ 0 + jω 0 beany po<strong>in</strong>t such that σ 0 > 0. Consider the clockwise oriented semicircularcontour C shown <strong>in</strong> Figure A.11, where R is large enough so that s 0 is<strong>in</strong>terior to the contour. 13 Thus C consists of the segment s = jω, ω ∈[−R, R], together with the arc C R given by s = Re jθ , θ ∈ [−π/2, π/2].S<strong>in</strong>ce f is analytic on <strong>and</strong> <strong>in</strong>side C, then Cauchy’s <strong>in</strong>tegral formula (A.36)givesf(s 0 ) = − 1 ∮2πj Cf(s)s − s 0ds.13 Recall that the <strong>in</strong>terior is the doma<strong>in</strong> bounded by the curve, <strong>and</strong> it is def<strong>in</strong>ed <strong>in</strong>dependentof the orientation.


A.6 The Poisson Integral Formula 299jωRσ−¯s 0 s 0C RFIGURE A.11. Contour for the Poisson <strong>in</strong>tegral formulaNow consider the po<strong>in</strong>t −s 0 , which is outside C. Thus, the Cauchy <strong>in</strong>tegraltheorem gives0 = 1 ∮f(s)ds.2πj C s + s 0Add<strong>in</strong>g the above two equations, we obta<strong>in</strong>f(s 0 ) = − 12πj∮Cs 0 + s 0f(s)ds, (A.43)(s − s 0 )(s + s 0 )which can be decomposed <strong>in</strong>to the sum of two <strong>in</strong>tegrals as∫ R−Rf(s 0 ) = − 1 f(jω)π (jω − s 0 )(jω + s 0 ) dω− 1 ∫(A.44)σ 0f(s)πj C R(s − s 0 )(s + s 0 ) ds.As R → ∞, the first <strong>in</strong>tegral <strong>in</strong> (A.44) becomes1π∫ ∞−∞σ 0σ 0f(jω)σ 2 0 + (ω 0 − ω) dω. 2(A.45)Compar<strong>in</strong>g to (A.42), it thus rema<strong>in</strong>s to show that the second <strong>in</strong>tegral <strong>in</strong>(A.44) vanishes as R → ∞. Us<strong>in</strong>g (A.40) <strong>and</strong> the fact that, for R sufficientlylarge, the denom<strong>in</strong>ator <strong>in</strong> the second <strong>in</strong>tegral has magnitude R 2 , we have∫∣ 1σ 0 ∣∣∣∣ f(s)πj C R(s − s 0 )(s + s 0 ) ds ≤ 1 πm(R) σ 0 π RR 2 ,which tends to zero as R → ∞ s<strong>in</strong>ce f is of class R <strong>and</strong> hence satisfies(A.41). The result then follows.□If f is analytic <strong>in</strong> the ORHP <strong>and</strong> on the imag<strong>in</strong>ary axis except for s<strong>in</strong>gularitiesof a particular type, then the Poisson <strong>in</strong>tegral formula is still valid,as shown next.


300 Appendix A. Review of Complex Variable TheoryLemma A.6.2. Let f be analytic <strong>in</strong> the CRHP, except for s<strong>in</strong>gular po<strong>in</strong>ts s kon the imag<strong>in</strong>ary axis that satisfylims→s kRe s≥0(s − s k )f(s) = 0. (A.46)Suppose further that f is of class R. Then the Poisson <strong>in</strong>tegral formula(A.42) holds at each complex po<strong>in</strong>t s 0 = σ 0 + jω 0 with σ 0 > 0.Proof. Consider the contour shown <strong>in</strong> Figure A.12, i.e., a semicircle of radiusR encircl<strong>in</strong>g the po<strong>in</strong>t s 0 <strong>and</strong> such that the portion of curve on theimag<strong>in</strong>ary axis has semicircular <strong>in</strong>dentations of radius δ <strong>in</strong>to the ORHP ateach s<strong>in</strong>gularity s k of f.jωs kδC δs 0RσC RFIGURE A.12. Contour for f with s<strong>in</strong>gularities on the jω-axis.We next proceed as <strong>in</strong> the proof of Theorem A.6.1, but <strong>in</strong> this case the<strong>in</strong>tegral (A.43) will be decomposed <strong>in</strong>to the sum of the <strong>in</strong>tegral over thesemicircular contour C R , the <strong>in</strong>tegrals over the small semicircular contoursof radii δ, <strong>and</strong> the <strong>in</strong>tegrals over the rema<strong>in</strong><strong>in</strong>g portions of the imag<strong>in</strong>aryaxis. This last sum of <strong>in</strong>tegrals over the portions of imag<strong>in</strong>ary axis betweenthe small semicircles will tend to (A.45) as R → ∞ <strong>and</strong> δ → 0. The <strong>in</strong>tegralover C R vanishes as <strong>in</strong> the proof of Theorem A.6.1. It thus rema<strong>in</strong>s to showthat the <strong>in</strong>tegrals over each semicircle of radius δ also vanish as δ → 0.Consider then one of the semicircles C δ <strong>in</strong> Figure A.12, centered at s k ,say. On this contour, s = s k + δe jθ , θ ∈ [−π/2, π/2]. Then∫∫f(s) σ π/20C δ(s − s 0 )(s + s 0 ) ds = j−π/2f(s k + δe jθ ) δe jθ σ 0(s k + δe jθ − s 0 )(s k + δe jθ + s 0 ) dθ.Note that (A.46) implies that lim δ→0 f(s k + δe jθ ) δe jθ = 0. Hence the <strong>in</strong>tegr<strong>and</strong>on the RHS above vanishes <strong>and</strong> the result follows.□Other forms of the Poisson formula are obta<strong>in</strong>ed by separat<strong>in</strong>g <strong>in</strong>to real<strong>and</strong> imag<strong>in</strong>ary parts. For example, if f(s) = u(σ, ω) + jv(σ, ω), then the


A.6 The Poisson Integral Formula 301formula (A.42) gives two real equations of the same structure, namelyu(σ 0 , ω 0 ) = 1 πv(σ 0 , ω 0 ) = 1 π∫ ∞u(0, ω)−∞∫ ∞v(0, ω)−∞σ 0σ 2 0 + (ω 0 − ω) 2 dω,σ 0σ 2 0 + (ω 0 − ω) 2 dω.(A.47)S<strong>in</strong>ce u <strong>and</strong> v are harmonic when f is analytic, each of the formulae (A.47)is the Poisson <strong>in</strong>tegral formula for harmonic functions.Note that the <strong>in</strong>tegrals <strong>in</strong> (A.47) are improper <strong>in</strong>tegrals of the formI =∫ ∞w(ω) dω.−∞Every such <strong>in</strong>tegral will be evaluated based on its Cauchy pr<strong>in</strong>cipal value,i.e.,∫ RI = lim w(ω) dω.R→∞ −RExistence of the Cauchy pr<strong>in</strong>cipal value of an <strong>in</strong>tegral does not, <strong>in</strong> general,guarantee the existence of the two limits lim R→∞ w(ω) dω <strong>and</strong>∫ 0−R∫ Rlim R→∞ w(ω) dω. However, if w(ω) is even, i.e., w(−ω) = w(ω), then0existence of the Cauchy pr<strong>in</strong>cipal value implies existence of these two limits.A useful result for a particular harmonic function is given next. 14Corollary A.6.3. Let f be analytic <strong>and</strong> nonzero <strong>in</strong> the CRHP except forpossible zeros on the imag<strong>in</strong>ary axis <strong>and</strong>/or zeros at <strong>in</strong>f<strong>in</strong>ity. Assume thatlog f is <strong>in</strong> class R. Then, at each complex po<strong>in</strong>t s 0 = σ 0 + jω 0 , σ 0 > 0,log |f(s 0 )| = 1 π∫ ∞−∞σ 0log |f(jω)|σ 2 0 + (ω 0 − ω) dω. 2(A.48)Proof. If f is as <strong>in</strong> the statement of the theorem, then log f is analytic <strong>in</strong> theCRHP except for s<strong>in</strong>gularities at the imag<strong>in</strong>ary zeros of f <strong>and</strong>/or at zerosof f at <strong>in</strong>f<strong>in</strong>ity. If f has imag<strong>in</strong>ary zeros s k , it is not difficult to prove thatlog f satisfies (A.46), <strong>and</strong> hence Lemma A.6.2 shows that these s<strong>in</strong>gularitiesdo not affect the Poisson <strong>in</strong>tegral. If f has zeros at <strong>in</strong>f<strong>in</strong>ity, then log fhas a s<strong>in</strong>gularity at <strong>in</strong>f<strong>in</strong>ity, but the contour of <strong>in</strong>tegration <strong>in</strong> Figure A.12has an <strong>in</strong>dentation around the po<strong>in</strong>t at <strong>in</strong>f<strong>in</strong>ity (i.e., the large semicircle<strong>in</strong>to the ORHP). The fact that log f is <strong>in</strong> class R shows that the <strong>in</strong>tegral onthis semicircle vanishes as the radius tends to <strong>in</strong>f<strong>in</strong>ity. Then (A.42) holdsfor log f(s) <strong>and</strong> (A.47) holds for log |f(s)| = Re log f(s).□14 This result, to the best of our knowledge, first appeared <strong>in</strong> Freudenberg <strong>and</strong> Looze (1985).


302 Appendix A. Review of Complex Variable TheoryNote that zeros of f at <strong>in</strong>f<strong>in</strong>ity can be treated as zeros of f on the imag<strong>in</strong>aryaxis, s<strong>in</strong>ce both types of zeros are <strong>in</strong> fact s<strong>in</strong>gularities of log f on thecontour of <strong>in</strong>tegration that encircles the ORHP. The procedure that we taketo deal with these s<strong>in</strong>gularities consists of two steps: first, <strong>in</strong>dentationsaround these s<strong>in</strong>gularities have to be made on the contour of <strong>in</strong>tegration;second, precautions should be taken to show that the <strong>in</strong>tegrals on those<strong>in</strong>dentations converge (go to zero <strong>in</strong> our case) as the <strong>in</strong>dentations vanish,i.e., condition (A.46) is assumed for imag<strong>in</strong>ary zeros, <strong>and</strong> the property oflog f be<strong>in</strong>g <strong>in</strong> class R is assumed for zeros at <strong>in</strong>f<strong>in</strong>ity.A.6.2Formula for the DiskLet s = re jθ , <strong>and</strong> consider the unit circle |s| = 1 described <strong>in</strong> a counterclockwisesense by s = e jθ , with −π ≤ θ ≤ π. We then have the follow<strong>in</strong>gresult.Theorem A.6.4 (Poisson Integral Formula for the Disk). If f is a functionanalytic on £ , then for any <strong>in</strong>terior po<strong>in</strong>t s 0 = r 0 e jθ 0, r 0 < 1,f(r 0 e jθ 0) = 1 ∫ πf(e jθ )2π −π1 − r 2 01 − 2r 0 cos(θ − θ 0 ) + r 2 dθ.0(A.49)Proof. Let f be a function analytic on £ . For any po<strong>in</strong>t s 0 = r 0 e jθ <strong>in</strong>teriorto £ , the Cauchy <strong>in</strong>tegral formula expresses f(s 0 ) asf(s 0 ) = 1 ∫ πf(e jθ )2π −π e jθ e jθ dθ.− s 0(A.50)s 1ss 0θ θ 01FIGURE A.13. The unit disk.Introduce the <strong>in</strong>verse of the po<strong>in</strong>t s 0 with respect to the unit circle, says 1 , which lies on the same ray from the orig<strong>in</strong> as does s 0 , but is exterior£ to , i.e., s 1 = 1/s 0 (see Figure A.13). It then follows from the Cauchy


A.6 The Poisson Integral Formula 303<strong>in</strong>tegral theorem that the <strong>in</strong>tegral <strong>in</strong> (A.50) equals 0 when s 0 is replacedby s 1 <strong>in</strong> the <strong>in</strong>tegr<strong>and</strong>, i.e.,0 = 1 ∫ πf(e jθ )2π −π e jθ e jθ dθ.− s 1(A.51)Subtract<strong>in</strong>g (A.51) from (A.50), <strong>and</strong> replac<strong>in</strong>g s 1 = 1/s 0 , yieldsf(s 0 ) = 1 ∫ π( ef(e jθ jθ)2π −π= 12π∫ πf(e jθ )−πe jθ −− s 0( ejθe jθ +− s 0)ejθe jθ dθ− s 1)s 0e −jθ dθ.− s 0The term with<strong>in</strong> parenthesis <strong>in</strong> the <strong>in</strong>tegr<strong>and</strong> above can be written ase jθe jθ − s 0+s 0e −jθ − s 0=1 − r 2 0|e jθ − r 0 e jθ 0 |21 − r 2 0=1 − 2 cos(θ − θ 0 ) + r 2 ,0where 1−2 cos(θ−θ 0 )+r 2 0> 0 represents the distance between the po<strong>in</strong>tss = e jθ <strong>and</strong> s 0 . Hence, the formula (A.49) follows.□Tak<strong>in</strong>g the real or imag<strong>in</strong>ary part of (A.49) we obta<strong>in</strong> the Poisson <strong>in</strong>tegralformula for harmonic functions on the unit circle.For our purposes, we will require a version of Theorem A.6.4 that reconstructsthe values of a harmonic function at any po<strong>in</strong>t outside the unitdisk from the values on the border. This follows as an easy corollary.Corollary A.6.5. If u is a function harmonic outside £ , then for any po<strong>in</strong>texterior to the unit disk s 0 = r 0 e jθ 0, r 0 > 1,u(r 0 e jθ 0) = 1 ∫ πu(e jθ )2π −πr 2 0 − 11 − 2r 0 cos(θ − θ 0 ) + r 2 dθ.0(A.52)Proof. Straightforward from Theorem A.6.4 on tak<strong>in</strong>g the real part of (A.49)<strong>and</strong> consider<strong>in</strong>g u(1/s).□Both versions of the Poisson <strong>in</strong>tegral formulae, for the half plane <strong>and</strong>the disk, were obta<strong>in</strong>ed follow<strong>in</strong>g the same procedure: the Cauchy <strong>in</strong>tegralformula was applied to a po<strong>in</strong>t <strong>in</strong>side the contour of <strong>in</strong>terest <strong>and</strong> thenadded to (subtracted from) the Cauchy <strong>in</strong>tegral theorem applied to a particularpo<strong>in</strong>t outside the contour of <strong>in</strong>terest. The reader may wonder if thesame procedure applied to other po<strong>in</strong>ts outside the contour would leadto other <strong>in</strong>terest<strong>in</strong>g relationships. The answer is <strong>in</strong>deed yes (see §2.3.2 <strong>in</strong>Chapter 2 for other applications).


304 Appendix A. Review of Complex Variable TheoryA.7 Power SeriesThis section gives three applications of the Cauchy <strong>in</strong>tegral formula thatare related with derivatives <strong>and</strong> series expansions of analytic functions.A.7.1Derivatives of Analytic FunctionsTheorem A.5.4 has an immediate application <strong>in</strong> show<strong>in</strong>g that an analyticfunction possesses derivatives of all orders, <strong>and</strong> these derivatives are themselvesanalytic, as we see next.Theorem A.7.1. Let f be analytic <strong>in</strong> an arbitrary doma<strong>in</strong> D. Then its derivativesof all orders exist <strong>in</strong> D <strong>and</strong> are analytic functions. Moreover, if C isany closed contour conta<strong>in</strong>ed <strong>in</strong> D, then the n-th derivative of f at anypo<strong>in</strong>t s 0 <strong>in</strong>side C is computed asf (n) (s 0 ) = n!2πj∮Cf(s)ds. (A.53)(s − s 0 )n+1Proof. Let f be analytic <strong>in</strong> a doma<strong>in</strong> D <strong>and</strong> let α be any po<strong>in</strong>t of D. We willshow that f has all derivatives at α. S<strong>in</strong>ce α is arbitrary, this will establishthe existence of all derivatives <strong>in</strong> D.If C is a sufficiently small circle with α as center (see Figure A.14), thenit follows from (A.36) that for any po<strong>in</strong>t s 0 <strong>in</strong>side Cf(s 0 ) = 12πj∮Cf(s)s − s 0ds,(A.54)where C is now traversed <strong>in</strong> the counter-clockwise direction. If (A.54) isdifferentiated formally n times with respect to s 0 , we havef (n) (s 0 ) = n! ∮f(s)ds. (A.55)2πj (s − s 0 )n+1We will next show the validity of (A.55).CCαs 0dFIGURE A.14. Contour for Theorem A.7.1.Let ∆ s be such that 0 < |∆ s | < d, where d is the shortest distance froms 0 to po<strong>in</strong>ts on C, as shown <strong>in</strong> Figure A.14. Us<strong>in</strong>g (A.54) for s 0 <strong>and</strong> s 0 +∆ s ,


A.7 Power Series 305we havef(s 0 + ∆ s ) − f(s 0 )= 1 ∮∆ s 2πj= 1 ∮2πjCC(1− 1 ) f(s)dss − s 0 − ∆ s s − s 0 ∆ sf(s)ds + J, (A.56)(s − s 0 )2whereJ = ∆ ∮sf(s)2πj C (s − s 0 − ∆ s )(s − s 0 ) 2 ds.Let f m be the maximum value of |f(s)| on C <strong>and</strong> let l be the length of C.S<strong>in</strong>ce |s − s 0 | ≥ d by construction, <strong>and</strong>|s − s 0 − ∆ s | ≥ ||s − s 0 | − |∆ s || ≥ d − |∆ s |,we readily obta<strong>in</strong> the follow<strong>in</strong>g bound for J <strong>in</strong> (A.56):|J| ≤|∆ s |f m l2πj(d − |∆ s |)d 2 ,where the last fraction approaches zero as ∆ s approaches zero. Tak<strong>in</strong>g limits<strong>in</strong> (A.56), it then follows thatf(s 0 + ∆ s ) − f(s 0 )lim= 1 ∮f(s)∆ s→0 ∆ s 2πj (s − s 0 ) 2 ds,<strong>and</strong> the expression (A.55) is then seen to hold for n = 1.Repeat<strong>in</strong>g the above procedure start<strong>in</strong>g with (A.55) for n = 1 establishesthe existence of f ′′ <strong>and</strong> proves (A.55) for n = 2. This shows that if f isanalytic so is f ′ . The result for f (n) then follows by <strong>in</strong>duction. □Note that, if we agree that f (0) (s) = f(s) <strong>and</strong> that 0! = 1, then the formula(A.53) for n = 0 is the Cauchy <strong>in</strong>tegral formula.In particular, when a functionCf(s) = u(σ, ω) + jv(σ, ω)is analytic <strong>in</strong> a doma<strong>in</strong> D, the analyticity of f ′ ensures the cont<strong>in</strong>uity of f ′there. Then, s<strong>in</strong>cef ′ = ∂u∂σ + j ∂v∂σ = ∂v∂ω − j ∂u∂ω ,Theorem A.2.2 shows that the first order partial derivatives of u <strong>and</strong> v arecont<strong>in</strong>uous <strong>in</strong> D. Similarly, Theorem A.7.1 shows that the partial derivativesof u <strong>and</strong> v of all orders are cont<strong>in</strong>uous <strong>in</strong> D. This result was anticipated<strong>in</strong> §A.3.1 <strong>in</strong> the discussion of harmonic functions.As was the case with the Cauchy <strong>in</strong>tegral formula, (A.53) can be extendedto the case <strong>in</strong> which the circle C is replaced by the oriented boundaryof a multiply connected doma<strong>in</strong>.


306 Appendix A. Review of Complex Variable TheoryA.7.2Taylor SeriesWe will next give a result that adapts the familiar Taylor series expansionfrom calculus, to functions of a complex variable.We briefly review some term<strong>in</strong>ology. The partial sums of a power seriesa 0 + a 1 (s − s 0 ) + a 2 (s − s 0 ) 2 + · · · ,where the a i are complex numbers, are def<strong>in</strong>ed byΣ n (s) = a 0 + a 1 (s − s 0 ) + a 2 (s − s 0 ) 2 + · · · + a n (s − s 0 ) n .The partial sums form a sequence of polynomials{Σn (s) } , n = 0, 1, 2, · · · .Let f(s) <strong>and</strong> a sequence of functions { Σ n (s) } be given <strong>in</strong> a region Ω ofthe complex plane. Then the sequence is said to converge uniformly to thefunction f <strong>in</strong> Ω if, given any ε > 0, there exists an <strong>in</strong>teger N, which c<strong>and</strong>epend on ε (but not on s ∈ Ω), such that|f(s) − Σ n (s)| < ε, for n ≥ N <strong>and</strong> s <strong>in</strong> Ω. (A.57)The uniform convergence of { Σ n (s) } to f <strong>in</strong> a region Ω is sometimes stated<strong>in</strong> the formf(s) = limn→∞ Σ n(s), uniformly <strong>in</strong> the region Ω. (A.58)In particular, if Σ n (s) are the partial sums of a power series, (A.58) is sometimeswritten as∞∑f(s) = a k (s − s 0 ) k , uniformly <strong>in</strong> the region Ω.k=0Theorem A.7.2 (Taylor Series). Let f(s) be analytic <strong>in</strong> a doma<strong>in</strong> D. Let s 0be <strong>in</strong> D <strong>and</strong> let R be the radius of the largest circle with center at s 0 <strong>and</strong>hav<strong>in</strong>g its <strong>in</strong>terior <strong>in</strong> D. Then the power series∞∑f(s) = a k (s − s 0 ) k ,k=0(A.59)converges uniformly to f(s) for all s such that |s − s 0 | ≤ r < R. The coefficients<strong>in</strong> (A.59) are given bya k = f(k) (s 0 )k!= 1 ∮f(s)ds, k = 0, 1, · · · , (A.60)2πj C 1(s − s 0 )k+1where C 1 is any circle with center at s 0 <strong>and</strong> radius r 1 such that r < r 1 < R.


A.7 Power Series 307C 1ss 0r 1rDRFIGURE A.15. Contours for Theorem A.7.2.Proof. Let R be as <strong>in</strong> the statement of the theorem <strong>and</strong> let s be a po<strong>in</strong>t suchthat |s − s 0 | ≤ r < R. Pick r 1 such that r < r 1 < R, def<strong>in</strong><strong>in</strong>g a circle C 1(Figure A.15).S<strong>in</strong>ce s is <strong>in</strong>terior to C 1 <strong>and</strong> f is analytic on <strong>and</strong> with<strong>in</strong> C 1 , Cauchy’s<strong>in</strong>tegral formula (A.36) holds, i.e.,f(s) = 1 ∮f(ξ)2πj C 1ξ − s dξ ,(A.61)where the contour is traversed counter-clockwise.Next notice that we can write1ξ − s = 1 1ξ − s 01 − s − s ,0ξ − s 0 ∣ which, for all s such thats − s 0 ∣∣∣∣ ≤ r < 1, converges uniformly toξ − s 0 r 1∞1ξ − s = ∑ (s − s 0 ) k. (A.62)(ξ − s 0 )k+1k=0Substitut<strong>in</strong>g the RHS of (A.62) <strong>in</strong>to (A.61) yieldsf(s) = 1 ∮f(ξ)2πj C 1∞∑k=0(s − s 0 ) kdξ .(ξ − s 0 )k+1Due to the uniform convergence of (A.62) <strong>in</strong> |s − s 0 | ≤ r, we can <strong>in</strong>terchange<strong>in</strong>tegration <strong>and</strong> summation to obta<strong>in</strong>f(s) =∞∑k=0( ∮)1 f(ξ)2πj C 1(ξ − s 0 ) k+1 dξ (s − s 0 ) k .


308 Appendix A. Review of Complex Variable TheoryNote that, us<strong>in</strong>g (A.55), the coefficient between parenthesis correspondsto f (k) (s 0 )/k!. We then havef(s) =∞∑k=0f (k) (s 0 )(s − s 0 ) k , uniformly <strong>in</strong> |s − s 0 | ≤ r < R,k!thus prov<strong>in</strong>g the theorem.□Example A.7.1. Let f(s) = cos s. The derivatives are − s<strong>in</strong> s, − cos s, s<strong>in</strong> s,cos s, etc., <strong>and</strong> hence f(0) = 1, f ′ (0) = 0, f ′′ (0) = −1, f ′′′ (0) = 0. S<strong>in</strong>cef (iv) (s) = f(s), the sequence repeats, so thatcos s = 1 − s22! + s44! − s66! + · · · , (A.63)is the Taylor series of f(s) = cos s about the po<strong>in</strong>t s 0 = 0. S<strong>in</strong>ce cos s isanalytic <strong>in</strong> |s| < R for every R, Theorem A.7.2 shows that the convergenceis uniform <strong>in</strong> |s| ≤ r for each fixed r < ∞.Similarly,e s = 1 + s + s22! + s33! + · · · , (A.64)s<strong>in</strong> s = s − s33! + s55! − · · · , (A.65)uniformly for |s| ≤ r < ∞.◦The Taylor series shows that the values f (k) (s 0 ), k = 0, 1, · · · , at a po<strong>in</strong>ts 0 of a doma<strong>in</strong> D <strong>in</strong> which f is analytic, determ<strong>in</strong>e f(s) <strong>in</strong> a disk |s − s 0 |


A.7 Power Series 309(Figure A.16). Let f be analytic on both circles <strong>and</strong> throughout the annulardoma<strong>in</strong> between them. Then the Laurent serieswhere<strong>and</strong>f(s) = f 1 (s) + f 2 (s) ∞∑∞∑a k (s − s 0 ) k b k+(s − s 0 ) k , (A.66)k=0k=1a k = 1 ∮f(s)ds, k = 0, 1, · · · , (A.67)2πj C 0(s − s 0 )k+1b k = 1 ∮f(s)ds, k = 1, 2, · · · , (A.68)2πj C 1(s − s 0 )−k+1converges uniformly to f(s) <strong>in</strong> any closed annulus conta<strong>in</strong>ed <strong>in</strong> the doma<strong>in</strong>enclosed between C 0 <strong>and</strong> C 1 .Proof. Let s be any po<strong>in</strong>t <strong>in</strong>terior to the annular doma<strong>in</strong> between C 0 <strong>and</strong>C 1 , as shown <strong>in</strong> Figure A.16. It then follows, as <strong>in</strong> Example A.5.5, thatf(s) = 1 ∮f(ξ)2πj C 0ξ − s dξ + 1 ∮−f(ξ)2πj C 1ξ − s dξ.(A.69)Select a closed annulus with <strong>in</strong>ternal radius r 1 <strong>and</strong> external radius r, con-C 0s 0C 1r 1CsrFIGURE A.16. Contours for Theorem A.7.3.ta<strong>in</strong>ed <strong>in</strong> the doma<strong>in</strong> enclosed between C 0 <strong>and</strong> C 1 , <strong>and</strong> such that r 1 ≤|s − s 0 | ≤ r, as depicted <strong>in</strong> Figure A.16.We proved <strong>in</strong> Theorem A.7.2 that the first <strong>in</strong>tegral <strong>in</strong> (A.69) convergesto f 1 (s) <strong>in</strong> (A.66) uniformly for all s <strong>in</strong> |s − s 0 | ≤ r.As for the second <strong>in</strong>tegral, we note that −1/(ξ − s) = 1/(s − ξ). Thus,<strong>in</strong>terchang<strong>in</strong>g s <strong>and</strong> ξ <strong>in</strong> (A.62), <strong>and</strong> follow<strong>in</strong>g the proof of Theorem A.7.2,we show that the second <strong>in</strong>tegral <strong>in</strong> (A.69) converges to f 2 (s) <strong>in</strong> (A.66)uniformly for all s <strong>in</strong> |s − s 0 | ≥ r 1 .


310 Appendix A. Review of Complex Variable TheoryIt follows that (A.69) converges uniformly to f(s) = f 1 (s) + f 2 (s) <strong>in</strong>(A.66) for all s <strong>in</strong> the annulus r 1 ≤ |s − s 0 | ≤ r, thus prov<strong>in</strong>g the theorem.□Note that the two <strong>in</strong>tegr<strong>and</strong>s f(s)/(s − s 0 ) k+1 <strong>and</strong> f(s)/(s − s 0 ) −k+1<strong>in</strong> the coefficient expressions (A.67) <strong>and</strong> (A.68) are analytic throughoutthe annular doma<strong>in</strong> R 1 < |s − s 0 | < R 0 <strong>and</strong> on its boundary. Then, as<strong>in</strong> Example A.5.3, any simple closed piecewise smooth contour C aroundthat doma<strong>in</strong> <strong>in</strong> the counter-clockwise direction, as shown <strong>in</strong> Figure A.16,can be used as a path of <strong>in</strong>tegration <strong>in</strong> place of the circular paths C 0 <strong>and</strong>C 1 . Thus, the Laurent series (A.66) can be written as∞∑f(s) = c k (s − s 0 ) k ,(A.70)wherek=−∞c k = 1 ∮f(s)ds, k = 0, ±1, ±2, · · · . (A.71)2πj C (s − s 0 )k+1Correspond<strong>in</strong>g to the decomposition f = f 1 + f 2 <strong>in</strong> (A.66), are the twoparts of the Laurent series, namely,f 1 (s) =∞∑a k (s − s 0 ) k , f 2 (s) =k=0∞∑k=1b k(s − s 0 ) k .The function f 1 (s) <strong>in</strong>volves nonnegative powers of s − s 0 <strong>and</strong> is calledthe regular part of f(s) at s 0 . The function f 2 (s) <strong>in</strong>volves negative powersof s − s 0 <strong>and</strong> is called the s<strong>in</strong>gular or pr<strong>in</strong>cipal part of f(s) at s 0 . In thefollow<strong>in</strong>g section, we show that there is an <strong>in</strong>timate connection betweenthe pr<strong>in</strong>cipal part <strong>and</strong> the nature of the s<strong>in</strong>gularity at s 0 .A.8 S<strong>in</strong>gularitiesA.8.1Isolated S<strong>in</strong>gularitiesIn this subsection we exam<strong>in</strong>e functions that are analytic <strong>in</strong> a punctureddisk (an open disk with the center removed). We will then consider thatthe center of the disk is an isolated s<strong>in</strong>gularity <strong>and</strong> classify it accord<strong>in</strong>g tothe behavior of the function near that po<strong>in</strong>t.Denote by B(s 0 , R) the disk of center s 0 <strong>and</strong> radius R, i.e., B(s 0 , R) = {s :|s − s 0 | < R}. We then have the follow<strong>in</strong>g def<strong>in</strong>ition.Def<strong>in</strong>ition A.8.1 (Isolated S<strong>in</strong>gularities). A function f has an isolated s<strong>in</strong>gularityat s = s 0 if there is an R > 0 such that f is def<strong>in</strong>ed <strong>and</strong> analytic <strong>in</strong>B(s 0 , R) − {s 0 } but not <strong>in</strong> B(s 0 , R).Let s = s 0 be an isolated s<strong>in</strong>gularity of f. Then:


A.8 S<strong>in</strong>gularities 311(i) s = s 0 is called a removable s<strong>in</strong>gularity if there is an analytic functiong : B(s 0 , R) → ¢ such that g(s) = f(s) for 0 < |s − s 0 | < R;(ii) s = s 0 is called a pole if lim s→s0 |f(s)| = ∞, that is, for any K > 0 thereis a number ε > 0 such that |f(s)| ≥ K whenever 0 < |s − s 0 | < ε;(iii) s = s 0 is called an essential s<strong>in</strong>gularity if s 0 is neither a pole nor aremovable s<strong>in</strong>gularity.◦An alternative def<strong>in</strong>ition of a pole is as follows: an isolated s<strong>in</strong>gularitys = s 0 is called a pole of f iff(s) =g(s)(s − s 0 ) m , (A.72)where m ≥ 1 is an <strong>in</strong>teger, g(s) is analytic <strong>in</strong> a neighborhood of s 0 <strong>and</strong>g(s 0 ) ≠ 0. This def<strong>in</strong>ition <strong>and</strong> (ii) are equivalent <strong>and</strong> can be derived fromeach other. Also, if f has a pole at s = s 0 <strong>and</strong> m is the smallest positive<strong>in</strong>teger such that (s − s 0 ) m f(s) has a removable s<strong>in</strong>gularity at s = s 0 , thenf has a pole of order m at s = s 0 .In the follow<strong>in</strong>g theorem, we show how the Laurent series expansion isused to classify isolated s<strong>in</strong>gularities.Theorem A.8.1. Let s = s 0 be an isolated s<strong>in</strong>gularity of f <strong>and</strong> let f(s) =∑ ∞k=−∞ c k(s − s 0 ) k be its Laurent series expansion <strong>in</strong> 0 < |s − s 0 | < R.Then:(i) s = s 0 is a removable s<strong>in</strong>gularity if <strong>and</strong> only if c k = 0 for k ≤ −1;(ii) s = s 0 is a pole if <strong>and</strong> only if c −m ≠ 0 <strong>and</strong> c k = 0 for k ≤ −(m + 1);(iii) s = s 0 is an essential s<strong>in</strong>gularity if <strong>and</strong> only if c k ≠ 0 for <strong>in</strong>f<strong>in</strong>itelymany negative <strong>in</strong>tegers k.Proof. (i) If c k = 0 for k ≤ −1 then let g(s) be def<strong>in</strong>ed <strong>in</strong> B(s 0 , R) byg(s) = ∑ ∞k=0 c k(s − s 0 ) k . Thus, g is analytic <strong>and</strong> agrees with f <strong>in</strong> thepunctured disk 0 < |s − s 0 | < R. Thus s 0 is a removable s<strong>in</strong>gularityaccord<strong>in</strong>g to Def<strong>in</strong>ition A.8.1. The converse is equally as easy.(ii) Suppose that c k = 0 for k ≤ −(m + 1). Then (s − s 0 ) m f(s) has aLaurent expansion that has no negative powers of s − s 0 . By part (i),(s − s 0 ) m f(s) has a removable s<strong>in</strong>gularity at s = s 0 . The converse isestablished by an equally straightforward argument.(iii) S<strong>in</strong>ce f has an essential s<strong>in</strong>gularity at s = s 0 when it has neithera removable s<strong>in</strong>gularity nor a pole, part (iii) follows from parts (i)<strong>and</strong> (ii).□


312 Appendix A. Review of Complex Variable TheoryWe next give some examples of the different types of s<strong>in</strong>gularities.Example A.8.1. The function f(s) = s<strong>in</strong> s/s has a removable s<strong>in</strong>gularity ats = 0. Indeed, us<strong>in</strong>g L’Hospital’s rule (e.g., Lev<strong>in</strong>son <strong>and</strong> Redheffer, 1970,p. 152) we have thats<strong>in</strong> slim = lim cos s = 1,s→0 s s→0<strong>and</strong> hence f(s) = g(s) for |s| > 0, where g : ¢ → ¢ is the analytic functiondef<strong>in</strong>ed as⎧⎨s<strong>in</strong> sif s ≠ 0,g(s) = s⎩1 if s = 0.Example A.8.2. As an example of poles, consider the rational function(A.10) <strong>in</strong> Example A.3.3. Assume that the numbers q 1 , · · · , q m , p 1 , · · · , p nare all different. Then H has simple (i.e., of order one) poles at the n zerosof its denom<strong>in</strong>ator.◦Example A.8.3. The function e 1/s has an essential s<strong>in</strong>gularity at s = 0.Indeed, s<strong>in</strong>ce for real positive σ, with ν = 1/σ, <strong>and</strong> for all m > 0,limσ→0 σm e 1/σ e ν= limν→∞ ν m = ∞,it follows that e 1/s is not bounded at s = 0 nor can it have a pole of orderm for any m at s = 0. Hence it has an essential s<strong>in</strong>gularity.◦If f has a removable s<strong>in</strong>gularity at s = s 0 , it follows from the def<strong>in</strong>itionthat f can be redef<strong>in</strong>ed to be analytic <strong>in</strong> the disk B(s 0 , R) by assign<strong>in</strong>g avalue to f at s = s 0 . Because an analytic function is cont<strong>in</strong>uous, it is clearthat we need only def<strong>in</strong>e f(s 0 ) so as to make f cont<strong>in</strong>uous at s = s 0 . Thisimplies that lim s→s0 f(s) must exist (see Example A.8.1).If f has a pole at s = s 0 , then lim s→s0 |f(s)| exists <strong>and</strong> equals <strong>in</strong>f<strong>in</strong>ity.On the other h<strong>and</strong>, f has an essential s<strong>in</strong>gularity at s = s 0 when thelimit lim s→s0 |f(s)| fails to exist. Moreover, a remarkable feature of essentials<strong>in</strong>gularities is that the function f <strong>in</strong> any neighborhood of an essentials<strong>in</strong>gularity assumes all values except possibly one. This fact is known asPicard’s theorem (e.g., Conway, 1973). For example, given any complexnumber γ ≠ 0 <strong>and</strong> any small δ > 0, then it is easy to show that e 1/s takeson the value γ an <strong>in</strong>f<strong>in</strong>ite number of times <strong>in</strong> 0 < |s| < δ.We end this section with a discussion of s<strong>in</strong>gularities at <strong>in</strong>f<strong>in</strong>ity. Thebehavior of a function f(s) at s = ∞ is def<strong>in</strong>ed by consider<strong>in</strong>g the behaviorof f(1/ξ) at the po<strong>in</strong>t ξ = 0. For example, f(s) is cont<strong>in</strong>uous at s = ∞ iff(1/ξ) is cont<strong>in</strong>uous at ξ = 0. Let f(s) be analytic for R < |s| < ∞ but notanalytic <strong>in</strong> R < |s| ≤ ∞. Then, by us<strong>in</strong>g s = 1/ξ <strong>and</strong> consider<strong>in</strong>g ξ = 0, it◦


A.8 S<strong>in</strong>gularities 313follows that the po<strong>in</strong>t s = ∞ is an isolated s<strong>in</strong>gular po<strong>in</strong>t. This may be aremovable s<strong>in</strong>gularity, a pole, or an essential s<strong>in</strong>gularity of f.As a f<strong>in</strong>al remark, notice that the Laurent expansion of a function f(s)at s = ∞ is computed by first obta<strong>in</strong><strong>in</strong>g the Laurent expansion of thefunction g(ξ) = f(1/ξ) at ξ = 0, <strong>and</strong> then evaluat<strong>in</strong>g this series at ξ = 1/s.Example A.8.4. Consider a function f analytic at s = ∞. Then, the functiong(ξ)∑= f(1/ξ) is analytic at ξ = 0, <strong>and</strong> so admits a Taylor series g(ξ) =∞k=0 a kξ k that converges uniformly <strong>in</strong> |ξ| ≤ r for some r > 0. Therefore,f is represented, uniformly <strong>in</strong> |s| ≥ 1/r, by the (Laurent) series expansionat s = ∞0∑f(s) = c k s k ,k=−∞where c −k = a k . Notice that, as opposed to the case of a function analyticat a f<strong>in</strong>ite po<strong>in</strong>t, this series has only nonpositive powers of s. This is preciselybecause, <strong>in</strong> def<strong>in</strong><strong>in</strong>g regular <strong>and</strong> pr<strong>in</strong>cipal parts at <strong>in</strong>f<strong>in</strong>ity, the rolesof positive <strong>and</strong> negative powers of s are <strong>in</strong>terchanged.◦A.8.2Branch Po<strong>in</strong>tsA special k<strong>in</strong>d of s<strong>in</strong>gularity arises when multiple-valued functions are <strong>in</strong>volved.A particular function of this type, which is of central <strong>in</strong>terest <strong>in</strong>this book, is the logarithmlog s = log |s| + j arg s.The function log s is multiple-valued due to the presence of arg s. Indeed,at every po<strong>in</strong>t s <strong>in</strong> the complex plane arg s has <strong>in</strong>f<strong>in</strong>itely many values differ<strong>in</strong>gby 2πk, with k = 0, ±1, ±2, . . ..The function log s can be def<strong>in</strong>ed to be s<strong>in</strong>gle-valued by restrict<strong>in</strong>g its doma<strong>in</strong>;for example, by lett<strong>in</strong>g −π < arg s < π. The function so obta<strong>in</strong>ed iscalled the pr<strong>in</strong>cipal branch of log s. Other branches are obta<strong>in</strong>ed by consider<strong>in</strong>gdifferent restrictions on arg s, i.e., for k = ±1, ±2, . . .. Any of thesebranches can be shown to be analytic <strong>in</strong> their doma<strong>in</strong> of def<strong>in</strong>ition, i.e.,the extended complex plane with the negative real axis deleted (<strong>in</strong>clud<strong>in</strong>g<strong>in</strong>f<strong>in</strong>ity). Such a doma<strong>in</strong> is called a cut plane <strong>and</strong> the deleted portionis a branch cut. Every po<strong>in</strong>t of a branch cut is a s<strong>in</strong>gularity of the branchfunction (s<strong>in</strong>ce the function cannot be def<strong>in</strong>ed to be cont<strong>in</strong>uous there) <strong>and</strong>,moreover, it is nonisolated.A s<strong>in</strong>gular po<strong>in</strong>t common to any branch cut is called a branch po<strong>in</strong>t. Forexample, the branch po<strong>in</strong>ts of log s are s = 0 <strong>and</strong> s = ∞. Special care mustbe taken when perform<strong>in</strong>g contour <strong>in</strong>tegration of functions with branchpo<strong>in</strong>ts, as we will see <strong>in</strong> §A.9.2.


314 Appendix A. Review of Complex Variable TheoryExample A.8.5. The function log H considered <strong>in</strong> Example A.3.4 has branchpo<strong>in</strong>ts at the zeros <strong>and</strong> poles of the transfer function H, i.e., at the zeros ofN <strong>and</strong> D.◦Example A.8.6. S<strong>in</strong>ce the pr<strong>in</strong>cipal branch of log s is analytic <strong>in</strong> its cutplane, it admits a Taylor series expansion around any <strong>in</strong>terior po<strong>in</strong>t s 0 .Then, s<strong>in</strong>ced k(k − 1)!log s = (−1)k−1dsk s k , k = 1, 2, · · · ,it follows from (A.59) <strong>and</strong> (A.60), with s 0 = 1, that∞∑k−1 (s − 1)klog s = (−1) . (A.73)kk=0The distance between the po<strong>in</strong>t s 0 = 1 <strong>and</strong> the boundary of the cut planeequals 1, <strong>and</strong> thus the expansion (A.73) is valid for |s−1| < 1. Replac<strong>in</strong>g s−1 by s we obta<strong>in</strong>ed the follow<strong>in</strong>g series for log(1 + s), called the logarithmicseries:∞∑k−1 (s − 1)klog(1 + s) = (−1) , <strong>in</strong> |s| < 1. (A.74)kk=0A useful observation that is used <strong>in</strong> Chapter 3 is obta<strong>in</strong>ed by not<strong>in</strong>g that,if |s| < 1 then∣ log(1 + s)∣∣∣∣1 − s ∣ = 12 s − 1 3 s2 + · · ·∣≤ 1 (|s| + |s| 2 + · · · )2= 1 |s|2 1 − |s| .If we further require that |s| < 1/2 then |1 − log(1 + s)/s| ≤ 1/2. This givesthat for |s| < 1/212 |s| ≤ | log(1 + s)| ≤ 3 2 |s|. ◦Example A.8.7. As an application of the logarithmic series, we will showthat the function f : ¢ × ¡ → ¢ , given byf(s, η) = η log(1 + s ),ηsatisfieslim f(s, η) = s,(A.75)η→∞uniformly on any compact set <strong>in</strong> ¢ . 1515 A compact set <strong>in</strong> is bounded.


A.9 Integration of Functions with S<strong>in</strong>gularities 315Indeed, let s belong to a compact set ¢ <strong>in</strong> . Then there exists a f<strong>in</strong>iteconstant K > 0 such that |s| ≤ K for all s <strong>in</strong> the set. For η > K, the follow<strong>in</strong>gexpansion, of the form (A.74), holdslog(1 + s η)= s η − s22η 2 + · · · .(A.75) then follows on multiply<strong>in</strong>g both sides of the above series by η <strong>and</strong>tak<strong>in</strong>g the limit as η → ∞.◦A.9 Integration of Functions with S<strong>in</strong>gularitiesA.9.1Functions with Isolated S<strong>in</strong>gularitiesThe Cauchy <strong>in</strong>tegral theorem tells us that if a function is analytic at allpo<strong>in</strong>ts <strong>in</strong>terior to <strong>and</strong> on a simple closed contour C, the value of the <strong>in</strong>tegralof the function around that contour is zero. As we anticipated <strong>in</strong>Example A.5.4, if the function fails to be analytic at a f<strong>in</strong>ite number ofpo<strong>in</strong>ts <strong>in</strong>terior to C, there is a specific number, called the residue, whicheach of those po<strong>in</strong>ts contributes to the value of the <strong>in</strong>tegral.Let then f have an isolated s<strong>in</strong>gularity at s = s 0 <strong>and</strong> let∞∑f(s) = c k (s − s 0 ) kk=−∞be its Laurent series expansion. The residue of f at s = s 0 , denoted byRes s=s0 f(s), is def<strong>in</strong>ed as the coefficient c −1 , i.e.,Accord<strong>in</strong>g to (A.71), we have thatRess=s 0f(s) c −1 .Res f(s) = 1s=s 0 2πj∮Cf(s) ds,(A.76)(A.77)where the curve C encircles the po<strong>in</strong>t s = s 0 <strong>in</strong> the counter-clockwisedirection.The generalization of (A.77) to the case where f has a f<strong>in</strong>ite number ofs<strong>in</strong>gular po<strong>in</strong>ts <strong>in</strong>terior to C is given <strong>in</strong> the follow<strong>in</strong>g theorem.Theorem A.9.1 (Residue Theorem). Let C be a simple closed piecewisesmooth contour, counter-clockwise oriented. Let f be analytic on <strong>and</strong> with<strong>in</strong>C except for a f<strong>in</strong>ite number of s<strong>in</strong>gular po<strong>in</strong>ts s 1 , s 2 , · · · , s n <strong>in</strong>terior toC. If Res s=s1 f(s), Res s=s2 f(s), · · · , Res s=sn f(s) denote the residues of fat those respective po<strong>in</strong>ts, then∮n∑f(s) ds = 2πj Res f(s).(A.78)s=s kCk=1


316 Appendix A. Review of Complex Variable TheoryProof. Let the s<strong>in</strong>gular po<strong>in</strong>ts s 1 , s 2 , · · · , s n be centers of counter-clockwiseoriented circles C k , which are <strong>in</strong>terior to C, <strong>and</strong> are so small that no twoof the circles have po<strong>in</strong>ts <strong>in</strong> common, as shown <strong>in</strong> Figure A.17.jωCs 1C 1C 2s 2C n s nσFIGURE A.17. Contours for Theorem A.9.1.The circles C k together with the contour C form the boundary of aclosed region throughout which f is analytic <strong>and</strong> whose <strong>in</strong>terior is a multiplyconnected doma<strong>in</strong>. Hence, accord<strong>in</strong>g to the extension of the Cauchy<strong>in</strong>tegral theorem to such regions∮ ∮∮f(s) ds − f(s) ds − · · · − f(s) ds = 0.CC 1 C nbut this reduces to (A.78) by us<strong>in</strong>g (A.77) for each of the <strong>in</strong>tegrals aroundthe circles.□Residue at Inf<strong>in</strong>ityWe have just <strong>in</strong>troduced the def<strong>in</strong>ition of residue of a function at an isolateds<strong>in</strong>gular po<strong>in</strong>t. This def<strong>in</strong>ition could <strong>in</strong> fact be extended to any po<strong>in</strong>tthat is at the center of a punctured disk where the function is analytic —whether the function is analytic or not at the po<strong>in</strong>t itself. Evidently, theresidue of a function at a f<strong>in</strong>ite regular po<strong>in</strong>t is zero.Analogously, we can def<strong>in</strong>e the residue of a function at s = ∞ asRes f(s) = 1s=∞ 2πj∮f(s)ds,(A.79)where the <strong>in</strong>tegral is computed clockwise along a circle of radius R so largethat the only possible s<strong>in</strong>gularity of f <strong>in</strong> |s| ≥ R is the po<strong>in</strong>t s = ∞. Theclockwise direction is used because the po<strong>in</strong>t s = ∞ is thus “enclosed” bythe contour of <strong>in</strong>tegration. 1616 This assumes the convention that a counter-clockwise oriented curve “encloses” its <strong>in</strong>teriorwhereas a clockwise oriented curve “encloses” its exterior.


A.9 Integration of Functions with S<strong>in</strong>gularities 317Let the Laurent expansion at <strong>in</strong>f<strong>in</strong>ity of f bef(s) = · · · + c −ks k+ · · · + c −1s + c 0 + c 1 s + · · · ,uniformly convergent <strong>in</strong> R ≤ |s| < ∞. Then, <strong>in</strong> virtue of Example A.5.1,Ress=∞ f(s) = −c −1,(A.80)which is consistent with the def<strong>in</strong>ition of the residue at a f<strong>in</strong>ite po<strong>in</strong>t givenby (A.76).It is <strong>in</strong>terest<strong>in</strong>g to note that, for a function f hav<strong>in</strong>g a f<strong>in</strong>ite number, nsay, of isolated s<strong>in</strong>gularities, the sum of all the residues <strong>in</strong> the extendedcomplex plane is zero. Indeed, <strong>in</strong>tegrat<strong>in</strong>g f around a circle large enoughto conta<strong>in</strong> all f<strong>in</strong>ite s<strong>in</strong>gularities, the Residue Theorem together with (A.79)given∑Res f(s) = − Res f(s).(A.81)s=s k s=∞k=1Example A.9.1. Consider the functionf(s) = 1s − 1 .This function has a s<strong>in</strong>gle pole at s = 1 <strong>and</strong> it is analytic at s = ∞. Theresidue of f(s) at s = 1 is Res s=1 f(s) = 1. To obta<strong>in</strong> the residue at <strong>in</strong>f<strong>in</strong>ity,letg(ξ) = f(1/ξ) =ξ1 − ξ .The Laurent expansion of g(ξ) at ξ = 0 isg(ξ) =∞∑ξ k , <strong>in</strong> 0 < |ξ| < 1,k=1<strong>and</strong> so the Laurent expansion of f(s) at s = ∞ isf(s) =∞∑11, <strong>in</strong> 1 < |s| < ∞.sk The residue of f at s = ∞ is then Res s=∞ f(s) = −1.Notice that the residue at <strong>in</strong>f<strong>in</strong>ity can be nonzero even if the function isanalytic at s = ∞.We end this subsection with an example of application to SISO controlsystems.◦


318 Appendix A. Review of Complex Variable TheoryExample A.9.2. Let L be a proper rational function <strong>and</strong> consider the sensitivityfunction S = 1/(1 + L) (see e.g., (1.5) <strong>in</strong> Chapter 1). Assume furtherthat the closed-loop system is stable, i.e., the numerator of 1 + L is Hurwitz<strong>and</strong> L(∞) ≠ −1. We are <strong>in</strong>terested <strong>in</strong> evaluat<strong>in</strong>g the residue at <strong>in</strong>f<strong>in</strong>ityof the function log S, which is analytic at <strong>in</strong>f<strong>in</strong>ity by the assumptions ofclosed-loop stability <strong>and</strong> the properness of L.S<strong>in</strong>ce L is a proper rational function, it has a Laurent expansion at <strong>in</strong>f<strong>in</strong>ityof the formL(s) = L(∞) + c −1s+ c −2s 2 + · · · , (A.82)<strong>in</strong> |s| > r, for some r > 0. Then log S = − log(1 + L) can be written as[log S(s) = − log 1 + L(∞) + c −1Then= − log[1 + L(∞)] − log= log S(∞) − logs+ c ]−2s 2 + · · · [1 +c −11 + L(∞)1s +[1 + S(∞)c −1+ S(∞)c −2s s 2 + · · ·log S(s)S(∞) = − log[1 + ˜L(s)],]c −2 11 + L(∞) s 2 + · · · ].(A.83)where˜L(s) = S(∞)c −1+ S(∞)c −2s s 2 + · · · .The power series expansion of log(1 + s) for |s| < 1 is (see Example A.8.6)log(1 + s) = s − s2 2+ · · · , <strong>in</strong> |s| < 1.Us<strong>in</strong>g this expansion <strong>in</strong> (A.83) yieldslog S(s)S(∞) = −˜L(s) + ˜L 2 (s)+ · · · , <strong>in</strong> |˜L(s)| < 12= − S(∞)c −1s− S(∞)c −2s 2 + · · · , <strong>in</strong> |s| > ˜r,for some ˜r > r. We have thus obta<strong>in</strong>ed the Laurent expansion at <strong>in</strong>f<strong>in</strong>ity ofthe function log S, whose residue at <strong>in</strong>f<strong>in</strong>ity can be computed from (A.80)asS(s)Res log S(s) = Res logs=∞ s=∞ S(∞)= S(∞)c −1= 1S(∞) lim s[S(∞) − S(s)], (A.84)s→∞


A.9 Integration of Functions with S<strong>in</strong>gularities 319where the last l<strong>in</strong>e follows from (A.82) on not<strong>in</strong>g that[ 1c −1 = lim s[L(s) − L(∞)] = lim ss→∞ s→∞ S(s) − 1 ].S(∞)If L <strong>in</strong> (A.82) has relative degree one, thenRes log S(s) = lim s[1 − S(s)].s=∞ s→∞Alternatively, if L has relative degree two or more, thenRes log S(s) = 0.s=∞◦A.9.2Functions with Branch Po<strong>in</strong>tsWhen <strong>in</strong>tegration is to be performed around a region where the <strong>in</strong>tegr<strong>and</strong>has branch po<strong>in</strong>ts, the residue theorem cannot be used <strong>in</strong> the form of TheoremA.9.1 but an extended version that h<strong>and</strong>les this situation can bederived (Lev<strong>in</strong>son <strong>and</strong> Redheffer, 1970, Theorem 9.1). S<strong>in</strong>ce our requirementsof <strong>in</strong>tegration of functions with branch po<strong>in</strong>ts are limited to thelogarithm, we will concentrate on an example of contour <strong>in</strong>tegration ofthis function around a branch cut.Consider the semicircular contour of Figure A.6, <strong>and</strong> the <strong>in</strong>tegral on thiscontour of function log S, where S is a stable system transfer function. IfS(s) has a zero at s = p <strong>in</strong> the ORHP then log S(s) has a branch po<strong>in</strong>t ats = p. It is then necessary to <strong>in</strong>dent the contour to avoid the branch cut, asshown <strong>in</strong> Figure A.18 for p real.jωRpσC RFIGURE A.18. Contour for log(s − p).For simplicity, we will next assume that S(s) has a simple zero at s = pon the positive real axis. Consider the <strong>in</strong>tegral of log S(s) on the <strong>in</strong>dentation,C, shown <strong>in</strong> detail <strong>in</strong> Figure A.19.


320 Appendix A. Review of Complex Variable TheoryɛIIIpIIIFIGURE A.19. Indentation around branch cut.S<strong>in</strong>ce log S(s) = log(s−p)+log ¯S(s), where ¯S(s) is analytic on <strong>and</strong> <strong>in</strong>sidethe <strong>in</strong>dentation, then∫∫∫ jεlog S(s) ds = log(s − p) ds + log ¯S(jω) dω.−jεCCWe will thus concentrate on ∫ log(s − p) ds.CS<strong>in</strong>ce C is <strong>in</strong> the doma<strong>in</strong> of analyticity of log(s − p), the fundamentaltheorem of calculus, given by (A.18), yields∫jεlog(s − p) ds = [(s − p) log(s − p) − (s − p)]∣C−jε= jp[arg(−p − jε) − arg(−p + jε)].The limit when ε → 0 is then∫log(s − p) ds = −j2πp.limε→0CAs an aside, note that the <strong>in</strong>tegral around the semicircle II <strong>in</strong> Figure A.19goes to zero when ε → 0. Indeed,∫∣ ∣∣∣∣ ∫ π/2∣ log(s − p) ds∣ = (log ε + jθ)ε e jθ dθII−π/2∣∫ π/2 ∫ π/2≤ ε log ε +ε |θdθ|,−π/2−π/2which goes to 0 as ε → 0. S<strong>in</strong>ce the total <strong>in</strong>tegral on C is non zero, thismeans that the <strong>in</strong>tegrals on I <strong>and</strong> III, which are <strong>in</strong> different sides of thebranch cut, do not cancel. This is <strong>in</strong> contrast to what happens with functionswith at most isolated s<strong>in</strong>gularities.It is easy to see that if the function S has zeros p 1 , p 2 , . . . , p n with thesame imag<strong>in</strong>ary part (possibly different than zero <strong>and</strong> possibly with thesame real part), the same branch cut can be used for all of them to obta<strong>in</strong>∫Clog[(s − p 1 )(s − p 2 ) · · · (s − p n )] ds = −j2πn∑Re p i .i=1(A.85)


A.10 The Maximum Modulus Pr<strong>in</strong>ciple 321A.10 The Maximum Modulus Pr<strong>in</strong>cipleAnother result that relates characteristics of a function, analytic <strong>in</strong> a region,with its behavior on the boundary of the region is the maximum moduluspr<strong>in</strong>ciple of analytic functions. This is <strong>in</strong>terest<strong>in</strong>g but less <strong>in</strong>formativethan Poisson <strong>in</strong>tegrals. We will give two versions of this pr<strong>in</strong>ciple.Theorem A.10.1 (Maximum Pr<strong>in</strong>ciple). Let f be analytic <strong>in</strong> a doma<strong>in</strong> D.Then |f| cannot have a maximum anywhere <strong>in</strong> D unless f is a constant.Proof. Assume that f is not a constant <strong>in</strong> D <strong>and</strong> suppose that s 0 is a po<strong>in</strong>t<strong>in</strong> D such that |f(s 0 )| is maximum. Let n be the smallest <strong>in</strong>teger such thatf (n) (s 0 ) ≠ 0, which exists s<strong>in</strong>ce f is not constant. Then the Taylor series off around s 0 has the form 17f(s) = f(s 0 ) + f(n) (s 0 )(s − s 0 ) n + o ((s − s 0 ) n ) . (A.86)n!Let h be a complex number such thath n f(s 0 )= n!f (n) (s 0 ) εn , ε > 0,where ε is small enough such that s = s 0 + h is <strong>in</strong>side the disk of convergenceof the Taylor series (A.86). Then, evaluat<strong>in</strong>g this series at s = s 0 + h,we havef(s 0 + h) = (1 + ε n )f(s 0 ) + o (ε n ) .This implies|f(s 0 + h)| > |f(s 0 )|,which contradicts the assumption that |f(s 0 )| was a maximum. S<strong>in</strong>ce s 0 isgeneric, it follows that |f| cannot achieve a maximum anywhere <strong>in</strong> D, <strong>and</strong>the result follows.□Theorem A.10.2 (Maximum Pr<strong>in</strong>ciple, second version). Let f(s) be analytic<strong>in</strong> a bounded region R <strong>and</strong> let |f(s)| be cont<strong>in</strong>uous <strong>in</strong> the closed regionR. Then |f(s)| assumes its maximum on the boundary of the region.Proof. The theorem trivially holds if f(s) is constant. Suppose then thatf(s) is not a constant. S<strong>in</strong>ce |f(s)| is cont<strong>in</strong>uous, by a well-known theorem<strong>in</strong> real variables |f(s)| assumes a maximum somewhere <strong>in</strong> the closedbounded region R. By Theorem A.10.1, this maximum cannot be assumedat any <strong>in</strong>terior po<strong>in</strong>t <strong>and</strong> hence must be assumed on the boundary. □The maximum modulus pr<strong>in</strong>ciple is used <strong>in</strong> Chapter 3 to show the“push-pop” or “water-bed” phenomenon <strong>in</strong> l<strong>in</strong>ear control theory.17 Here we use the notation g(s) = o(s µ ) to mean that g(s)/s µ → 0 when s is near tosome given limit.


322 Appendix A. Review of Complex Variable TheoryA.11 Entire FunctionsAn entire function, f, is a function def<strong>in</strong>ed <strong>and</strong> analytic for all f<strong>in</strong>ite valuesof the complex variable s. An entire function that is not a polynomial iscalled an entire transcendental function.In Chapters 10 <strong>and</strong> 11, we use the follow<strong>in</strong>g result on zeros of transcendentalfunctions of a particular form.Lemma A.11.1. Consider an entire function hav<strong>in</strong>g the particular formf(s) = g 1 (s)e −sτ + g 2 (s),(A.87)where τ > 0 <strong>and</strong> g 1 (s) <strong>and</strong> g 2 (s) are polynomials. Letδ = deg(g 2 ) − deg(g 1 ).(A.88)We then have the follow<strong>in</strong>g implications.(i) If δ > 0, then the high frequency zeros (s → ∞) of f(s) have negativereal part.(ii) If δ < 0, then the high frequency zeros of f(s) have positive real part.(iii) If δ = 0, then the high frequency zeros of f(s) converge to the sequences k = − 1 τ log η + jk2π τ, k = 0, ±1, ±2, · · · . (A.89)And η is the ratio between the highest order coefficients of g 2 (s) <strong>and</strong>g 1 (s), i.e.,g 2 (s)η = lims→∞ g 1 (s) .(A.90)Proof. The zeros of f <strong>in</strong> (A.87) satisfy the identitye −sτ = − g 2(s)g 1 (s) .Tak<strong>in</strong>g logarithms yieldss = − 1 ∣ ∣∣∣τ log g 2 (s)g 1 (s) ∣ − j 1 [ ]τ arg g2 (s)+ jk 2π g 1 (s) τ . (A.91)Assume δ > 0. Then clearly, for s a high frequency zero of f, the argumentof the log function <strong>in</strong> (A.91) will be greater than one <strong>and</strong> thus the zerowill have negative real part. This shows that case (i) is true. The proof ofcase (ii) follows similarly.F<strong>in</strong>ally, for δ = 0, we note that, for large s, the right h<strong>and</strong> side of (A.91)converges to (− log |η| − j arg(η) + jk2π)/τ, with η given <strong>in</strong> (A.90). Hence(A.89) follows.□


A.11 Entire Functions 323Example A.11.1. Even <strong>in</strong> the case of δ > 0, the function f may have zeroswith positive real part. For example, let f be given byf(s) = e 1 s + e 2 − be aτ e −sτ ,(A.92)where e 1 , e 2 , b, a are positive real constants. Assume further that b > e 2 .Consider the functionsp(s) = e 2 − be aτ e −sτ ,q(s) = e 1 s,(A.93)<strong>and</strong> the closed curve γ = γ 1 ∪ γ 2 def<strong>in</strong>ed asγ 1 = {s = jω : ω ∈ [−R, R ]},γ 2 = {s = Re jθ : θ ∈ [−π/2, π/2 ]}.(A.94)On γ 1 we have|p(s)| = |e 2 − be aτ e jωτ | ≥ ||e 2 | − |be aτ || = be aτ − e 2 ,|q(s)| = e 1 ω ≤ e 1 R.(A.95)Then |q(s)| < |p(s)| on γ 1 ifOn the other h<strong>and</strong>, on γ 2 , we haveR < R 1 beaτ − e 2e 1. (A.96)|p(s)| ≥ |e 2 − |be aτ e −Rτ cos θ ||,|q(s)| = e 1 R.(A.97)Some simple calculations show that |q(s)| < |p(s)| on γ 2 ifR < R 2 beaτ − e 2e 1 + τ . (A.98)S<strong>in</strong>ce τ > 0, we have that R 2 < R 1 . It follows that, if R < R 2 , |q(s)| < |p(s)|on the semicircular contour γ. Note also that both p <strong>and</strong> q are analytic on<strong>and</strong> <strong>in</strong>side γ. By Rouche’s Theorem (Conway, 1973) we then know that p<strong>and</strong> p + q = f have the same number of zeros <strong>in</strong>side γ.The zeros of p(s) are given bys k = − 1 τ log e 2be aτ + jk2π τ .(A.99)S<strong>in</strong>ce e 2 < be aτ , the real parts of s k are positive. this establishes the claimthat f can have low frequency zeros hav<strong>in</strong>g positive real part. (Actually,the imag<strong>in</strong>ary parts are spaced by 2π/τ. Hence, as τ <strong>in</strong>creases, then R 2given <strong>in</strong> (A.98) <strong>in</strong>creases <strong>and</strong> the spac<strong>in</strong>g between zeros decreases. It followsthat the number of roots of f <strong>in</strong> the ORHP <strong>in</strong>creases with τ). ◦


324 Appendix A. Review of Complex Variable TheoryNotes <strong>and</strong> ReferencesThe material of this chapter is based ma<strong>in</strong>ly on Lev<strong>in</strong>son <strong>and</strong> Redheffer (1970) <strong>and</strong>Churchill <strong>and</strong> Brown (1984). These are st<strong>and</strong>ard textbooks on Complex VariableTheory. Some specific results <strong>and</strong> def<strong>in</strong>itions were obta<strong>in</strong>ed from Widder (1961),Kaplan (1973), Markushevich (1965) <strong>and</strong> Conway (1973).In particular, §A.4.2 conta<strong>in</strong>s important <strong>in</strong>put from Widder (1961); §A.5.1 <strong>and</strong>§A.8 are largely based on Widder (1961) <strong>and</strong> Conway (1973), respectively.Equations (A.5), which are usually called the Cauchy-Riemann equations, are ofcentral importance <strong>in</strong> the theory of analytic functions. However, it should be notedthat this universally encountered attribution is not historically justified. In fact,equations (A.5) had already been studied <strong>in</strong> the eighteenth century by D’Alembert(1717-1783) <strong>and</strong> Euler (1707-1783), <strong>in</strong> research devoted to the application of functionsof a complex variable to hydrodynamics (D’Alembert <strong>and</strong> Euler), <strong>and</strong> to cartography<strong>and</strong> <strong>in</strong>tegral calculus (Euler) (Markushevich, 1965, p. 111).


Appendix BProofs of Some Results <strong>in</strong> theChaptersB.1 Proofs for Chapter 4In this section, we prove the Bode <strong>in</strong>tegral constra<strong>in</strong>ts for the logarithm ofthe s<strong>in</strong>gular values of the sensitivity function, given <strong>in</strong> Theorem 4.2.2 ofChapter 4. We follow Chen (1995). A few prelim<strong>in</strong>ary technical results areneeded <strong>in</strong> order to prove this theorem.Fact B.1.1. Let f : ¡ × ¡ → ¡ given byf(η, ω) =η 2η 2 + ω 2 .Then f(η, ω) → 1 as η → ∞, uniformly on any compact <strong>in</strong>terval.Fact B.1.2 (Lev<strong>in</strong>son <strong>and</strong> Redheffer 1970, p.337). Consider f : ¢ × ¡ → ¢ .Suppose that f(s, η) is analytic <strong>in</strong> a doma<strong>in</strong> D ⊂ ¢ , <strong>and</strong> that f(s, η) →g(s) uniformly <strong>in</strong> D as η → ∞. Write s = re jθ . Then, ∂f(re jθ , η)/∂r →∂g(re jθ )/∂r uniformly <strong>in</strong> D as η → ∞.◦Lemma B.1.3. The follow<strong>in</strong>g limit converges uniformly on any compactset∣ ∣∣∣lim η log η + sη→∞ η − s∣ = 2 Re s.(B.1)Proof. The result follows from Example A.8.7 <strong>in</strong> Appendix A, <strong>and</strong> the observationthatη logη + s∣η − s∣ (1 = Re η log + s ) (− Re η log 1 − s ).ηη◦


326 Appendix B. Proofs of Some Results <strong>in</strong> the ChaptersLemma B.1.4. The follow<strong>in</strong>g limit converges uniformly on any compactsetlim η ∂ (∣ logη + Re jθ ) ∣∣∣η→∞ ∂R ∣η − Re jθ = 2 cos θ. (B.2)Proof. Immediate from Lemma B.1.3 <strong>and</strong> Fact B.1.2.Consider next the sensitivity function S correspond<strong>in</strong>g to the feedbacksystem of Figure 4.1. Let p i , i = 1, . . . , n p , be the poles of the open-loopsystem L <strong>in</strong> the ORHP, repeated accord<strong>in</strong>g to their geometric multiplicities.Recall that, if the open-loop system is unstable, then S can be factoredasn∏ pS = S m B i ,(B.3)where S m is m<strong>in</strong>imum phase <strong>and</strong> B i is the all-pass factor correspond<strong>in</strong>gto the pole p i .We make the follow<strong>in</strong>g assumption.Assumption B.1.i=1(i) The closed-loop system of Figure 4.1 is stable.(ii)lim sup R σ(L(s)) = 0.R→∞s∈ +|s|≥R(iii) The s<strong>in</strong>gular values of S m <strong>in</strong> (B.3), i.e., σ i (S m (s)), i = 1, · · · , n, havecont<strong>in</strong>uous second order derivatives for all s ∈ ¢ +.◦We first prove the Bode <strong>in</strong>tegral for open-loop stable system, i.e., S m = S<strong>in</strong> (B.3).Theorem B.1.5 (Bode Integral for S – Stable systems). Let S be the sensitivityfunction of the feedback loop of Figure 4.1. Assume that the openloopsystem, L, is stable. Then, under Assumption B.1 (with S m = S),log σ j (S(jω)) dω = 1 ∫∫σ∇ 2 log σ j (S(σ + jω)) dσ dω. (B.4)2∫ ∞0+Proof. Consider the half disc D R depicted <strong>in</strong> Figure B.1. Denote by γ R ={−jω : −R ≤ ω ≤ R} <strong>and</strong> C R = {Re jθ : −π/2 ≤ θ ≤ π/2}. Then theboundary of D R is given by γ R ∪ C R . Let η > R <strong>and</strong> consider the realfunction g : ¢ → ¡ def<strong>in</strong>ed byg(s) = logη + s∣η − s∣ .□□


B.1 Proofs for Chapter 4 327jωRγ RD RσC RFIGURE B.1. Contour for the Poisson <strong>in</strong>tegral formulaIt is easy to see that g is harmonic <strong>in</strong> D R , <strong>and</strong> hence for any s ∈ D R ,∇ 2 g(s) = 0. Also, g(jω) = 0, ∀ω ∈ ¡ . Letf(s) = log σ j (S(s)).S<strong>in</strong>ce Assumption B.1 holds, then each σ j (S) has cont<strong>in</strong>uous second orderderivative <strong>in</strong> D R , <strong>and</strong> so does log σ j (S) s<strong>in</strong>ce σ j (S(s)) > 0. We can nowapply Green’s formula (A.30) of Appendix A to f <strong>and</strong> g, with the contourC = γ R ∪ C R <strong>and</strong> the doma<strong>in</strong> Ω = D R . We have∫f ∂g ∫ (γ R∂⃗n ds + f ∂gC R∂⃗n − g ∂f ) ∫∫ds = − g∇ 2 f dσ dω. (B.5)∂⃗nD RWe next compute each of the <strong>in</strong>tegrals <strong>in</strong> (B.5). Notice that, on γ R , the outernormal ⃗n is <strong>in</strong> the direction of the negative real axis. Thus, we have that∂g/∂⃗n = −∂g/∂σ, <strong>and</strong>∂g∂σ = ∂ (Re log η + s )∂σ η − s= Re ∂ (log η + s )∂σ η − s= Re( 1η + s + 1η − sSett<strong>in</strong>g σ = 0 <strong>in</strong> the last equality gives, on γ Rwhich leads to∫γ Rf ∂g∂⃗n ds = − ∫ R−R∂g∂⃗n = − 2ηη 2 + ω 2 ,).2ηlog σ j (S(jω))η 2 + ω 2 dω.


328 Appendix B. Proofs of Some Results <strong>in</strong> the ChaptersMultiply<strong>in</strong>g both sides of the above equality by η, tak<strong>in</strong>g limits as η →∞, <strong>and</strong> us<strong>in</strong>g the uniform convergence theorem (Lev<strong>in</strong>son <strong>and</strong> Redheffer,1970, p. 335) <strong>and</strong> Fact B.1.1, it follows that∫lim η f ∂g∫ Rds = − limη→∞γ R∂⃗n η→∞−R=2η 2log σ j (S(jω))η 2 + ω 2 dω∫ R2η 2log σ j (S(jω)) lim−Rη→∞ η 2 + ω 2 dω∫ R= −2 log σ j (S(jω)) dω.−R(B.6)Turn<strong>in</strong>g to the second <strong>in</strong>tegral <strong>in</strong> (B.5), we have(f∫C ∂gR∂⃗n − g ∂f )ds = I 1 − I 2 ,∂⃗nwhereI 1 I 2 ∫ π/2−π/2∫ π/2−π/2R ∂ (∂R∣∣ logη + Re jθ ) ∣∣∣∣η − Re jθ log σ j (S(Re jθ )) dθ,∣ R logη + Re jθ ∣∣∣ ∂∣η − Re jθ ∂R [log σ j(S(Re jθ ))] dθ.As before, we have, us<strong>in</strong>g the uniform convergence theorem <strong>and</strong> LemmasB.1.3 <strong>and</strong> B.1.4,∫ π/2lim ηI 1 = lim ηη→∞ η→∞= 2∫ π/2−π/2−π/2R ∂ (∣ logη + Re jθ ) ∣∣∣∂R ∣η − Re jθ log σ j (S(Re jθ )) dθR log σ j (S(Re jθ )) cos θ dθ,<strong>and</strong>∫ π/2lim ηI 2 = lim ηη→∞ η→∞= 2∫ π/2−π/2−π/2∣ R logη + Re jθ ∣∣∣ ∂∣η − Re jθ ∂R [log σ j(S(Re jθ ))] dθR 2 ∂∂R [log σ j(S(Re jθ ))] cos θ dθ.These <strong>in</strong>equalities lead to∫ (lim η f ∂gη→∞C R∂⃗n − g ∂f )ds = 2I 3 − 2I 4 ,∂⃗n(B.7)


B.1 Proofs for Chapter 4 329whereI 3 I 4 ∫ π/2−π/2∫ π/2−π/2R log σ j (S(Re jθ )) cos θ dθ,R 2 ∂∂R [log σ j(S(Re jθ ))] cos θ dθ.As for the <strong>in</strong>tegral on the RHS of (B.5), we use the same limit<strong>in</strong>g techniqueto obta<strong>in</strong>∫∫∫∫lim η g∇ 2 f dσ dω = 2 σ∇ 2 log σ j (S(σ + jω)) dσ dω. (B.8)η→∞D R D RComb<strong>in</strong><strong>in</strong>g (B.6)-(B.8) yields∫∫D Rσ∇ 2 log σ j (S(σ + jω)) dσ dω =∫ Rlog σ j (S(jω)) dω − I 3 + I 4 .−RThe next step is to take limits of both sides of the above equation as R →∞. This gives∫∫∫ ∞σ∇ 2 log σ j (S(σ + jω)) dσ dω = log σ j (S(jω)) dω−∞+(B.9)− limR→∞ I 3 + limR→∞ I 4.We claim that lim R→∞ I 3 = 0. To see this, we first note that for j = 1, 2, . . . , n,we have that σ(S(s)) ≤ σ j (S(s)) ≤ σ(S(s)) for any s ¢ ∈ +. Furthermore,under Assumption B.1 (ii), σ(L(s)) < 1 for any |s| > R if R > 0 is sufficientlylarge. As a result, I 3 can be bounded as follows|I 3 | ≤∫ π/2−π/2R| log σ j (S(Re jθ ))| cos θ dθ≤ 2 max R| log σ j(S(Re jθ ))|θ∈[−π/2,π/2]≤ 2 max max {R| logθ∈[−π/2,π/2] σ(S(Rejθ ))|, R| log σ(S(Re jθ ))|}≤ 2 max max {R| log[1 −θ∈[−π/2,π/2] σ(L(Rejθ ))]|, R| log[1 + σ(L(Re jθ ))]|}.From Assumption B.1 (ii) <strong>and</strong> the logarithm series expansions (see ExampleA.8.6 <strong>in</strong> Appendix A)log[1 − σ(L(Re jθ ))] = −σ(L(Re jθ )) − 1 2 σ2 (L(Re jθ )) + · · · ,log[1 + σ(L(Re jθ ))] = σ(L(Re jθ )) − 1 2 σ2 (L(Re jθ )) + · · · ,


330 Appendix B. Proofs of Some Results <strong>in</strong> the Chapterswhich hold for |s| > R <strong>and</strong> R > 0 sufficiently large, we conclude that<strong>and</strong> thuslim R| log[1 −R→∞ σ(L(Rejθ ))]| = 0,lim R| log[1 +R→∞ σ(L(Rejθ ))]| = 0,lim I 3 = 0,R→∞prov<strong>in</strong>g our claim. We next show that(B.10)lim I 4 = 0.R→∞(B.11)We do this by show<strong>in</strong>g that lim R→∞ I 4 = − lim R→∞ I 3 . First we write,us<strong>in</strong>g the Leibniz rule (see e.g., Kaplan, 1973, p. 219)∫ π/2−π/2Then we have∂∂R [log σ j(S(Re jθ ))] cos θ dθ = d ∫ π/2log σ j (S(Re jθ )) cos θ dθ.dR −π/2lim I 4 = limdR→∞ R→∞ R2 dR= − limR→∞ddR∫ π/2−π/2log σ j (S(Re jθ )) cos θ dθ∫ π/2−π/2 log σ j(S(Re jθ )) cos θ dθd(1/R)/dR∫ π/2= − lim R log σ j (S(Re jθ )) cos θ dθR→∞ −π/2= − limR→∞ I 3,(B.12)where the third equality holds from L’Hospital’s rule (e.g., Widder, 1961,p. 260). Thus, (B.11) follows on us<strong>in</strong>g (B.12) <strong>and</strong> (B.10).F<strong>in</strong>ally, comb<strong>in</strong><strong>in</strong>g (B.9)-(B.11), <strong>and</strong> observ<strong>in</strong>g that∫ ∞log σ j (S(jω)) dω = 2−∞∫ ∞0log σ j (S(jω)) dω,due to conjugate symmetry of S(jω), (B.4) is then obta<strong>in</strong>ed.□It rema<strong>in</strong>s to establish Theorem 4.2.2 for open-loop unstable systems.This is done <strong>in</strong> the follow<strong>in</strong>g theorem by us<strong>in</strong>g the factorization of S given<strong>in</strong> (B.3).Theorem B.1.6 (Bode Integral for S – Unstable systems). Let S be factorizedas <strong>in</strong> (B.3). Then, under Assumption B.1,∫ ∞log σ j (S(jω)) dω = F j + K j ,0


B.1 Proofs for Chapter 4 331whereF j = 1 ∫∫σ∇ 2 log σ j (S m (σ + jω)) dσ dω,2<strong>and</strong>+∫ π/2K j = limR→∞ −π/2(∏npR log σ ji=1B −1i(Re jθ ))cos θ dθ.Proof. We use (the proof of) Theorem B.1.5 for the m<strong>in</strong>imum-phase factorS m <strong>in</strong> (B.3). If then follows from (B.9) <strong>and</strong> (B.12) that∫ ∞log σ j (S m (jω)) dω = 2F j + 2 lim I 3,−∞R→∞(B.13)whereI 3 =∫ π/2−π/2R log σ j (S m (Re jθ )) cos θ dθ.S<strong>in</strong>ce each B i <strong>in</strong> (B.3) is all-pass for i = 1, . . . , n p , we have that S(jω)S ∗ (jω) =S m (jω)S ∗ m(jω), <strong>and</strong> henceσ j (S m (jω)) = σ j (S(jω)), j = 1, . . . , n. (B.14)Also, for |s| ≥ R ≥ max 1≤i≤np |p i |, both B −1i(s) <strong>and</strong> S(s) are well-def<strong>in</strong>ed.By us<strong>in</strong>g norm <strong>in</strong>equalities (see e.g., Golub <strong>and</strong> Van Loan, 1983), we havewhereThis leads to<strong>and</strong>I 3 ≥I 3 ≤σ(S(Re jθ ))σ j (Π) ≤ σ j (S m (jω)) ≤ σ(S(Re jθ ))σ j (Π),∫ π/2−π/2∫ π/2−π/2n p∏Π B −1i(Re jθ ).i=1∫ π/2R log σ(S(Re jθ )) cos θ dθ + R log σ j (Π) cos θ dθ,−π/2∫ π/2R log σ(S(Re jθ )) cos θ dθ + R log σ j (Π) cos θ dθ.−π/2As shown <strong>in</strong> the proof of Theorem B.1.5, however, the first terms on theRHSs of the two <strong>in</strong>equalities above tend to zero as R → ∞, <strong>and</strong> thus∫ (π/2np)lim I ∏3 = lim R log σ j B −1i(Re jθ ) cos θ dθ. (B.15)R→∞ R→∞ −π/2The proof is completed by substitut<strong>in</strong>g (B.14) <strong>and</strong> (B.15) <strong>in</strong>to (B.13), <strong>and</strong>by us<strong>in</strong>g the conjugate symmetry of S(jω).□i=1


332 Appendix B. Proofs of Some Results <strong>in</strong> the ChaptersB.2 Proofs for Chapter 6B.2.1 Proof of Lemma 6.2.2In this subsection, we provide the proof of Lemma 6.2.2 on steady-statefrequency response of sampled-data systems. First, we need two prelim<strong>in</strong>arylemmas.Lemma B.2.1. Suppose that h is the pulse response of a hold device as describedon page 138 of §6.1.2, i.e., h is a function of bounded variation withf<strong>in</strong>ite support on the <strong>in</strong>terval [0, τ]; let H = Lh be its frequency responsefunction. Then, there exist f<strong>in</strong>ite constants c 1 <strong>and</strong> c 2 such that|s H(s)| ≤ c 1 + c 2 e − Re sτ if s is <strong>in</strong> ¢ − , (B.16)|s H(s)| ≤ c 2 + c 1 e − Re sτ if s is <strong>in</strong> ¢ +. (B.17)Proof. Us<strong>in</strong>g the def<strong>in</strong>ition of H <strong>and</strong> <strong>in</strong>tegration by parts, we can write∫ τ |s H(s)| =∣ s e −sζ h(ζ) dζ∣0=∣ h(0+ ) − e −sτ h(τ − ) + e −sζ ḣ(ζ) dζ∣∫ τ0≤ ‖h‖ ∞(1 + e− Re sτ ) +∫ τ0e − Re sζ ∣ ∣ ḣ(ζ) ∣ ∣ dζ ,(B.18)where ‖h‖ ∞ = sup [0,τ]|h(t)|, <strong>and</strong> ḣ denotes dh/dt, which exists almosteverywhere on [0, τ] because h is of bounded variation. If s ¢ is <strong>in</strong> − , thene − Re sζ ≤ e − Re sτ for ζ ∈ [0, τ], <strong>and</strong> we have from (B.18) that)|s H(s)| ≤ ‖h‖ ∞ +(‖h‖ ∞ + ‖ḣ‖ 1 e − Re sτ ,where ‖ḣ‖ 1 = ∫ τ∣ ∣0 ḣ(ζ) dζ, which is f<strong>in</strong>ite also because h is assumed ofbounded variation (e.g., see Rud<strong>in</strong> (1987, p. 157)). The bound (B.16) thenfollows. If s ¢ is <strong>in</strong> +, then (B.17) follows from (B.18) on us<strong>in</strong>g the bounde − Re sζ ≤ 1 for ζ ∈ [0, τ].□Lemma B.2.2. Suppose that G is a proper transfer function, which may<strong>in</strong>clude a time delay, <strong>and</strong> let H be the frequency response of a hold, as <strong>in</strong>Lemma B.2.1. Let ρ be a f<strong>in</strong>ite number <strong>in</strong> ¢ − that is not a pole of G, <strong>and</strong>def<strong>in</strong>e ρ k ρ + jkω s , with k = ±1, ±2, . . . Assume further that ρ k is not apole of G for any k. Then,limK∑K→∞k=−K|G(ρ k )H(ρ k )| 2 < ∞ .(B.19)


B.2 Proofs for Chapter 6 333Proof. S<strong>in</strong>ce G is assumed to be proper, then |G(ρ k )| converges to a f<strong>in</strong>iteconstant as k → ∞. Denote this constant by M G . Us<strong>in</strong>g this bound, <strong>and</strong>B.16 from Lemma B.2.1 yieldslimK∑K→∞k=−K<strong>and</strong> the result follows.|G(ρ k )H(ρ k )| 2 ≤ M 2 G(c 1 + c 2 e −ρτ ) 2∞∑k=−∞1|ρ k | 2 < ∞ ,□Proof of Lemma 6.2.2. We consider only the disturbance response, calculationsfor the noise response follow similarly. For convenience, we recallthe expression for Y d from (6.9) <strong>in</strong> Chapter 6, which we rewrite asY d (s) = D(s) − G(s)H(s)K d (e sτ )S d (e sτ )V d (e sτ ),where V d (e sτ ) is given by (cf. the proof of Lemma 6.2.1)V d (e sτ ) = 1 τ∞∑k=−∞F k (s)D k (s).(B.20)To evaluate the steady-state response to d(t) = e jωt , we must first evaluatethe <strong>in</strong>verse Laplace transform of Y d , <strong>and</strong> then discard all terms dueto those poles ly<strong>in</strong>g ¢ <strong>in</strong> − . Invert<strong>in</strong>g the Laplace transform requires thatwe evaluate the Bromwich <strong>in</strong>tegral (Lev<strong>in</strong>son <strong>and</strong> Redheffer, 1970)y d (t) = 12πj∫ γ+j∞γ−j∞e st Y d (s) ds ,(B.21)where γ > 0. This <strong>in</strong>tegral may be evaluated us<strong>in</strong>g the residue theorem(Theorem A.9.1 <strong>in</strong> Appendix A).It follows from (B.20) that Y d has poles due to the disturbance locatedalong the imag<strong>in</strong>ary axis at s = j(ω + kω s ), k = 0, ±1, ±2, . . .. By theassumption of closed loop stability all other poles of Y d lie <strong>in</strong> ¢ − . Us<strong>in</strong>g(B.20), it may be shown that these poles have the follow<strong>in</strong>g properties:(i) they all lie to the right of some vertical l<strong>in</strong>e Re s = c < 0,(ii) there are f<strong>in</strong>itely many poles due to G <strong>and</strong> no poles due to H,(iii) there are f<strong>in</strong>itely many sequences of poles due to K d (e sτ ), S d (e sτ ),<strong>and</strong> F(s + jkω s ), k = 0, ±1, ±2, . . . ly<strong>in</strong>g on vertical l<strong>in</strong>es <strong>and</strong> spacedat <strong>in</strong>tervals equal to ω s .Next, it is straightforward to verify from (B.20) <strong>and</strong> Def<strong>in</strong>ition 6.2.1 thatthe residues of e st Y d at the jω-axis poles are given by(lim s − j(ω + kωs ) ) {e st Y d S 0 (jω)e jωt if k = 0 ,(s) =s→j(ω+kω s)−T k (jω)e j(ω+kωs)t if k ≠ 0 .(B.22)


334 Appendix B. Proofs of Some Results <strong>in</strong> the ChaptersWe need not calculate explicitly the residues at the other poles; as we willshow, they do not contribute to the steady-state response.Consider the contours of <strong>in</strong>tegration C n , n = 1, 2, 3, . . . depicted <strong>in</strong> FigureB.2, <strong>and</strong> chosen so that (i) C 1 encloses only that jω-axis pole ly<strong>in</strong>g <strong>in</strong>Ω N , (ii) the horizontal l<strong>in</strong>e Im s = R 1 does not conta<strong>in</strong> any OLHP poles ofY d , <strong>and</strong> (iii) R n+1 = R n + ω s .(III)j Im s(II)j(ω + (n − 1)ω s)C nR n j(ω + ω s)(I)Re scjωj(ω − ω s)Ω Nγj(ω + ω s)j(ω − nω s)(IV)FIGURE B.2. Contours of <strong>in</strong>tegration.Figure B.2 <strong>and</strong> subsequent calculations are appropriate for the case thatω is <strong>in</strong> Ω N (modifications to the general case are straightforward). Ourconstruction of the contour of <strong>in</strong>tegration guarantees that for n sufficientlylarge no poles of Y d will lie on C N . Hence the residue theorem may beapplied to yield12πj{∫e st Y d (s) ds +I∫II∫∫e st Y d (s) ds + e st Y d (s) ds +IIIn∑= S 0 (jω)e jωt −k=−nk≠0IV}e st Y d (s) dsT k (jω)e j(ω+kωs)t + Ψ(t) ,(B.23)where Ψ(t) denotes the contribution of the poles <strong>in</strong> ¢ − .We now sketch a proof that as t → ∞, Ψ(t) → 0. First, it is clear that thecontribution to Ψ from each pole of G converges to zero. Consider next thecontribution of one of the f<strong>in</strong>itely many sequences of poles described <strong>in</strong>


B.2 Proofs for Chapter 6 335(iii) above. Let this sequence be denoted ρ k ρ + jkω s , k = 0, ±1, ±2, . . .,<strong>and</strong> Re ρ < 0. We will assume that ρ is real for notational simplicity, <strong>and</strong>will also assume for simplicity that each ρ k is a simple pole. Then, for anyfixed value of t, the contribution to Ψ from this sequence of poles is givenbyK∑y ρ (t) e ρt lim Res Y d (s)e jkωst , (B.24)s=ρ kK→∞k=−Kwhere Res s=ρk Y d (s) = lim s→ρk (s − ρ k )Y d (s) (see §A.9 <strong>in</strong> Appendix A).From (B.20) we haveRes Y d (s) = −G(ρ k )H(ρ k ) lim (s − ρ k )K d (e sτ )S d (e sτ )V d (e sτ ) .s=ρ k s→ρ k(B.25)Because K d (e sτ ), S d (e sτ ), <strong>and</strong> V d (e sτ ) are each periodic <strong>in</strong> s along verticall<strong>in</strong>es, it may be shown that the limit on the RHS of (B.25) is <strong>in</strong>dependentof k. Denote the common value of this limit by −L ρ . Then (B.24) becomesy ρ (t) = e ρt L ρlimK∑K→∞k=−KG(ρ k )H(ρ k )e jkωst .(B.26)By Lemma B.2.2, the sequence {G(ρ k )H(ρ k )} is square-summable. Therefore,by the Riesz-Fischer Theorem (Riesz <strong>and</strong> Sz.-Nagy, 1990, p.70), theseries <strong>in</strong> (B.26) converges to a bounded periodic function of t. S<strong>in</strong>ce ρ < 0,it thus follows that y ρ (t) → 0 as t → ∞. S<strong>in</strong>ce there are only f<strong>in</strong>itely manysequences of the form (B.26), we then have that Ψ(t) → 0.The desired result (6.17) will hold if it may be shown that the last three<strong>in</strong>tegrals on the LHS of (B.23) converge to zero as n → ∞. We now showthat the <strong>in</strong>tegral (II) converges to zero; similar calculations apply to (IV).Consider values of s such that s = σ + jR n , c ≤ σ ≤ γ, <strong>and</strong> R n is sufficientlylarge that R n > ω <strong>and</strong> that C n encloses all poles of G. It may beshown that there exist constants M <strong>and</strong> M G , <strong>in</strong>dependent of n, such that|K d (e sτ )S d (e sτ )V d (e sτ )| ≤ M <strong>and</strong> |G(s)| ≤ M G for all such s. Also, it followsfrom Lemma B.2.1 that, for t ≥ τ, there exists some constant c 3 suchthat|se st H(s)| ≤ c 3 e γt , for all s with Re s ≤ γ. (B.27)Us<strong>in</strong>g these bounds <strong>in</strong> conjunction with (B.20) <strong>and</strong> the fact that D(s) =1/(s − jω), yields|e st Y d (s)| ≤ eγt (1 + MM G c 3 )R n − ωUs<strong>in</strong>g (B.28) <strong>in</strong> <strong>in</strong>tegral (II) yields∫∣ e st Y d (s) ds∣ ≤ (γ − c)eγt (1 + MM G c 3 ),R n − ωII. (B.28)


336 Appendix B. Proofs of Some Results <strong>in</strong> the Chapterswhich converges to zero as R n → ∞.It rema<strong>in</strong>s to show that the <strong>in</strong>tegral (III) converges to zero as R n → ∞.Parametrize (III) by s = c + R n e jθ , with π/2 ≤ θ ≤ 3π/2, <strong>and</strong> def<strong>in</strong>eξ = s − c; contour (III) is then a semicircle ϕ n centered at the orig<strong>in</strong> of theξ-plane <strong>and</strong> extended <strong>in</strong>to the left half plane. We then have that∫∣ ∣∣∣ ∣ e st Y d (s) ds∣ = eIII∫ϕ ct e ξ(t−τ) e ξτ Y d (ξ + c) dξ∣n∣ ≤ e∫ϕ ct ∣∣e ξτ Y d (ξ + c) ∣ ∣ e ξ(t−τ) dξ ∣ . (B.29)n∣Now, follow<strong>in</strong>g similar steps to those used to obta<strong>in</strong> (B.28), ∣e ξτ Y d (ξ + c) ∣can be bounded on ϕ n as∣∣e ξτ Y d (ξ + c) ∣ ≤ e(γ−c)τ (1 + MM G c 3 ),R n − ωThis bound <strong>and</strong> the application of Jordan’s Lemma (Lemma A.4.1 fromAppendix A) to obta<strong>in</strong> a bound on the <strong>in</strong>tegral ∫ ϕ n|e ξ(t−τ) dξ|, for t > τ,gives, from (B.29),∫∣ e st Y d (s) ds∣ ≤ eγt (1 + MM G c 3 )π,(R n − ω)(t − τ)IIIwhich converges to zero as R n → ∞, conclud<strong>in</strong>g the proof.□The follow<strong>in</strong>g subsections give the proofs of the results <strong>in</strong> §6.2.2 on robuststability of sampled-data systems.B.2.2 Proof of Lemma 6.2.4It is necessary for closed loop stability that˜S d (z) = [I + K d (z)(F ˜GH) d (z)] −1(B.30)have no poles <strong>in</strong> £ c . Rearrang<strong>in</strong>g yields˜S d (z) = [I + S d (z)K d (z)(FW∆GH) d (z)] −1 S d (z).(B.31)S<strong>in</strong>ce the nom<strong>in</strong>al system is stable, ˜S d will have no poles <strong>in</strong> £ c if <strong>and</strong> onlyifdet[I + S d (e jωτ )K d (e jωτ )(FW∆GH) d (e jωτ )] ≠ 0 for all ω. (B.32)The proof proceeds by contradiction. We follow the argument used <strong>in</strong>Chen <strong>and</strong> Desoer (1982, Theorem 2). Denote Q(jω) T 0 (jω)W(jω), <strong>and</strong>


B.2 Proofs for Chapter 6 337suppose that (6.23) is violated. Then there exists a frequency ω 1 such thatσ 1 σ(Q(jω 1 )) > 1, where σ(·), recall, denotes the maximum s<strong>in</strong>gularvalue. Perform<strong>in</strong>g a s<strong>in</strong>gular value decomposition of Q(jω 1 ) yieldsQ(jω 1 ) = U diag[σ 1 . . .] V ∗ ,where U {u ij } <strong>and</strong> V {v ij } are unitary matrices. Now assume for themoment that there exists an admissible ^∆ that also satisfies⎡ ⎤v 11⎢^∆(jω 1 ) = .⎣⎥. ⎦ (−σ 1 ) −1 [ ]u ∗ 11. . . u ∗ n1(B.33)v n1<strong>and</strong>= V diag[(−σ 1 ) −1 , 0, . . . , 0] U ∗ ,^∆(j(ω 1 + kω s )) = 0 for k = ±1, ±2, . . ., <strong>and</strong> k ≠ −2ω 1 /ω s . (B.34)The assumptions on W, <strong>and</strong> ∆, imply that a formula similar to (6.7) maybe used to calculate (FW ^∆GH) d . Us<strong>in</strong>g (B.33) <strong>and</strong> (B.34) yields(FW ^∆GH) d (e jω 1τ ) = − 1 τ F(jω 1)W(jω 1 )V× diag[− 1 σ 1, 0, . . . , 0] U ∗ G(jω 1 )H(jω 1 ),(B.35)<strong>and</strong> therefore 1det[I + S d (e jω 1τ )K d (e jω 1τ )(FW ^∆GH) d (e jω 1τ )]= det[I + S d K d1τ FWV diag[(−σ 1) −1 , 0, . . . , 0] U ∗ GH]= det[I + V diag[(−σ 1 ) −1 , 0, . . . , 0] U ∗ 1 τ GHS dK d FW]= det[I + V diag[(−σ 1 ) −1 , 0, . . . , 0] U ∗ Q(jω 1 )]= [I + V diag[−1, 0, . . . , 0] V ∗ ]= det[V] det[diag[0, 1, 1, . . . , 1]] det[V ∗ ]= 0.Hence, (B.32) fails <strong>and</strong> so the perturbed system is unstable.It rema<strong>in</strong>s to show that ^∆ satisfy<strong>in</strong>g the required properties exists. Wedo this follow<strong>in</strong>g a construction used <strong>in</strong> Chen <strong>and</strong> Desoer (1982). Consider^∆(s) ⎡ ⎤α 1 (s)⎢ ⎥⎣ ⎦.α n (s)(− 1 σ 1)f q (s) k ′ z(s) [ β 1 (s), . . . , β n (s) ] ,1 We suppress dependence on the transform variable when convenient <strong>and</strong> where themean<strong>in</strong>g is clear from the context.


338 Appendix B. Proofs of Some Results <strong>in</strong> the Chapterswhere k ′ is a natural number, <strong>and</strong>ω 1 sf q (s) q(s 2 + ω 2 1 ) + ω 1s , q > 0,α i (s) s Im v i1 + Re v i1 ,ω 1β i (s) − sω 1Im u i1 + Re u i1 ,z(s) H ZOH(s − jω 1 )H ZOH (s + jω 1 )η(s),τ|H ZOH (j2ω 1 )|(η(s) − s)s<strong>in</strong>(∢H ZOH (j2ω 1 )) + cos(∢H ZOH (j2ω 1 )) ,ω 1<strong>and</strong> where H ZOH (s) is the frequency response function of the ZOH, <strong>and</strong>∢ denotes the phase of a complex number. It is then straightforward toverify that(i) ^∆(jω 1 ) satisfies (B.33) <strong>and</strong> (B.34), <strong>and</strong>(ii) by choos<strong>in</strong>g both k ′ <strong>and</strong> q large enough, ^∆ is exponentially stable<strong>and</strong>, for all ω ≠ ±ω 1 , lim ω→∞ σ( ^∆(jω)) → 0, i.e., ‖ ^∆‖ ∞ < 1 issatisfied.□B.2.3 Proof of Lemma 6.2.5The proof follows the same l<strong>in</strong>es of that of Lemma 6.2.4 after not<strong>in</strong>g thatwe can alternatively write the perturbed discrete sensitivity function as˜S d = [ ] −11 + K d (F ˜GH) d[( )] −1F∆WGH= 1 + K d (FGH) d − K d1 + W∆ d[ ( )] −1 F∆WGH= 1 − S d K d S d . (B.36)1 + W∆ dThat the nons<strong>in</strong>gularity of the term between brackets <strong>in</strong> (B.36) implies(6.25) may be shown by an argument by contradiction, similar to that used<strong>in</strong> the proof of Lemma 6.2.4, <strong>and</strong> is omitted to avoid repetition. □


Appendix CThe Laplace Transform of thePrediction ErrorThroughout Chapter 10 we have used a modified Laplace transform thatshifts the lower limit of <strong>in</strong>tegration from t = 0 to t = −τ, i.e., the shiftedLaplace transform of a function h(t) was def<strong>in</strong>ed to beH(s) ∫ ∞e −st h(t)dt.−τObviously, this modification leaves unchanged the transforms of signalsstart<strong>in</strong>g at t = 0, but it over-evaluates the transform of the shifted statex(t+τ). Despite this overmeasure, the results given <strong>in</strong> Chapter 10 providea good <strong>in</strong>dication of the performance limits <strong>in</strong> prediction.In this appendix, we will obta<strong>in</strong> the conventional Laplace transform (denotedby L) of the estimation error <strong>and</strong> po<strong>in</strong>t out the difficulties of us<strong>in</strong>gthis version <strong>in</strong> sensitivity analysis.We recall the prediction error from (10.5) (for full-state prediction, i.e.,z = x <strong>in</strong> (10.1)),˜x(t + τ|t) = x(t + τ) − e Aτ^x(t).Tak<strong>in</strong>g the Laplace transform, we have∫ τ]˜X(s) = e[X(s) sτ − x(t)e −st dt − e Aτ ^X(s).0The second term above was not considered <strong>in</strong> our analysis <strong>in</strong> Chapter 10.We will see that this second term cannot be written as aff<strong>in</strong>e <strong>in</strong> the transformsof the <strong>in</strong>put signals.


340 Appendix C. The Laplace Transform of the Prediction ErrorThe contribution to the prediction error due to the process <strong>in</strong>put v isgiven by 1∫ τ]˜X v (s) = e[H sτ xv (s)V(s) − x v (t)e −st dt − e Aτ H^xv (s)V(s),0where x v (t) = ∫ t0 eA(t−θ) Bv(θ)dθ, 0 ≤ t ≤ τ, <strong>and</strong> V = Lv. The aboveequation can be expressed as∫ ∞]˜X v (s) = e[H sτ xv (s)V(s) − x v (t)h τ (t)e −st dt − e Aτ H^xv (s)V(s)0](C.1)= e[H sτ xv (s)V(s) − L[x v (t)h τ (t)](s) − e Aτ H^xv (s)V(s)where h τ is the pulse function given by{1 if 0 ≤ t ≤ τ,h τ (t) =0 otherwise.Solv<strong>in</strong>g the transform of the real multiplication <strong>in</strong> (C.1), we further obta<strong>in</strong>]˜X v (s) = e[H sτ xv (s)V(s) − [H xv (s)V(s)] ⊗ H τ (s) − e Aτ H^xv (s)V(s), (C.2)where H τ = Lh τ , <strong>and</strong> where ‘⊗’ denotes complex convolution (Gardner<strong>and</strong> Barnes, 1949), i.e.,[H xv (s)V(s)] ⊗ H τ (s) =∫ c+jωc−jω∫ c+jωc−jωH xv (θ)V(θ)H τ (s − θ)dθH xv (θ)V(θ) 1 − e−(s−θ)τdθ.s − θ(C.3)The real constant c <strong>in</strong> the <strong>in</strong>tegration limits of (C.3) is chosen such thatσ v < c < ∞, where σ v is the abscissa of absolute convergence of H xv (s)V(s). 2From (C.2), we can def<strong>in</strong>e a “transfer function” (although not <strong>in</strong> theusual multiplicative form) from the process <strong>in</strong>put to the prediction erroras]H ˜xv (s) e[H sτ xv (s) − K τ [ · ](s) − e Aτ H^xv (s), (C.4)where the l<strong>in</strong>ear operatorK τ [v](s) [H xv (s)V(s)] ⊗ H τ (s)1 In the sequel, the symbol H ba will st<strong>and</strong> for the mapp<strong>in</strong>g from signal a to signal b.2 S<strong>in</strong>ce H τ(s) is an entire function, the <strong>in</strong>tegral <strong>in</strong> (C.3) converges absolutely <strong>in</strong> the halfplane Re(s) > σ v.


Appendix C. The Laplace Transform of the Prediction Error 341is clearly not a multiplication operator. With the expression for H ˜xv given<strong>in</strong> (C.4), the correspond<strong>in</strong>g prediction sensitivities would be def<strong>in</strong>ed asP e −sτ H ˜xv(H xv − K τ [ · ]) −1,M e −sτ H^xv(H xv − K τ [ · ]) −1.It is clear from the above def<strong>in</strong>itions that the derivation of <strong>in</strong>terpolation<strong>and</strong> <strong>in</strong>tegral constra<strong>in</strong>ts for this version of the prediction sensitivities isnontrivial. Moreover, it cannot be performed with the tools developed <strong>in</strong>this book.


Appendix DLeast Squares SmootherSensitivities for Large τIn this appendix, we derive an expression for the smooth<strong>in</strong>g sensitivitiesof Chapter 11 that holds for large values of the smooth<strong>in</strong>g lag.Consider, <strong>in</strong> system (11.1) of Chapter 11, that D 1 = D 2 = 0, i.e.,ẋ = Ax + Bv,z = C 1 x,y = C 2 x + w,(D.1)where v <strong>and</strong> w are uncorrelated white noises with <strong>in</strong>cremental covariancesequal to Q <strong>and</strong> R, respectively.Let the Kalman filter for the above system be given bywhere˙^x = ^A^x + K y y,^z = C 1^x,^A = A − K y C 2 ,K y = ΦC 2R ′ −1 ,(D.2)(D.3)<strong>and</strong> Φ ≥ 0 is a stabiliz<strong>in</strong>g solution of the Riccati equation (11.6), reproducedhere for convenienceAΦ + ΦA ′ + BQB ′ − ΦC ′ 2R −1 C 2 Φ = 0.Let F be the Laplace transform from y to ^x for the Kalman filter (D.2), i.e.,F(s) = (sI − ^A) −1 K y .


344 Appendix D. Least Squares Smoother Sensitivities for Large τUs<strong>in</strong>g (D.3) <strong>and</strong> the matrix <strong>in</strong>version lemma (e.g., Kailath, 1980, p. 656) wecan further write F asF(s) = (sI − A) −1 K y [I − C 2 (sI − ^A) −1 K y ].(D.4)Consider next the least squares smoother (11.7) of Example 11.1.1. We thenhave the follow<strong>in</strong>g result.Lemma D.0.3. Consider the least squares smoother of Example 11.1.1 forthe system given <strong>in</strong> (D.1). Assume that the solution, Φ, of the Riccati equation(11.6) is positive def<strong>in</strong>ite. Then, for τ = ¯τ ≫ τ max ( ^A), where τ max ( ^A)is the dom<strong>in</strong>ant time constant 1 of the Kalman filter, the smooth<strong>in</strong>g sensitivities(11.17) can be approximated byM[ ¯τ](s) = H zv (s)QH ′ ιv(−s)R −1 H ιv (s)H −1zv (s),P[ ¯τ](s) = I − H zv (s)QH ′ ιv(−s)R −1 H ιv (s)H −1zv (s),(D.5)where H ιv is the transfer function from the process <strong>in</strong>put, v, to the <strong>in</strong>novationsprocess ι = y − C 2^x correspond<strong>in</strong>g to the Kalman filter (D.2), <strong>and</strong>given byH ιv = C 2 (sI − ^A) −1 B.Proof. For the least squares smoother of Example 11.1.1, the expression ofthe smoother function (11.10) isH s = C 1 Φ(sI + ^A ′ ) −1[ e τ ^A ′ − e −sτ I ] C ′ 2R −1= C 1 Φ(sI + ^A ′ ) −1 e τ ^A ′ C ′ 2R −1 − e −sτ Φ(sI + ^A ′ ) −1 C ′ 2R −1 .Note that the expression e τ ^A ′ C ′ 2 R−1 can be thought of as the state of thefilter at time t = τ, <strong>in</strong> response to the <strong>in</strong>itial conditions given by thecolumns of C ′ 2 R−1 . Then, for τ = ¯τ ≫ τ max ( ^A), where τ max ( ^A) is thedom<strong>in</strong>ant time constant of the Kalman filter, e ¯τ ^A ′ C ′ 2 R−1 will be negligiblecompared to the value of C ′ 2 R−1 . Hence, for τ = ¯τ, we may neglect thefirst term on the RHS above <strong>and</strong> assume that H s is of the formH s [ ¯τ] = −e −s ¯τ C 1 Φ(sI + ^A ′ ) −1 C ′ 2R −1 .(D.6)For convenience, we will use the Riccati equation (11.6) to express ^A ′ <strong>in</strong>(D.3) as^A ′ = A ′ − C ′ 2R −1 C 2 Φ= −Φ −1 (A + BQB ′ Φ −1 )Φ −Φ −1 ĀΦ,1 Recall that the dom<strong>in</strong>ant time constant of a stable system can be taken as the <strong>in</strong>verse ofthe real part of the eigenvalue with smallest magnitude for the real part.


Appendix D. Least Squares Smoother Sensitivities for Large τ 345with the obvious def<strong>in</strong>ition for Ā. Hence(sI + ^A ′ ) −1 = Φ −1 (sI − Ā) −1 Φ.(D.7)Us<strong>in</strong>g (D.7) <strong>in</strong> (D.6), we haveH s [ ¯τ] = −e −s ¯τ C 1 (sI − Ā) −1 K y ,(D.8)where K y = ΦC2 ′ R−1 is the Kalman filter ga<strong>in</strong> given <strong>in</strong> (D.3).Next, recall from (11.21) the expression for the smooth<strong>in</strong>g complementarysensitivity, MM = e sτ F s H yv H −1zv ,where H zv = G z <strong>and</strong> H yv = G y . Replac<strong>in</strong>g F s us<strong>in</strong>g (11.14), we haveM = [C 1 F + e sτ H s (I − C 2 F)]H yv H −1zv ,(D.9)where F is given <strong>in</strong> (D.4). Us<strong>in</strong>g (D.4) <strong>and</strong> (D.8) <strong>in</strong> (D.9), we have, for τ ≫τ max ( ^A),M[ ¯τ] = C 1[(sI−A) −1 −(sI−Ā) −1] K y [I−C 2 (sI− ^A) −1 K y ]H yv H −1zv . (D.10)Next note that, us<strong>in</strong>g (D.3) <strong>and</strong> H yv = C 2 (sI − A) −1 B, we can write[I − C 2 (sI − ^A) −1 K y ]H yv = C 2 (sI − ^A) −1 B= H ιv ,where H ιv is the transfer function from the process <strong>in</strong>put, v, to the <strong>in</strong>novationsprocess, ι. Us<strong>in</strong>g this expression <strong>in</strong> (D.10), we haveM[ ¯τ] = C 1[(sI − A) −1 − (sI − Ā) −1] K y H ιv H −1zv .Us<strong>in</strong>g Ā = A + BQB ′ Φ −1 <strong>and</strong> replac<strong>in</strong>g K y from (D.3) <strong>in</strong> the above equation,<strong>and</strong> rearrang<strong>in</strong>g, yieldsM[ ¯τ] = −C 1 (sI − A) −1 BQB ′ Φ −1 (sI − Ā) −1 ΦC ′ 2R −1 H ιv H −1zv .Us<strong>in</strong>g H zv = C 1 (sI − A) −1 B, <strong>and</strong> (D.7) aga<strong>in</strong>, we further getM[ ¯τ] = −H zv QB ′ (sI + ^A ′ ) −1 C ′ 2R −1 H ιv H −1zv= H zv Q[C 2 (−sI − ^A) −1 B] ′ R −1 H ιv H −1zv= H zv (s)QH ′ ιv(−s)R −1 H ιv (s)H −1zv (s),(D.11)where we have explicitly <strong>in</strong>dicated the arguments s or −s to avoid confusion.This establishes (D.5) for M. By complementarity, the result for P isalso as given <strong>in</strong> (D.5). The result then follows.□


BibliographyB.D.O. Anderson. Properties of optimal l<strong>in</strong>ear smooth<strong>in</strong>g. IEEE Trans. on Automatic<strong>Control</strong>, 14:114–115, 1969.B.D.O. Anderson <strong>and</strong> S. Chirarattananon. Smooth<strong>in</strong>g as an improvement on filter<strong>in</strong>g:A universal bound. Electronics Letters, 7(18):524–525, 1971.B.D.O. Anderson <strong>and</strong> H.B. Doan. Design procedures for stable suboptimal fixedlagsmoothers. IEEE Trans. on Automatic <strong>Control</strong>, 22(6):949–952, 1977.B.D.O. Anderson <strong>and</strong> J.B. Moore. Optimal Filter<strong>in</strong>g. Prentice Hall, EnglewoodCliffs, NJ, 1979.M. Araki. Recent developments <strong>in</strong> digital control theory. In Proceed<strong>in</strong>gs of the 12thWorld Congress of the IFAC, pages IX–251, 1993.M. Araki, Y. Ito, <strong>and</strong> T. Hagiwara. Frequency response of sampled-data systems. InProceed<strong>in</strong>gs of the 12th IFAC World Congress, pages VII–289, 1993. Also Automatica,32(4), pp. 483-497, 1996.A. Astolfi <strong>and</strong> L. Guzzella. Robust control of nonl<strong>in</strong>ear systems. An H ∞ approach.In Proc. of the 12th IFAC World Congress, volume 9, pages 281–284, Sydney, Australia,July 1993.K.J. Åström. <strong>Fundamental</strong> limitations of control system performance. InA. Paulraj, C. Schaper, <strong>and</strong> V. Roychowdhury, editors, Mathematical Eng<strong>in</strong>eer<strong>in</strong>g:A Kailath Festschrift. Kluwer Academic Publishers, 1996.K.J. Åström <strong>and</strong> B. Wittenmark. Computer-<strong>Control</strong>led Systems: Theory <strong>and</strong> Design.Prentice Hall, Englewood Cliffs NY, 2nd edition, 1990.


348 BibliographyD.S. Bernste<strong>in</strong> <strong>and</strong> W.M. Haddad. Steady-state kalman filter<strong>in</strong>g with an H ∞ errorbound. Systems <strong>and</strong> <strong>Control</strong> Letters, 12:9–16, 1989.S.P. Bhattacharyya. The structure of robust observers. IEEE Trans. on Automatic<strong>Control</strong>, 21:581–588, 1976.Å. Björck <strong>and</strong> G.H. Golub. Numerical methods for comput<strong>in</strong>g angles betweenl<strong>in</strong>ear subspaces. Math. Computation, 27(123):579–594, July 1973.H.W. Bode. Network Analysis <strong>and</strong> Feedback Amplifier Design. D. van Nostr<strong>and</strong>, NewYork, 1945.J.L. Bower <strong>and</strong> P.M. Schultheiss. Introduction to the design of servomechanisms. JohnWiley & Sons, 1958.S. Boyd <strong>and</strong> C.A. Desoer. Subharmonic functions <strong>and</strong> performance bounds onl<strong>in</strong>ear time-<strong>in</strong>variant feedback systems. IMA Journal of Math. <strong>Control</strong> <strong>and</strong> Information,pages 153–170, 1985.J.H. Braslavsky. Frequency doma<strong>in</strong> analysis of sampled-data control systems. PhDthesis, Department of Electrical <strong>and</strong> Computer Eng<strong>in</strong>eer<strong>in</strong>g, The University ofNewcastle, NSW 2308 Australia, October 1995.J.H. Braslavsky, G. Me<strong>in</strong>sma, R.H. Middleton, <strong>and</strong> J.S. Freudenberg. On a key sampl<strong>in</strong>gformula relat<strong>in</strong>g the Laplace <strong>and</strong> Z-transforms. Technical Report EE9524,Dept. of Elec. & Comp. Eng., The Univ. of Newcastle, Australia, April 1995a. Toappear Systems <strong>and</strong> <strong>Control</strong> Letters.J.H. Braslavsky, R.H. Middleton, <strong>and</strong> J.S. Freudenberg. Frequency response of generalizedsampled-data hold functions. In Proceed<strong>in</strong>gs of the 34th CDC, New Orleans,LO, December 1995b.J.H. Braslavsky, M.M. Seron, G.C. Goodw<strong>in</strong>, <strong>and</strong> R.W. Gra<strong>in</strong>ger. Tradeoffs <strong>in</strong> multivariablefilter design with applications to fault detection. In Proceed<strong>in</strong>gs of the35th CDC, Kobe, Japan, December 1996.P.E. Ca<strong>in</strong>es. L<strong>in</strong>ear Stochastic Systems. Wiley & Sons, New York, 1988.A.B. Carlson. Communication Systems. An <strong>in</strong>troduction to signals <strong>and</strong> noise <strong>in</strong> electricalcommunication. McGraw-Hill Book Co., 2nd. edition, 1975.J. Chen. Sensitivity <strong>in</strong>tegral relations <strong>and</strong> design trade-offs <strong>in</strong> l<strong>in</strong>ear multivariablefeedback systems. IEEE Trans. on Automatic <strong>Control</strong>, 40(10):1700–1716, October1995.J. Chen <strong>and</strong> C.N. Nett. Sensitivity <strong>in</strong>tegrals for multivariable discrete-time systems.Automatica, 31(8):113–1124, 1995.M.J. Chen <strong>and</strong> C.A. Desoer. Necessary <strong>and</strong> sufficient condition for robust stabilityof l<strong>in</strong>ear distributed feedback systems. Int. J. of <strong>Control</strong>, 35(2):255–267, 1982.T. Chen <strong>and</strong> B.A. Francis. Input-output stability of sampled-data systems. IEEETrans. on Automatic <strong>Control</strong>, 36(1):50–58, January 1991.


Bibliography 349T. Chen <strong>and</strong> B.A. Francis. Optimal sampled-data control systems. Spr<strong>in</strong>ger-Verlag,1995.R.V. Churchill <strong>and</strong> J.W. Brown. Complex Variables <strong>and</strong> Applications. McGraw-HillBook Co., fourth edition, 1984.J.B. Conway. Functions of One Complex Variable. Spr<strong>in</strong>ger-Verlag, New York, 1973.J.B. Conway. A Course <strong>in</strong> Functional Analysis. Spr<strong>in</strong>ger-Verlag, New York, 2ndedition, 1990.J-M Coron, L. Praly, <strong>and</strong> A. Teel. Feedback stabilization of nonl<strong>in</strong>ear systems:Sufficient conditions <strong>and</strong> Lyapunov <strong>and</strong> <strong>in</strong>put-output techniques. In A. Isidori,editor, Trends <strong>in</strong> <strong>Control</strong>. A European perspective, pages 293–348. Spr<strong>in</strong>ger-Verlag,London, 3rd. European <strong>Control</strong> Conference, Rome, Italy, September 1995.H. Cramér. Mathematical Methods of Statistics. Pr<strong>in</strong>ceton University Press, 1946.L. De Branges. Hilbert spaces of entire functions. Prentice-Hall, Inc., EnglewoodCliffs, N.J., 1968.C.E. de Souza, U. Shaked, <strong>and</strong> M. Fu. Robust H ∞ filter<strong>in</strong>g for cont<strong>in</strong>uous timevary<strong>in</strong>g uncerta<strong>in</strong> systems with determ<strong>in</strong>istic <strong>in</strong>put signals. IEEE Trans. on SignalProcess<strong>in</strong>g, 43(3):709–719, 1995.C.A. Desoer <strong>and</strong> A.N. Gündes. Decoupl<strong>in</strong>g l<strong>in</strong>ear multi-<strong>in</strong>put multi-output plantsby dynamic output feedback: An algebraic theory. IEEE Trans. on Automatic<strong>Control</strong>, 31(8):744–750, 1986.C.A. Desoer <strong>and</strong> M. Vidyasagar. Feedback systems: <strong>in</strong>put-output properties. AcademicPress, 1975.J.M. Dion <strong>and</strong> C. Commault. The m<strong>in</strong>imal delay decoupl<strong>in</strong>g problem: feedbackimplementation with stability. SIAM J. <strong>Control</strong> <strong>and</strong> Optimization, 26:66–82, 1988.J.C. Doyle, B.A. Francis, <strong>and</strong> A.R. Tannenbaum. Feedback <strong>Control</strong> Theory. MacmillanPublish<strong>in</strong>g Company, New York, 1992.J.C. Doyle, T.T. Georgiou, <strong>and</strong> M.C. Smith. The parallel projection operators of anonl<strong>in</strong>ear feedback system. Systems <strong>and</strong> <strong>Control</strong> Letters, 20:79–85, 1993.J.C. Doyle <strong>and</strong> G. Ste<strong>in</strong>. Multivariable feedback design: concepts for a classical/modernsynthesis. IEEE Trans. on Automatic <strong>Control</strong>, 26, 1981.G. Dullerud <strong>and</strong> K. Glover. Robust stabilization of sampled-data systems to structuredLTI perturbations. IEEE Trans. on Automatic <strong>Control</strong>, 38(10):1497–1508, October1993.M.J. Er <strong>and</strong> B.D.O. Anderson. Discrete-time loop transfer recovery via generalizedsampled-data hold functions based compensators. Int. J. of Robust <strong>and</strong> Nonl<strong>in</strong>ear<strong>Control</strong>, 4:741–756, 1994.A. Feuer. Periodic control of l<strong>in</strong>ear time-<strong>in</strong>variant systems – a critical po<strong>in</strong>t of view.In Proceed<strong>in</strong>gs of the 12th IFAC World Congress, pages VIII–145, 1993.


350 BibliographyA. Feuer <strong>and</strong> G. Goodw<strong>in</strong>. Sampl<strong>in</strong>g <strong>in</strong> Digital Signal Process<strong>in</strong>g <strong>and</strong> <strong>Control</strong>.Birkhäuser, 1996.A. Feuer <strong>and</strong> G.C. Goodw<strong>in</strong>. Generalized sample hold functions: Frequency doma<strong>in</strong>analysis of robustness, sensitivity, <strong>and</strong> <strong>in</strong>tersample difficulties. IEEE Trans.on Automatic <strong>Control</strong>, 39(5):1042, May 1994.B. Francis. Lectures on H ∞ control <strong>and</strong> sampled-data systems. In C. Foias,E. Mosca, <strong>and</strong> L. P<strong>and</strong>olfi, editors, H ∞-control theory: lectures given at the 2nd. sessionof the CIME, Lecture notes <strong>in</strong> mathematics, pages 37–105. Spr<strong>in</strong>ger-Verlag,1991.B.A. Francis <strong>and</strong> T.T. Georgiou. Stability theory for l<strong>in</strong>ear time-<strong>in</strong>variant plantswith periodic digital controllers. IEEE Trans. on Automatic <strong>Control</strong>, 33(9):820–832, January 1988.B.A. Francis <strong>and</strong> G. Zames. On H ∞-optimal sensitivity theory for SISO feedbacksystems. IEEE Trans. on Automatic <strong>Control</strong>, 29(1):pp. 9–16, 1984.G.F. Frankl<strong>in</strong>, J.D. Powell, <strong>and</strong> A. Emami-Nae<strong>in</strong>i. Feedback <strong>Control</strong> of Dynamic Systems.Addison-Wesley, Read<strong>in</strong>g, MA, 3rd edition, 1994.G.F. Frankl<strong>in</strong>, J.D. Powell, <strong>and</strong> M.L. Workman. Digital <strong>Control</strong> of Dynamic Systems.Addison-Wesley, Read<strong>in</strong>g, MA, 2nd edition, 1990.J.S. Freudenberg <strong>and</strong> D.P. Looze. Right half plane poles <strong>and</strong> zeros <strong>and</strong> designtradeoffs <strong>in</strong> feedback systems. IEEE Trans. on Automatic <strong>Control</strong>, 30(6):555–565,June 1985.J.S. Freudenberg <strong>and</strong> D.P. Looze. A sensitivity tradeoff for plants with time delay.IEEE Trans. on Automatic <strong>Control</strong>, 32(2):99–104, February 1987.J.S. Freudenberg <strong>and</strong> D.P. Looze. Frequency Doma<strong>in</strong> Properties of Scalar <strong>and</strong> MultivariableFeedback Systems, volume 104 of Lecture Notes <strong>in</strong> <strong>Control</strong> <strong>and</strong> InformationSciences. Spr<strong>in</strong>ger-Verlag, 1988.J.S. Freudenberg, R.H. Middleton, <strong>and</strong> J.H. Braslavsky. Robustness of zero-shift<strong>in</strong>gvia generalized sampled-data hold functions. In Proceed<strong>in</strong>gs of the 33th CDC,pages 231–235, Florida, December 1994.J.S. Freudenberg, R.H. Middleton, <strong>and</strong> J.H. Braslavsky. Inherent design limitationsfor l<strong>in</strong>ear sampled-data feedback systems. International Journal of <strong>Control</strong>, 61(6):1387–1421, June 1995.M.F. Gardner <strong>and</strong> J.L. Barnes. Transients <strong>in</strong> L<strong>in</strong>ear Systems. John Wiley & Sons, Inc.,New York, 1949.T.T. Georgiou <strong>and</strong> M.C. Smith. Distance measures for uncerta<strong>in</strong> nonl<strong>in</strong>ear systems.In Proc. of the 3rd European <strong>Control</strong> Conference, pages 1016–1020, Rome,Italy, September 1995a.


Bibliography 351T.T. Georgiou <strong>and</strong> M.C. Smith. Metric uncerta<strong>in</strong>ty <strong>and</strong> nonl<strong>in</strong>ear feedback stabilization.In B.A. Francis <strong>and</strong> A.R. Tannenbaum, editors, Feedback <strong>Control</strong>, Nonl<strong>in</strong>earSystems <strong>and</strong> Complexity, volume 202 of Lecture Notes <strong>in</strong> <strong>Control</strong> <strong>and</strong> InformationSciences, pages 88–98. Spr<strong>in</strong>ger-Verlag, 1995b.T.T. Georgiou <strong>and</strong> M.C. Smith. Remarks on robustness of nonl<strong>in</strong>ear systems. InProc. 34th Conf. Decision <strong>Control</strong>, pages 1662–1663, New Orleans, Louisiana, December1995c.J.-C. Gille, M.J. Pelegr<strong>in</strong>, <strong>and</strong> P. Decaulne. Feedback <strong>Control</strong> Systems. McGraw-HillBook Co., 1959.G.H. Golub <strong>and</strong> C.F. Van Loan. Matrix Computations. Johns Hopk<strong>in</strong>s Univ. Press.,Baltimore, 1983.G.I. Gómez <strong>and</strong> G.C. Goodw<strong>in</strong>. Vectorial sensitivity constra<strong>in</strong>ts for l<strong>in</strong>ear multivariablesystems. In Proceed<strong>in</strong>gs of the 34th CDC, New Orleans, LO, December1995.G.C. Goodw<strong>in</strong> <strong>and</strong> A. Feuer. L<strong>in</strong>ear periodic control: a frequency doma<strong>in</strong> viewpo<strong>in</strong>t.Systems <strong>and</strong> <strong>Control</strong> Letters, 19:379–390, 1992.G.C. Goodw<strong>in</strong> <strong>and</strong> G.I. Gómez. Frequency doma<strong>in</strong> design trade-offs for l<strong>in</strong>earperiodic feedback systems. Technical Report EE9544, Department of Electrical<strong>and</strong> Computer Eng<strong>in</strong>eer<strong>in</strong>g, The University of Newcastle, Australia, 1995.G.C. Goodw<strong>in</strong>, D.Q. Mayne, <strong>and</strong> J. Shim. Trade-offs <strong>in</strong> l<strong>in</strong>ear filter design. Automatica,31(10):1367–1376, Oct. 1995.G.C. Goodw<strong>in</strong> <strong>and</strong> M. Salgado. Frequency doma<strong>in</strong> sensitivity functions for cont<strong>in</strong>uoustime systems under sampled data control. Automatica, 30(8):1263, August1994.G.C.G. Goodw<strong>in</strong> <strong>and</strong> M.M. Seron. Complementarity constra<strong>in</strong>ts for nonl<strong>in</strong>ear systems.In Proc. of the IFAC Symposium on Nonl<strong>in</strong>ear <strong>Control</strong> System Design, TahoeCity, California, June 1995.T. Hagiwara, Y. Ito, <strong>and</strong> M. Araki. Computation of the frequency response ga<strong>in</strong>s<strong>and</strong> H ∞-norm of a sampled-data system. Systems <strong>and</strong> <strong>Control</strong> Letters, 25:281–288,1995.S. Hara <strong>and</strong> H.-K. Sung. Constra<strong>in</strong>ts on sensitivity characteristics <strong>in</strong> l<strong>in</strong>ear multivariablediscrete-time control systems. L<strong>in</strong>ear Algebra <strong>and</strong> its Applications, 122-124:889–919, 1989.D.J. Hill <strong>and</strong> P.J. Moylan. Dissipative dynamical systems: basic <strong>in</strong>put-output <strong>and</strong>state properties. J. of the Frankl<strong>in</strong> Institute, 309(Nr. 5):pp. 327–357, 1980.K. Hoffman. Banach spaces of analytic functions. Dover, New York, 1962.I.M. Horowitz. Synthesis of feedback systems. Academic Press, New York, 1963.


352 BibliographyN.T. Hung <strong>and</strong> B.D.O. Anderson. Triangularization technique for the design ofmultivariable control systems. IEEE Trans. on Automatic <strong>Control</strong>, 24(3):455–460,1979.P.T. Kabamba. <strong>Control</strong> of l<strong>in</strong>ear systems us<strong>in</strong>g generalized sampled-data holdfunctions. IEEE Trans. on Automatic <strong>Control</strong>, 32(9):772–783, September 1987.T. Kailath. An <strong>in</strong>novations approach to least-squares estimation. Part I: L<strong>in</strong>earfilter<strong>in</strong>g <strong>in</strong> additive white noise. IEEE Trans. on Automatic <strong>Control</strong>, 13(6):pp. 647–655, 1968.T. Kailath. A view of three decades of l<strong>in</strong>ear filter<strong>in</strong>g theory. IEEE Trans. on InformationTheory, 20(6):145–181, 1974.T. Kailath. L<strong>in</strong>ear systems. Prentice-Hall, Inc., Englewood Cliffs, NJ, 1980.T. Kailath. Lectures on Wiener <strong>and</strong> Kalman Filter<strong>in</strong>g. Spr<strong>in</strong>ger-Verlag, 1981.T. Kailath <strong>and</strong> P. Frost. An <strong>in</strong>novations approach to least-squares estimation.Part II: L<strong>in</strong>ear smooth<strong>in</strong>g <strong>in</strong> additive white noise. IEEE Trans. on Automatic <strong>Control</strong>,13(6):pp. 655–660, 1968.R. Kalman, B.L. Ho, <strong>and</strong> K. Narendra. <strong>Control</strong>lability of l<strong>in</strong>ear dynamical systems.In Contributions to differential equations. Interscience, 1963. Vol.1.R.E. Kalman <strong>and</strong> R.S. Bucy. New results <strong>in</strong> l<strong>in</strong>ear filter<strong>in</strong>g <strong>and</strong> prediction theory.Trans. ASME, J. Basic Engrg., 83-D:95–107, 1961.W. Kaplan. Advanced Calculus. Addison-Wesley, Read<strong>in</strong>g, MA, 2nd edition, 1973.P. Khargonekar, K. Poolla, <strong>and</strong> A. Tannenbaum. Robust control of l<strong>in</strong>ear time<strong>in</strong>variantplants us<strong>in</strong>g periodic compensation. IEEE Trans. on Automatic <strong>Control</strong>,30(11):1088–1096, November 1985.M. K<strong>in</strong>naert <strong>and</strong> Y. Peng. Residual generator for sensor <strong>and</strong> actuator fault detection<strong>and</strong> isolation: a frequency doma<strong>in</strong> approach. Int. J. of <strong>Control</strong>, 61(6):1423–1435, 1995.H. Kwakernaak. M<strong>in</strong>imax frequency doma<strong>in</strong> performance <strong>and</strong> robustness optimizationof l<strong>in</strong>ear feedback systems. IEEE Trans. on Automatic <strong>Control</strong>, 30:994–1004, 1985.H. Kwakernaak. Symmetries <strong>in</strong> control system design. In A. Isidori, editor, Trends<strong>in</strong> control: A European perspective, pages 17–51. Spr<strong>in</strong>ger-Verlag, 1995.H. Kwakernaak <strong>and</strong> R. Sivan. L<strong>in</strong>ear Optimal <strong>Control</strong> Systems. Wiley-Interscience,New York, 1972.S. Lee, S.M. Meerkov, <strong>and</strong> T. Runolfsson. Vibrational feedback control: zero placementcapabilities. IEEE Trans. on Automatic <strong>Control</strong>, 32(7):604–611, 1987.N. Lev<strong>in</strong>son <strong>and</strong> R.M. Redheffer. Complex Variables. Holden-Day, Inc., 1970.


Bibliography 353A.G.J. MacFarlane <strong>and</strong> N. Karcanias. Poles <strong>and</strong> zeros of l<strong>in</strong>ear multivariable systems:a survey of the algebraic, geometric <strong>and</strong> complex-varible theory. Int. J. of<strong>Control</strong>, 24:33–74, 1976.J.M. Maciejowski. Multivariable Feedback Design. Addison-Wesley Publish<strong>in</strong>g Company,1991.A.I Markushevich. Theory of functions of a complex variable, volume 1. Prentice-Hall,Inc., Englewood Cliffs, N.J., 1965.R.H. Mart<strong>in</strong> Jr. Nonl<strong>in</strong>ear operators <strong>and</strong> differential equations <strong>in</strong> Banach spaces. JohnWiley & Sons, Inc., New York, 1976.M.A. Massoumnia, G.C. Verghese, <strong>and</strong> A.S. Willsky. Failure detection <strong>and</strong> identification.IEEE Trans. on Automatic <strong>Control</strong>, 34(3):316–321, March 1989.D.G. Meyer. A parametrization of stabiliz<strong>in</strong>g controllers for multirate sampleddatasystems. IEEE Trans. on Automatic <strong>Control</strong>, 35(11):233–236, 1990.R.H. Middleton. Trade-offs <strong>in</strong> l<strong>in</strong>ear control systems design. Automatica, 27(2):281–292, March 1991.R.H. Middleton <strong>and</strong> J.S. Freudenberg. Non-pathological sampl<strong>in</strong>g for generalisedsampled-data hold functions. Automatica, 31(2), February 1995.R.H. Middleton <strong>and</strong> G.C. Goodw<strong>in</strong>. Digital control <strong>and</strong> estimation. A unified approach.Prentice-Hall, Inc., 1990.R.H. Middleton <strong>and</strong> J.Q. Xie. Non-pathological sampl<strong>in</strong>g for high order generalisedsampled-data hold functions. In Proceed<strong>in</strong>gs of the ACC, Seattle, WA, June1995a.R.H. Middleton <strong>and</strong> J.Q. Xie. Non-pathological sampl<strong>in</strong>g for high order generalisedsampled-data hold functions. In Proceed<strong>in</strong>gs of the ACC, Seattle, WA, June1995b.M. Morari <strong>and</strong> E. Zafiriou. Robust Process <strong>Control</strong>. Prentice-Halls, EnglewoodCliffs, 1989.K.M. Nagpal <strong>and</strong> P.P. Kharghonekar. Filter<strong>in</strong>g <strong>and</strong> smooth<strong>in</strong>g <strong>in</strong> an H ∞ sett<strong>in</strong>g.IEEE Trans. on Automatic <strong>Control</strong>, 36:152–166, 1991.P. Nasl<strong>in</strong>. The dynamics of l<strong>in</strong>ear <strong>and</strong> non-l<strong>in</strong>ear systems. Blackie, London, 1965.J. O’Reilly. Observers for L<strong>in</strong>ear Systems. Academic Press, 1983.S.D. O’Young <strong>and</strong> B. Francis. Sensitivity tradeoffs for multivariable plants. IEEETrans. on Automatic <strong>Control</strong>, 30(7):625–632, July 1985.I. Postlethwaite <strong>and</strong> A.G.J. MacFarlane. A Complex Variable Approach to the analysisof l<strong>in</strong>ear Multivariable Feedback Systems, volume 12 of Lecture Notes <strong>in</strong> <strong>Control</strong> <strong>and</strong>Information Sciences. Spr<strong>in</strong>ger-Verlag, 1979.


354 BibliographyT. Ramstad. Analysis/synthesis filterbanks with critical sampl<strong>in</strong>g. In Proc. IEEEConf. DSP., 1984.R. Ravi, P.P. Kharghonekar, K.D. M<strong>in</strong>to, <strong>and</strong> C.N. Nett. <strong>Control</strong>ler parametrizationfor time-vary<strong>in</strong>g multirate plants. IEEE Trans. on Automatic <strong>Control</strong>, 35(11):1259–1262, 1990.O. Reynolds. An experimental <strong>in</strong>vestigation of the circumstances which determ<strong>in</strong>ewhether the motion of water shall be direct or s<strong>in</strong>uous, <strong>and</strong> of the laws ofresistance <strong>in</strong> parallel channels. Trans. Royal Society, London, 174, 1883.F. Riesz <strong>and</strong> B. Sz.-Nagy. Functional Analysis. Dover, New York, 1990.H.H. Rosenbrock. Design of multivariable control systems us<strong>in</strong>g the <strong>in</strong>versenyquist array. IEE Proceed<strong>in</strong>gs Part D, 116(11):1929–1936, 1969.Y.N. Rosenvasser. Mathematical description <strong>and</strong> analysis of multivariablesampled-data systems <strong>in</strong> cont<strong>in</strong>uous-time: Part I. Automation <strong>and</strong> Remote <strong>Control</strong>,56(4):526–540, 1995. Part II, Automation <strong>and</strong> Remote <strong>Control</strong>, vol.56(5), pp.684-697, 1995.W. Rud<strong>in</strong>. Real <strong>and</strong> complex analysis. McGraw-Hill Book Co., 3rd. edition, 1987.I.W. S<strong>and</strong>berg. An observation concern<strong>in</strong>g the application of the contraction mapp<strong>in</strong>gfixed-po<strong>in</strong>t theorem <strong>and</strong> a result concern<strong>in</strong>g the norm-boundedness of solutionsof nonl<strong>in</strong>ear functional equations. Bell Sys. Tech. J., 44:1809–1812, 1965.M.M. Seron. Complementary operators <strong>in</strong> filter<strong>in</strong>g <strong>and</strong> control. PhD thesis, Departmentof Electrical <strong>and</strong> Computer Eng<strong>in</strong>eer<strong>in</strong>g, The University of Newcastle,NSW 2308 Australia, October 1995.M.M. Seron <strong>and</strong> G.C. Goodw<strong>in</strong>. Design limitations <strong>in</strong> l<strong>in</strong>ear filter<strong>in</strong>g. In Proceed<strong>in</strong>gsof the 34th CDC, New Orleans, LO, December 1995.M.M. Seron <strong>and</strong> G.C. Goodw<strong>in</strong>. Sensitivity limitations <strong>in</strong> nonl<strong>in</strong>ear feedback control.Systems <strong>and</strong> <strong>Control</strong> Letters, 27:249–254, 1996.J.S. Shamma. Performance limitations <strong>in</strong> sensitivity reduction for nonl<strong>in</strong>ear plants.Systems <strong>and</strong> <strong>Control</strong> Letters, 17:43–47, 1991.J.S. Shamma <strong>and</strong> M.A. Dahleh. Time-vary<strong>in</strong>g versus time-<strong>in</strong>variant compensationfor rejection of persistent bounded disturbances <strong>and</strong> robust stabilization. IEEETrans. on Automatic <strong>Control</strong>, 36(7):838–847, 1991.C.E. Shannon. A mathematical theory of communications. BSTJ, 27:379–623, 1948.R.G. Shenoy, D. Burnside, <strong>and</strong> T.W. Parks. L<strong>in</strong>ear periodic systems <strong>and</strong> multiratefilter design. IEEE Trans. on Signal Process<strong>in</strong>g, 42(9):2242–2256, 1994.N. Sivashankar <strong>and</strong> P. Khargonekar. Robust stability <strong>and</strong> performance analysis forsampled-data systems. IEEE Trans. on Automatic <strong>Control</strong>, 39(1):58–69, January1993.


Bibliography 355M.J.T. Smith <strong>and</strong> T.J. Barnwell. A new filter bank theory for time-frequency representation.IEEE Trans. on Acoustics, Speech, <strong>and</strong> Signal Process<strong>in</strong>g, 35(3):314–327,1987.E.D. Sontag. Mathematical control theory. Determ<strong>in</strong>istic f<strong>in</strong>ite dimensional systems.Spr<strong>in</strong>ger-Verlag, 1990.V.R. Sule <strong>and</strong> V.V. Athani. Directional sensitivity tradeoffs <strong>in</strong> multivariable feedbacksystems. Automatica, 27(5):869–872, 1991.H.-K. Sung <strong>and</strong> S. Hara. Properties of sensitivity <strong>and</strong> complementary sensitivityfunctions <strong>in</strong> s<strong>in</strong>gle-<strong>in</strong>put s<strong>in</strong>gle-output digital control systems. Int. J. of <strong>Control</strong>,48(6):2429–2439, 1988.J.G. Truxal. <strong>Control</strong> System Synthesis. McGraw-Hill, New York, 1955.M. Vetterli. A theory of multirate filter banks. IEEE Trans. on Acoustics, Speech, <strong>and</strong>Signal Process<strong>in</strong>g, 35(3):356–372, 1987.M. Vetterli. Invertibility of l<strong>in</strong>ear periodically time-vary<strong>in</strong>g filters. IEEE Trans. onCircuits <strong>and</strong> Systems, 36(1):148–150, 1989.M. Vidyasagar. <strong>Control</strong> System Synthesis, A Factorization Approach. MIT Press, Cambridge,1985.J.E. Wall, Jr., J.C. Doyle, <strong>and</strong> C.A. Harvey. Trade-offs <strong>in</strong> the design of multivariablefeedback systems. In Proc. of the 18th Allerton Conference, pages 715–725, 1980.S.R. Weller. Sensitivity limitations for multivariable l<strong>in</strong>ear filter<strong>in</strong>g. Technical report,Dept of Electrical <strong>and</strong> Electronic Eng<strong>in</strong>eer<strong>in</strong>g, University of Melbourne,Australia, February 1996.S.R. Weller <strong>and</strong> G.C. Goodw<strong>in</strong>. <strong>Control</strong>ler design for partial decoupl<strong>in</strong>g for l<strong>in</strong>earmultivariable systems. In Proceed<strong>in</strong>gs of the 32nd CDC, pages 833–834, December1993.D.V. Widder. Advanced Calculus. Prentice-Hall, Inc., Englewood Cliffs, N.J., 2ndedition, 1961.J.C. Willems. The Analysis of Feedback Systems. The MIT Press, Cambridge, Massachusetts,1971a.J.C. Willems. Least-squares stationary optimal control <strong>and</strong> the algebraic Riccatiequation. IEEE Trans. on Automatic <strong>Control</strong>, 16:621–634, 1971b.I. Yaesh <strong>and</strong> U. Shaked. Game theoretic approach to optimal l<strong>in</strong>ear estimation<strong>in</strong> the m<strong>in</strong>imum H ∞ norm sense. In Proc. 28th Conf. Decision <strong>Control</strong>, Tampa,Florida, 1989.Y. Yamamoto <strong>and</strong> M. Araki. Frequency responses for sampled-data systems - Theirequivalence <strong>and</strong> relationships. L<strong>in</strong>ear Algebra <strong>and</strong> its Applications, 206:1319–1339,1994.


356 BibliographyY. Yamamoto <strong>and</strong> P. Khargonekar. Frequency response of sampled-data systems.IEEE Trans. on Automatic <strong>Control</strong>, 41(2):166–176, 1996.L. Zadeh. Frequency analysis of variable networks. Proceed<strong>in</strong>gs of the I.R.E., 38:291–299, 1950.G. Zames. On the <strong>in</strong>put-output stability of time-vary<strong>in</strong>g nonl<strong>in</strong>ear feedback systems.Part I: Conditions derived us<strong>in</strong>g concepts of loop ga<strong>in</strong>, conicity <strong>and</strong> positivity.IEEE Trans. on Automatic <strong>Control</strong>, 11:228–238, 1966a.G. Zames. On the <strong>in</strong>put-output stability of time-vary<strong>in</strong>g nonl<strong>in</strong>ear feedback systems.Part II: Conditions <strong>in</strong>volv<strong>in</strong>g circles <strong>in</strong> the frequency plane <strong>and</strong> sector nonl<strong>in</strong>earities.IEEE Trans. on Automatic <strong>Control</strong>, 11:465–476, 1966b.G. Zames <strong>and</strong> D. Bensoussan. Multivariable feedback, sensitivity, <strong>and</strong> decentralizedcontrol. IEEE Trans. on Automatic <strong>Control</strong>, 28(11):1030–1035, November1983.E.H. Zarantonello. The closure of the numerical range conta<strong>in</strong>s the spectrum. PacificJournal of Mathematics, 22(3):575–595, 1967.J. Zhang <strong>and</strong> C. Zhang. Robustness analysis of control systems us<strong>in</strong>g generalizedsample hold functions. In Proceed<strong>in</strong>gs of the 33th CDC, Lake Buena Vista, FL,December 1994.Z. Zhang <strong>and</strong> J.S. Freudenberg. Discrete-time loop transfer recovery for systemswith nonm<strong>in</strong>imum phase zeros <strong>and</strong> time delays. Automatica, 29(2):351–363,March 1993.


Indexactuator fault, 206additive perturbation model, 35, 260alias component matrix, (see also modulationrepresentation) 132all bounded error estimators, 192, 207all diagonaliz<strong>in</strong>g postcompensators,203all stabiliz<strong>in</strong>g controllers, 112, 146all-pass transfer function, 65, 76, 88nanalytic function, 278–280derivatives, 304–305Anderson, B.D.O., 103, 166, 177, 229,237, 241angle between subspaces, 88anti-alias<strong>in</strong>g filter, 136nAraki, M., 141, 158, 159arc, 282area formula, 48Astolfi, A., 263Åström, K.J., 46, 141Athani, V., 117Banach space, 247, 256extended, 250, 257b<strong>and</strong>width, 15, 56, 60, 70, 72, 82, 109Barnwell, T.J., 132basic perturbation model, 248, 260Bernste<strong>in</strong>, D.S., 166BIBO robust stabilization<strong>and</strong> time-vary<strong>in</strong>g control, 133biproper transfer function, 27Björck, Å., 88Blaschke producthalf-plane, 65, 98, 150, 181, 200,217, 220, 224, 225, 234, 236unit disk, 76, 78, 114, 124Bode attenuation <strong>in</strong>tegral theorem, 49Bode ga<strong>in</strong>-phase relationship, 19, 40–45Bode <strong>in</strong>tegral constra<strong>in</strong>ts, 19MIMO control, 87–98, 326–332SISO control, 47–62, 79–83SISO filter<strong>in</strong>g, 183Bode plots, 30Bode, H.W., 19, 40n, 46, 47, 51, 84bounded error estimator, 172–176, 179,198, 209, 213, 227, 231nonl<strong>in</strong>ear, 268bounded variation, 136bounds on <strong>in</strong>f<strong>in</strong>ity norm<strong>in</strong>verted pendulum, 74MIMO control, 73, 253MIMO filter<strong>in</strong>g, 201periodic control, 128


358 Indexsampled-data control, 152, 154SISO control, 63, 69, 70, 72, 79SISO filter<strong>in</strong>g, 185–195Boyd, S., 87, 92, 116branch cut, 54, 184, 313branch po<strong>in</strong>t, 313–315Braslavsky, J.H., 136, 137, 158, 159,208Bromwich Integral, 333Brown, J.W., 324Bucy, R.S., 210Burnside, D., 132cancelations, (see also nonl<strong>in</strong>ear cancelations)67, 75, 105–107,112, 129, 153, 173, 249canonical zero direction, 28, 100, 129,201, 202Cauchy <strong>in</strong>tegral formula, 296–297Cauchy <strong>in</strong>tegral theorem, 40, 49, 55,291–295Cauchy-Riemann equations, 277, 324channelb<strong>and</strong>-limited, 4capacity, 4cont<strong>in</strong>uous, 4ncharacteristic polynomial, 8Chen, J., 88, 95, 96, 116, 117, 140, 208,325Chen, M.J., 337Chen, T., 159Chirarattananon, S., 237class R function, 65nclosed-loop stability, see <strong>in</strong>ternal stabilityclosure of a set, 247cod<strong>in</strong>g, 4communications theory, 4complementaritycontrol, 8, 53, 64filter<strong>in</strong>g, 167filter<strong>in</strong>g vs. control, 169–172nonl<strong>in</strong>ear control, 254nonl<strong>in</strong>ear filter<strong>in</strong>g, 265prediction, 212smooth<strong>in</strong>g, 231complementary sensitivity, see sensitivityfunctionscomplex differentiation, 276–278complex <strong>in</strong>tegration, 36, 281–288complex variable theory, 275–324analytic functions, 278–280Cauchy <strong>in</strong>tegral formula, 296–297Cauchy <strong>in</strong>tegral theorem, 291–295Cauchy-Riemann equations, 277curves, 281–282differentiation, 276–278doma<strong>in</strong>s, 275–276entire functions, 322–324Green’s theorem, 289–291harmonic functions, 280–281<strong>in</strong>tegral theorems, 289–297<strong>in</strong>tegration, 281–288maximum modulus pr<strong>in</strong>ciple, 321–322Poisson <strong>in</strong>tegral formula, 298–303power series, 304–310regions, 275–276residues, 315–319, 334s<strong>in</strong>gularities, 310–315component fault, 208composition of operators, see nonl<strong>in</strong>earoperatorcont<strong>in</strong>uous-time system, 26contour, 282controlgeneral concepts, 25–46MIMO limitations, 85–117nonl<strong>in</strong>ear, 245nonl<strong>in</strong>ear limitations, 253–263periodic limitations, 119–133sampled-data limitations, 135–158SISO limitations, 47–84control system, 6, 31, 51, 75, 85, 122,249, 254convolution, 229complex, 342convolution equation, 26Conway, J.B., 324coprime factorization, 30, 86, 123, 146,207coprimeness, 30Coron, J-M., 245


Index 359cost of decoupl<strong>in</strong>g, 103–105, 108, 127,202–205covariance, 4Cramér-Rao <strong>in</strong>equality, 3Cramér-Rao lower bound, 4crossover frequency, 60Curchill, R.V., 324curve, 281–282D’Alembert, J. le R., 324Dahleh, M.A., 133de Souza, C.E., 166, 177decibel, 30ndecoupl<strong>in</strong>g, 103–105, (see also cost ofdecoupl<strong>in</strong>g) 108, 202–205,207<strong>and</strong> periodic systems, 127–130design limitationsbrief history, 19–20communications theory, 4estimation theory, 3<strong>in</strong>troduction, 3–21MIMO control, 85–117MIMO filter<strong>in</strong>g, 197–208nonl<strong>in</strong>ear control, 253–260nonl<strong>in</strong>ear filter<strong>in</strong>g, 265–271periodic control, 119–133sampled-data control, 135–158SISO control, 47–84SISO filter<strong>in</strong>g, 179–196SISO prediction, 209–226SISO smooth<strong>in</strong>g, 227–241time doma<strong>in</strong>, 9–18design trade-offsfault detection <strong>and</strong> isolation, 208MIMO control, 96–98, 102–103,111MIMO filter<strong>in</strong>g, 202–208periodic control, 126–131sampled-data control, 151–156SISO control, 59–62, 67–73, 78–79, 82–83SISO filter<strong>in</strong>g, 184–189Desoer, C.A., 87, 92, 103, 116, 246, 252,337detection of faults, 205De Branges, L., 217, 235diagonal dom<strong>in</strong>ance, 103diagonalization, see decoupl<strong>in</strong>gdigital controller, 82, 148discrete loop transfer recovery, 156discrete pole-zero cancelations, 153,154discrete sensitivities, 139discrete-time systemvs. sampled-data system, 135MIMO limitations, 113–115, 119SISO limitations, 74–83transfer function, 27discretized plant, 82, 139disturbance rejection, optimal, 133divisive perturbation model, 35, 144,260Doan, H.B., 229, 241doma<strong>in</strong>, 275–276multiply connected, 294of an operator, see nonl<strong>in</strong>ear operatordom<strong>in</strong>ant time constant, 220n, 237,239, 240, 346double-sided Z-transform, 120, 132Doyle, J.C., 46, 84, 88, 252dynamic systems, 6Emami-Nae<strong>in</strong>i, A., 45entire function, 138, 213, 231, 322–324zeros, 216, 234error covariance, 4essential s<strong>in</strong>gularity, 65, 216, 311estimation theory, 3estimator, 164bounded error, 172–176, 179, 198,209, 213, 227, 231identity, 176nonl<strong>in</strong>ear bounded error, 268unbiased, 164, 175Euclidean norm, 253Euler, L., 324expectation, 4extended Banach space, 250, 257factorizationcoprime, 30, 86, 123, 146, 207<strong>in</strong>ner-outer, 88, 153, 326faultactuator <strong>and</strong> sensor, 206component, 208fault detection <strong>and</strong> isolation, 205


360 Indexmodel-based, 205sensitivity functions, 205under disturbances, 207feedback amplifier, 19, 47feedback loop, 6, 31, 51, 75, 85, 122,249, 254Lipschitz stability, 248L 2 stability, 249robust Lipschitz stability, 260–262Feuer, A., 121, 132, 158, 159filter, see estimatorfilter<strong>in</strong>g, (see also nonl<strong>in</strong>ear filter<strong>in</strong>g)163H ∞ optimal, 166, 192vs. prediction, 184, 220–225vs. smooth<strong>in</strong>g, 183, 236–240Bode <strong>in</strong>tegral constra<strong>in</strong>ts, 183complementary constra<strong>in</strong>t, 167general concepts, 163–177<strong>in</strong>terpolation constra<strong>in</strong>ts, 179, 198MIMO limitations, 197–208m<strong>in</strong>imum variance, 166, 190Poisson <strong>in</strong>tegral constra<strong>in</strong>ts, 182,199sensitivity functions, 165–172, 180,198SISO limitations, 179–196filter<strong>in</strong>g system, 163, 265, 268f<strong>in</strong>al value theorem, 58f<strong>in</strong>ite-jump discont<strong>in</strong>uity, 137fixed-lag smooth<strong>in</strong>g, see smooth<strong>in</strong>gfluid dynamics, 7Fourier series, 120Fourier transform, 132, 248Francis, B.A., 19, 20, 45, 63, 73, 84,133, 140, 159Frankl<strong>in</strong>, G.F., 45, 132, 141frequency doma<strong>in</strong> rais<strong>in</strong>g, see modulationrepresentationfrequency response, 29periodic systems, 121, 127sampled-data systems, 135, 141–143frequency response function, 136Freudenberg, J.S., 19, 20, 46, 54, 84,92, 116, 137, 138, 140, 158,159Frost, P., 226, 227, 229, 241Fu, M., 166, 177functionanalytic, 278–280complex-valued, 275–324entire, 138, 213, 322–324harmonic, 77, 280–281, 327meromorphic, 37of bounded variation, 136of uniform bounded variation,137subharmonic, 88, 92with branch po<strong>in</strong>ts, 319–321fundamental response, 136fundamental sensitivity functions, 142Gündes, A., 103Gómez, G.I., 98, 103, 105n, 116, 117,122n, 199, 202, 208generalized sampled-data hold, 138–140robustness, 156–158geometric multiplicity, 28, 89Georgiou, T.T., 133, 140, 252, 263Golub, G.H., 88Goodw<strong>in</strong>, G.C., 45, 84, 98, 103, 105n,116, 117, 121, 122n, 132, 141,158, 159, 166, 177, 196, 199,202, 208, 263Gra<strong>in</strong>ger, R.W., 208Great Picard theorem, 216Green’s theorem, 88, 91, 289–291, 327Guzzella, L., 263Haddad, W.M., 166Hagiwara, T., 141, 159Hamiltonian matrix, 238Hara, S., 84, 117harmonic conjugate, 281harmonic function, 77, 280–281, 327harmonic response functions, 142harmonics, 136Harvey, C., 88Hermite form, 203hidden modes, 32, 52n, 65, 140Hilbert space, 248Hill, D.J., 246H ∞, (see also <strong>in</strong>f<strong>in</strong>ity norm) 19mixed sensitivity problem, 192,194


Index 361Ho, B., 140hold, 82, 138–139<strong>in</strong>ner-outer factorization, 153zeros, 140, 149, 152Horowitz formula, 52Horowitz, I.M., 19, 48, 84H 2 , 247, 249H 2 , 254nHung, N., 103hybrid system, see sampled-data systemidentity observer, 176improvement of least-squares optimalsmoother, 237–240impulse, 137impulse modulation formula, 139nimpulse response, 26<strong>in</strong>cremental ga<strong>in</strong>, see Lipschitz constant<strong>in</strong>f<strong>in</strong>ity norm, 30, 166, 246, 254bounds, see bounds on <strong>in</strong>f<strong>in</strong>itynormof nonl<strong>in</strong>ear systems, 246<strong>in</strong>formation source, 4<strong>in</strong>itial value theorem, 56<strong>in</strong>ner product, 248<strong>in</strong>novations process, 168, 228, 237, 346<strong>in</strong>put zero direction, 28, 201<strong>in</strong>put-output operator, (see also nonl<strong>in</strong>earoperator) 245<strong>in</strong>put-output representation, 26<strong>in</strong>tegral constra<strong>in</strong>tsBode, see Bode <strong>in</strong>tegral constra<strong>in</strong>tsPoisson, see Poisson <strong>in</strong>tegral constra<strong>in</strong>tstime doma<strong>in</strong>, 9<strong>in</strong>tegral theorems, 289–297Cauchy <strong>in</strong>tegral formula, 296–297Cauchy <strong>in</strong>tegral theorem, 291–295Green’s theorem, 289–291<strong>in</strong>tegrators<strong>and</strong> step response, 9<strong>in</strong>teractor matrix, 105n, 111<strong>in</strong>ternal model control, 12<strong>in</strong>ternal stability, 32sampled-data, 140–141<strong>in</strong>terpolation constra<strong>in</strong>tsMIMO control, 85MIMO filter<strong>in</strong>g, 198nonl<strong>in</strong>ear control, 256periodic control, 123sampled-data control, 145–149SISO control, 52, 65, 75SISO filter<strong>in</strong>g, 179SISO prediction, 214SISO smooth<strong>in</strong>g, 232<strong>in</strong>tersample behavior, 83<strong>in</strong>verted pendulum, 5, 16, 73, 193<strong>in</strong>vertibility, left, 202<strong>in</strong>vertibility, right, 165<strong>in</strong>vertible operator, see nonl<strong>in</strong>ear operatorisolation of faults, 205Ito, Y., 141, 159Jordan’s lemma, 288, 336Kabamba, P.T., 138Kailath, T., 45, 166, 210, 226, 227, 229,241Kalman filter, 189, 210, 229, 237, 345predictor based on, 221, 223, 224Kalman, R., 140Kalman, R.E., 210Kaplan, W., 324Khargonekar, P.P., 132, 159, 166K<strong>in</strong>naert, M., 202Kwakernaak, H., 46, 84L’Hospital’s rule, 58, 81, 330Laplace transform, 9, 26abscissa of convergence, 342f<strong>in</strong>al value theorem, 58<strong>in</strong>itial value theorem, 56<strong>in</strong>version, 333modified, 211, 341Laplacian, 88n, 96, 281Laurent series, 49, 308–310LCF, see left coprime factorizationleast-squares optimalfilter<strong>in</strong>g, 189, 229, 237, 345prediction, 210smooth<strong>in</strong>g, 228, 229, 237–240, 345left coprime factorization, 30left <strong>in</strong>vertible, 202


362 IndexLeibniz rule, 330Lev<strong>in</strong>son, N., 324, 325limitations, see design limitationsl<strong>in</strong>ear time-<strong>in</strong>variant system, 26–31,122n, 247, 249, 253vs. nonl<strong>in</strong>ear system, 271vs. periodic system, 128, 130–133as design goal of periodic system,127–130<strong>in</strong>f<strong>in</strong>ity norm, 246Lipschitz constant, 247, 256sub-multiplicative property, 247Lipschitz norm, see Lipschitz constantLipschitz operator, see nonl<strong>in</strong>ear operatorLipschitz stability, 248robust, 260–262logarithmic series, 55, 314, 330loop transfer recovery, 156Looze, D.P., 19, 20, 46, 54, 84, 92, 116LQG-LQR, 18, 74LTI systems, see l<strong>in</strong>ear time-<strong>in</strong>variantsystemsL 2 , 246–248, 254L 2 -ga<strong>in</strong>, 246, 257MacFarlane, A.G.J., 46Maciejowski, J.M., 45, 46Markushevich, A.I., 324Mart<strong>in</strong> Jr., R., 248, 252Massoumnia, M.A., 206matrix <strong>in</strong>version lemma, 90, 346maximum modulus pr<strong>in</strong>ciple, 63, 321–322Mayne, D.Q., 166, 177measurement noise, 163Meerkov, L.S., 133Me<strong>in</strong>sma, G., 137, 159meromorphic function, 37Meyer, D.G., 132Middleton, R.H., 20, 21, 45, 54, 84,137, 138, 140, 158, 159m<strong>in</strong>imal realization, 27nm<strong>in</strong>imum phase system, 27, 53ga<strong>in</strong>-phase relationship, 19, 40–45m<strong>in</strong>imum variance filter<strong>in</strong>g, 166M<strong>in</strong>to, K.D., 132mixed sensitivity problem, 192, 194model-based fault detection <strong>and</strong> isolation,205modified Z-transform, 83nmodulated complementary sensitivity,123modulated sensitivity, 123modulated transfer matrix, 121, 123modulation representation, 120–122,132Moore, J.B., 166, 177Moylan, P.J., 246multiplication operator, 343multiplicative perturbation model, 35,144, 260multirate sampl<strong>in</strong>g, 119Nagpal, K.M., 166Narendra, K., 140Nett, C.N., 117, 132Nevanl<strong>in</strong>na-Pick theory, 20, 73nonl<strong>in</strong>ear bounded error estimator,268nonl<strong>in</strong>ear cancelations, 249–251l<strong>in</strong>ear motivation, 249pole-zero, 251, 258zero-pole, 251, 269, 270nonl<strong>in</strong>ear control, 245complementarity constra<strong>in</strong>t, 254<strong>in</strong>terpolation constra<strong>in</strong>ts, 256sensitivity functions, 255nonl<strong>in</strong>ear filter<strong>in</strong>gcomplementarity constra<strong>in</strong>t, 265sensitivity functions, 266nonl<strong>in</strong>ear operatoradded doma<strong>in</strong>, 250, 269addition, 246composition, 246, 270composition doma<strong>in</strong>, 250doma<strong>in</strong>, 246, 250extended doma<strong>in</strong>, 250, 269general concepts, 245–252I/O, 248<strong>in</strong>vertible, 247Lipschitz, 247, 257nonm<strong>in</strong>imum phase, 247, 250, 256,270on Banach spaces, 247on extended spaces, 250on Hilbert spaces, 248


Index 363on l<strong>in</strong>ear spaces, 246range, 246, 250right <strong>in</strong>vertible, 266stable, 247, 250unbiased, 248, 266unstable, 247, 250, 256, 258, 269,270nonl<strong>in</strong>ear systemvs. LTI system, 271nonm<strong>in</strong>imum phase nonl<strong>in</strong>ear operator,247, 250<strong>and</strong> nonl<strong>in</strong>ear filter<strong>in</strong>g sensitivity,270<strong>and</strong> nonl<strong>in</strong>ear sensitivity, 256<strong>and</strong> water-bed effect, 258nonm<strong>in</strong>imum phase zeros, 28, 122additional phase lag, 67n, 187<strong>and</strong> actuator fault detection, 208<strong>and</strong> b<strong>and</strong>width, 72<strong>and</strong> complementary sensitivity,57, 80<strong>and</strong> filter<strong>in</strong>g complementary sensitivity,184<strong>and</strong> filter<strong>in</strong>g sensitivity, 182, 184,187, 200<strong>and</strong> prediction sensitivity, 217<strong>and</strong> sensitivity, 65, 77, 99, 115,125, 150<strong>and</strong> smooth<strong>in</strong>g sensitivity, 235<strong>and</strong> step response, 11<strong>and</strong> undershoot, 15<strong>and</strong> water-bed effect, 63, 69, 78periodic systems, 124, 125, 131sampled-data systems, 148, 149,151, 152nonpathological sampl<strong>in</strong>g, 140nons<strong>in</strong>gular transfer matrix, 27null space, 100n, 104, 128, 129Nyquist frequency, 138, 156Nyquist plot, 38, 59, 68Nyquist range, 138Nyquist stability criterion, 37–40O’Reilly, J., 177O’Young, S.D., 20, 73observer, see estimatoropen-loop system, 31operator, see nonl<strong>in</strong>ear operatoroperator approach, (see also nonl<strong>in</strong>earoperator) 245output zero direction, 28overshoot, 14Padé approximation, 59n, 182nParks, T.W., 132Peng, Y., 202performance, 33specifications, 13, 71specifications for filter<strong>in</strong>g, 185periodic reflection, 145, 150periodic systemvs. LTI system, 128, 130–133design limitations, 119–133sensitivity functions, 122perturbation modeladditive, 35, 260basic, 248, 260divisive, 35, 260multiplicative, 35, 260phase lag, 67, 187phase marg<strong>in</strong>, 5Picard theorem, 216plant uncerta<strong>in</strong>ty, (see also perturbationmodel) 35Poisson <strong>in</strong>tegral constra<strong>in</strong>ts, 19<strong>and</strong> s<strong>in</strong>gular values, 96fault detection <strong>and</strong> isolation, 206MIMO control, 98–115MIMO filter<strong>in</strong>g, 199–202periodic control, 124–131sampled-data control, 150–156SISO control, 64–73, 75–79SISO filter<strong>in</strong>g, 181–183SISO prediction, 217–219SISO smooth<strong>in</strong>g, 234–236Poisson <strong>in</strong>tegral formulahalf plane, 64, 65, 181, 200, 298–302unit disk, 75, 79, 80, 302–303Poisson kernelhalf-plane, 68unit disk, 77Poisson summation formula, 139npoles, 27, 311directions, 94, 112geometric multiplicity, 89nimag<strong>in</strong>ary, 18, 66, 76, 96


364 Indexmultivariable, 27, 122, 253on unit circle, 77unstable, 11, 14, 28, 54, 61, 66,70, 77, 79, 92, 94, 154, 182,187, 201, 208, 218, 235unstable cancelations, see cancelationsPoolla, K., 132Postlethwaite, I., 46Powell, J.D., 45, 132, 141power series, 50, 304–310Laurent, 308–310Taylor, 306–308Praly, L., 245predictionvs. filter<strong>in</strong>g, 184, 220–225complementarity constra<strong>in</strong>t, 212effect of horizon on sensitivity,219–225general concepts, 209–212horizon, 209<strong>in</strong>terpolation constra<strong>in</strong>ts, 214least-squares optimal, 210Poisson <strong>in</strong>tegral constra<strong>in</strong>ts, 217sensitivity functions, 212SISO limitations, 209–226pr<strong>in</strong>cipal angle, 88, 95pr<strong>in</strong>cipal branch, 313pr<strong>in</strong>cipal ga<strong>in</strong>, 29, 30pr<strong>in</strong>cipal part, 310pr<strong>in</strong>ciple of the argument, 37probability theory, 3proper transfer function, 27push-pop effect, see water-bed effectrais<strong>in</strong>gfrequency doma<strong>in</strong>, see modulationrepresentationtime doma<strong>in</strong>, 120–123, 129, 132ramp response, 58Ramstad, T., 132r<strong>and</strong>om variable, 3range of operator, see nonl<strong>in</strong>ear operatorRavi, R., 132RCF, see right coprime factorizationRedheffer, R.M., 324, 325region, 275–276relative degree, 27, 56, 78, 80, 125removable s<strong>in</strong>gularity, 311residual, 206residue, 49, 315–319, 334at <strong>in</strong>f<strong>in</strong>ity, 49, 53, 316–319Reynolds number, 8Riccati equation, 190, 229, 237, 238,345Riesz-Fischer Theorem, 335right coprime factorization, 30right <strong>in</strong>vertible, 165nonl<strong>in</strong>ear operator, 266rise time, 13, 103robust BIBO stabilization<strong>and</strong> time-vary<strong>in</strong>g control, 133robust stability, see stability robustnessrobustness, 35, 72, 110nonl<strong>in</strong>ear systems, 260–262periodic systems, 133sampled-data systems, 143–145,336Rosenbrock, H., 103Rosenwasser, Y.N., 159Runolfsson, T., 133Salgado, M., 141, 158sampled-data system, 75, 82design limitations, 135–158frequency response, 135, 141–143<strong>in</strong>ternal stability, 140–141robustness, 143–145, 336sensitivity functions, 142steady-state response, 141, 142sampler, 138sampl<strong>in</strong>g, 122, 138nonpathological, 140sampl<strong>in</strong>g frequency, 138sampl<strong>in</strong>g period, 138S<strong>and</strong>berg, I.W., 246, 248semi-norm, 247sensitivity functions<strong>and</strong> disturbances <strong>in</strong> filter<strong>in</strong>g, 167as transfer functions <strong>in</strong> filter<strong>in</strong>g,168control, 7, 32, 51, 75, 85, 170fault detection <strong>and</strong> isolation, 205filter<strong>in</strong>g, 165–172, 180, 198<strong>in</strong>tegral constra<strong>in</strong>ts, see Bode <strong>and</strong>Poisson <strong>in</strong>tegral constra<strong>in</strong>ts


Index 365nonl<strong>in</strong>ear control, 255nonl<strong>in</strong>ear filter<strong>in</strong>g, 266performance considerations, 33–34periodic control, 122prediction, 212robustness considerations, 35–36,143–145, 260–262sampled-data control, 142smooth<strong>in</strong>g, 230, 345sensitivity reduction, 61, 69, 72, 128,183, 237nonl<strong>in</strong>ear, 257, 258sensor fault, 206separation pr<strong>in</strong>ciple of control <strong>and</strong>estimation, 176sequenceweakly convergent, 248Seron, M.M., 166, 177, 196, 208, 226,241, 252, 263, 271setclosure, 247weak closure, 248settl<strong>in</strong>g time, 13Shaked, U., 166, 177Shamma, J.S., 133, 248, 252, 258, 263Shannon’s theorem, 4Shannon-Hartley theorem, 4Shenoy, R.G., 132Shim, J., 166, 177signal process<strong>in</strong>g, 132signal-to-noise ratio, 4s<strong>in</strong>gular values, 29, 88, 94<strong>and</strong> Bode <strong>in</strong>tegrals, 87–98, 326–332<strong>and</strong> decoupl<strong>in</strong>g, 110<strong>and</strong> Poisson <strong>in</strong>tegrals, 96s<strong>in</strong>gularities, 310–315slope<strong>in</strong> Bode plot, 43<strong>in</strong> step response, 56small ga<strong>in</strong> theorem, 248Smith, M.C., 252, 263Smith, M.J.T., 132smooth<strong>in</strong>gvs. filter<strong>in</strong>g, 183, 236–240complementarity constra<strong>in</strong>t, 231effect of lag on sensitivity, 236general concepts, 227–230<strong>in</strong>terpolation constra<strong>in</strong>ts, 232lag, 227least-squares optimal, 228, 229,237–240, 345Poisson <strong>in</strong>tegral constra<strong>in</strong>ts, 235sensitivity functions, 230, 345SISO limitations, 227–241Sontag, E.D., 45stability robustness, 35nonl<strong>in</strong>ear, 248, 260–262stable nonl<strong>in</strong>ear operator, 247, 250state-variable representation, 26steady-state response, 29sampled-data systems, 141, 142Ste<strong>in</strong>, G., 46step response<strong>and</strong> sensitivity, 56<strong>in</strong>tegral constra<strong>in</strong>ts, 9specifications, 13subharmonic function, 88, 92Sule, V., 117Sung, H.-K., 84, 117Tannenbaum, A.R., 84, 132Taylor series, 306–308Teel, A., 245time constant, dom<strong>in</strong>ant, 220n, 237,239, 240, 346time delay, 20, 54, 59, 65, 78, 154, 157,182time doma<strong>in</strong> constra<strong>in</strong>ts, 9–18time doma<strong>in</strong> rais<strong>in</strong>g, 120–123, 129, 132time-vary<strong>in</strong>g systemfrequency response, (see also periodic<strong>and</strong> sampled-data system)132total variation, 136trade-offs, see design trade-offstransfer function, 26all-pass, 65, 76, 88nbiproper, 27coprime factorization, 30<strong>in</strong>f<strong>in</strong>ity norm, 30<strong>in</strong>ner-outer factorization, 88, 326left <strong>in</strong>vertible, 202m<strong>in</strong>imum <strong>and</strong> nonm<strong>in</strong>imum phase,27modulated, 121, 123nons<strong>in</strong>gular, 27


366 Indexpr<strong>in</strong>cipal ga<strong>in</strong>s <strong>and</strong> directions,29proper, 27relative degree, 27right <strong>in</strong>vertible, 165s<strong>in</strong>gular values, 29stable, 27zeros <strong>and</strong> poles, 27transfer matrix, see transfer functiontransient response, see step responsetransmission zero, 27triangularization, 103type-1 system, 58unbiased estimator, 3, 164, 175unbounded signals, 250undershoot, 14, 103unfolded image, 145, 150, 153uniform bounded variation, 137uniform convergence theorem, 80, 81,92, 328unstable cancelations, see cancelationsunstable nonl<strong>in</strong>ear operator, 247, 250,258, 269<strong>and</strong> nonl<strong>in</strong>ear complementary sensitivity,256<strong>and</strong> nonl<strong>in</strong>ear filter<strong>in</strong>g complementaritysensitivity, 270<strong>and</strong> water-bed effect, 259unstable poles, 28<strong>and</strong> b<strong>and</strong>width, 61, 70<strong>and</strong> complementary sensitivity,66, 77, 154<strong>and</strong> filter<strong>in</strong>g complementarity sensitivity,182, 201<strong>and</strong> filter<strong>in</strong>g complementary sensitivity,188<strong>and</strong> filter<strong>in</strong>g sensitivity, 187<strong>and</strong> overshoot, 14<strong>and</strong> prediction complementaritysensitivity, 218<strong>and</strong> sensitivity, 54, 79, 92, 94<strong>and</strong> sensor fault detection, 208<strong>and</strong> smooth<strong>in</strong>g complementaritysensitivity, 235<strong>and</strong> step response, 11sampled-data systems, 148, 152–153unstructured plant uncerta<strong>in</strong>ty, 35Van Loan, C.E., 88velocity constant, 58, 81Verghese, G.C., 206Vetterli, M., 132Vidyasagar, M., 131, 246, 252Wall, Jr., J., 88water-bed effectfilter<strong>in</strong>g, 193l<strong>in</strong>ear control, 19, 62–64, 69, 78,105nonl<strong>in</strong>ear control, 258–260sampled-data systems, 151weak closure of a set, 248weakly convergent sequence, 248weighted length, 69, 78, 128, 185, 201weight<strong>in</strong>g functionhalf-plane, 67, 68periodic systems, 128unit disk, 77Weller, S.R., 103, 105n, 208white noise, 190, 228, 229, 345Widder, D.V., 324Willems, J.C., 238, 246Willsky, A.S., 206Wittenmark, B., 141Workman, M.L., 132, 141X-29 aircraft, 5, 44Xie, J.Q., 140, 158Yaesh, I., 166Yamamoto, Y., 159Youla parametrization, see all stabiliz<strong>in</strong>gcontrollersZadeh, L., 132Zames, G., 19, 63, 84, 246, 248Zarantonello, E.H., 247zero shift<strong>in</strong>g, 83, 156–158zero-order hold, 138, 149zeros, 27directions, 28, 103, 112, 201, 202geometric multiplicities, 28, 100nimag<strong>in</strong>ary, 18, 66, 76, 96multivariable, 27, 122, 198, 253nonm<strong>in</strong>imum phase, 11, 15, 28,57, 63, 65, 72, 77, 80, 99, 115,122, 124, 125, 131, 150, 182,


Index 367184, 187, 200, 208, 217, 235,247of entire function, 216, 234on unit circle, 77unstable cancelations, see cancelationsZhang, C., 158Zhang, J., 158Z-transform, 75double-sided, 120, 132modified, 83n

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!