Intel XENIX 286 Programmers Guide (86) - Tenox.tc

09.06.2013 Views
XENIX Programming lex: Lexical Analyzer Generator

For example,

    '.*'

might seem a good way of recognizing a string in single quotes. But it is an invitation for the program to read far ahead, looking for a distant single quote. Presented with the input

    'first' quoted string here, 'second' here

the above expression matches

    'first' quoted string here, 'second'

which is probably not what was wanted. A better rule is of the form

    '[^'\n]*'

which, on the above input, stops after 'first'. The consequences of errors like this are mitigated by the fact that the dot (.) operator does not match a newline. Therefore, no more than one line is ever matched by such expressions. Don't try to defeat this with expressions like

    (.|\n)+

or their equivalents; the lex generated program will try to read the entire input file, causing internal buffer overflows.

Note that lex is normally partitioning the input stream, not searching for all possible matches of each expression. This means that each character is accounted for only once. For example, suppose you want to count occurrences of both she and he in an input text. Some lex rules to do this might be

    she     s++;
    he      h++;
    \n      |
    .       ;

where the last two rules ignore everything besides he and she. Remember that the period (.) does not include the newline. Since she includes he, lex will normally not recognize the instances of he included in she, since once it has passed a she those characters are gone.

Sometimes the user would like to override this choice. The action REJECT means "go do the next alternative." It causes whatever rule was second choice after the current rule to be executed. The position of the input pointer is adjusted accordingly. Suppose you really want to count the included instances of he:

    she     { s++; REJECT; }
    he      { h++; REJECT; }
    \n      |
    .       ;

These rules are one way of changing the previous example to do just that. After counting each expression, it is rejected; whenever appropriate, the other expression will then be counted. In this example, of course, you could note that she includes he, but not vice versa, and omit the REJECT action on he; in other cases, however, it would not be possible to tell which input characters were in both classes.

Consider the two rules

    a[bc]+    { ... ; REJECT; }
    a[cd]+    { ... ; REJECT; }

If the input is ab, only the first rule matches, and on ad only the second matches. The input string accb matches the first rule for four characters and then the second rule for three characters. In contrast, the input accd agrees with the second rule for four characters and then the first rule for three.

In general, REJECT is useful whenever the purpose of lex is not to partition the input stream but to detect all examples of some items in the input, and the instances of these items may overlap or include each other. Suppose a digram table of the input is desired; normally the digrams overlap; e.g., the word the is considered to contain both th and he. Assuming a two-dimensional array named digram to be incremented, the appropriate source is

    %%
    [a-z][a-z]    { digram[yytext[0]][yytext[1]]++; REJECT; }
    .             ;
    \n            ;

where the REJECT is necessary to pick up a letter pair beginning at every character, rather than at every other character.

Remember that REJECT does not rescan the input. Instead it remembers the results of the previous scan. This means that if a rule with trailing context is found and REJECT executed, you must not have used unput to change the characters forthcoming from the input stream. This is the only restriction on the ability to manipulate the not-yet-processed input.

Specifying Left Context Sensitivity

Sometimes you may want to have several sets of lexical rules to be applied at different times in the input. For example, a compiler preprocessor might distinguish preprocessor statements and analyze them differently from ordinary statements. This requires sensitivity to prior context, and there are several ways of handling such problems. The caret (^) operator, for example, is a prior context operator, recognizing immediately preceding left context just as the dollar sign ($) recognizes immediately following right context. Adjacent left context could be extended, to produce a facility similar to that for adjacent right context, but it is unlikely to be as useful, since often the relevant left context appeared some time earlier, such as at the beginning of a line.


