Verification of Parameterised FPGA Circuit Descriptions with Layout ...

More documents

Recommendations

Info

CHAPTER 6. LAYOUT CASE STUDIES 136 Slices Util. t-PAR (s) Max freq. (Mhz) Unpipelined/Auto 777 15% 48 61.5 Unpipelined/Placed 406 7% 19 69.7 Pipelined/Auto 777 15% 28 206.0 Pipelined/Placed 406 7% 26 152.0 Table 6.1: Results for a single adder tree R using the timeless pre-condition and split into two repeated anti-delays within the parallel composition using the property that D is polymorphic: half 2 n ; [btreen(R ; D) ; D −n , btreen(R ; D) ; D −n ] ; R The induction hypothesis can then be used to complete the proof. Using this combinator we generate a 6-level tree of 8-bit ripple adders, producing a circuit which adds together 64 input values. The manual placement is compared with the identical circuit compiled without placement and using the Xilinx placement algorithm. 6.2.3 Results Table 6.1 shows the results for placed and unplaced pipelined and unpipelined adder trees. “Util” is the percentage of resources utilised on the device, “t-PAR” is the amount of time required to place and route the circuit. As expected, pipelining increases the maximum clock frequency significantly (although far from the predicted theoretical maximum of ×8). It is also interesting to note that the manually placed design has worse performance than the automatically placed version when the circuit is pipelined, even though the manually placed version has been mapped into fewer Virtex slices. We also experimented with placing multiple adder trees on the FPGA. Table 6.2 illustrates the results for an FPGA loaded with 7 of the adder trees. The difference in the resources used by the placed and unplaced descriptions is very significant, and possibly partially responsible for the fact that the placed version now exhibits significantly higher performance than the unplaced version regardless of pipelining. The difference in the number of slices used is quite interesting. It implies that the process of packing primitives into slices automatically does so much less densely than the manual
CHAPTER 6. LAYOUT CASE STUDIES 137 Slices Util. t-PAR (s) Max freq. (Mhz) Pwr (mW) Unpipelined/Auto 4872 95% 225 53.3 - Unpipelined/Placed 1907 37% 19 65.5 - Pipelined/Auto 4872 95% 142 123.0 1404 Pipelined/Placed 1908 37% 40 150.9 852 Table 6.2: Results for 7 adder trees method. This appears to be a result of the Xilinx algorithm only packing, as a first preference, “related” logic into the same slice. Thus, while the manually specified layout tends to use both function generators in a slice, the automatic one prefers to use only one. This may allow the Xilinx router to perform better and could explain why the automatically placed single pipelined adder tree example requires more FPGA resources than the placed version but still runs faster. We measure the power consumption of the pipelined variants. Running at the same clock frequency, the placed design consumes substantially less power (39% less dynamic power, once the quiescent consumption of the development board is subtracted) than the unplaced design, though it is unclear whether this is the result of the design using fewer logic resources or of better routing. 6.3 Median Filter Median filters are a special case of ranked order filtering. The median filtering operation is widely used in digital image processing to remove noise and in a variety of other applications. Our circuit will be restricted to one dimensional filtering, although the extension to a two dimensional filter is not difficult. A 1-dimensional median filtering operation involves “sliding” a filter window along a range of values and selecting the median value from the elements currently within the window. This can be achieved by sorting the elements and selecting the middle value - obviously the window size must always be an odd number so that there is a middle element to select. In our circuit the elements within the current window are stored and each cycle a new value is inserted while the oldest is discarded. Since only one element differs between different window positions we do not need to implement a full sorter but can simplify the circuitry to
Page 1 and 2:
Imperial College of Science, Techno
Page 3 and 4:
Acknowledgements Firstly, I’d lik
Page 5 and 6:
TABLE OF CONTENTS iv 2.5 Isabelle:
Page 7 and 8:
TABLE OF CONTENTS vi 5.3.1 Speciali
Page 9 and 10:
TABLE OF CONTENTS viii C.1.1 fst .
Page 11 and 12:
Chapter 1 Introduction This thesis
Page 13 and 14:
CHAPTER 1. INTRODUCTION 3 B A C Fig
Page 15 and 16:
CHAPTER 1. INTRODUCTION 5 pler, all
Page 17 and 18:
CHAPTER 2. BACKGROUND AND RELATED W
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
CHAPTER 3. GENERATING PARAMETERISED
Page 47 and 48:
Page 49 and 50:
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
Page 61 and 62:
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
Chapter 4 Verifying Circuit Layouts
Page 77 and 78:
CHAPTER 4. VERIFYING CIRCUIT LAYOUT
Page 79 and 80:
Page 81 and 82:
Page 83 and 84:
Page 85 and 86:
Page 87 and 88:
Page 89 and 90:
Page 91 and 92:
Page 93 and 94:
Page 95 and 96: CHAPTER 4. VERIFYING CIRCUIT LAYOUT
Page 117 and 118: Chapter 5 Specialisation In this ch
Page 119 and 120: CHAPTER 5. SPECIALISATION 109 opera
Page 121 and 122: CHAPTER 5. SPECIALISATION 111 // Ha
Page 123 and 124: CHAPTER 5. SPECIALISATION 113 circu
Page 125 and 126: CHAPTER 5. SPECIALISATION 115 const
Page 127 and 128: CHAPTER 5. SPECIALISATION 117 block
Page 129 and 130: CHAPTER 5. SPECIALISATION 119 Modif
Page 131 and 132: CHAPTER 5. SPECIALISATION 121 Buffe
Page 133 and 134: CHAPTER 5. SPECIALISATION 123 a fas
Page 135 and 136: CHAPTER 5. SPECIALISATION 125 block
Page 137 and 138: CHAPTER 5. SPECIALISATION 127 y y y
Page 139 and 140: CHAPTER 5. SPECIALISATION 129 with
Page 141 and 142: CHAPTER 6. LAYOUT CASE STUDIES 131
Page 145: CHAPTER 6. LAYOUT CASE STUDIES 135
Page 175 and 176: CHAPTER 7. CONCLUSION AND FUTURE WO
Page 185 and 186: Bibliography [1] A. Aggoun and N. B
Page 187 and 188: BIBLIOGRAPHY 177 [19] H. Gelernter.
Page 189 and 190: BIBLIOGRAPHY 179 [41] Y. Li and M.
Page 191 and 192: BIBLIOGRAPHY 181 [60] L. C. Paulson
Page 193 and 194: BIBLIOGRAPHY 183 [83] J. Voeten. On
Page 195 and 196: APPENDIX A. QUARTZ LANGUAGE GRAMMAR
Page 197 and 198:
Appendix B Theoretical Basis for La
Page 199 and 200:
APPENDIX B. THEORETICAL BASIS FOR L
Page 201 and 202:
Page 203 and 204:
Page 205 and 206:
Page 207 and 208:
Page 209 and 210:
Page 211 and 212:
Page 213 and 214:
Page 215 and 216:
Page 217 and 218:
Appendix C Placed Combinator Librar
Page 219 and 220:
APPENDIX C. PLACED COMBINATOR LIBRA
Page 221 and 222:
Page 223 and 224:
Page 225 and 226:
Page 227 and 228:
Page 229 and 230:
Page 231 and 232:
Page 233 and 234:
Page 235 and 236:
Page 237 and 238:
Page 239 and 240:
Page 241 and 242:
Page 243 and 244:
Page 245 and 246:
Page 247 and 248:
Page 249 and 250:
Page 251 and 252:
Page 253 and 254:
Page 255 and 256:
Page 257 and 258:
Page 259 and 260:
Page 261 and 262:
Page 263 and 264:
Page 265 and 266:
Page 267 and 268:
Page 269 and 270:
Page 271 and 272:
Page 273 and 274:
Page 275 and 276:
Page 277 and 278:
Page 279 and 280:
APPENDIX D. CIRCUIT LAYOUT CASE STU
Page 281 and 282:
Page 283 and 284:
Page 285 and 286:
Page 287 and 288:
Page 289 and 290:
Page 291 and 292:
Page 293 and 294:
Page 295 and 296:
Page 297 and 298:
Page 299 and 300:
Page 301 and 302:
Page 303 and 304:
Page 305 and 306:
Page 307 and 308:
Page 309 and 310:
Page 311 and 312:
Page 313 and 314:
Page 315 and 316:
Page 317 and 318:
Page 319 and 320:
Page 321 and 322:
Page 323 and 324:
Page 325 and 326:
Page 327:
show all

Verification of Parameterised FPGA Circuit Descriptions with Layout ...

Create successful ePaper yourself

Delete template?

Save as template?