23.05.2014 Views

Measurement of the Top Quark Mass, Cross Section and Anomalous ...

Measurement of the Top Quark Mass, Cross Section and Anomalous ...

Measurement of the Top Quark Mass, Cross Section and Anomalous ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Measurement</strong> <strong>of</strong> <strong>the</strong> <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong>, <strong>Cross</strong> <strong>Section</strong> <strong>and</strong><br />

<strong>Anomalous</strong> Couplings at <strong>the</strong> International Linear<br />

Collider<br />

Erik Devetak<br />

Brasenose College, Oxford<br />

Thesis submitted in fulfilment <strong>of</strong> <strong>the</strong> requirements for <strong>the</strong> degree <strong>of</strong><br />

Doctor <strong>of</strong> Philosophy at <strong>the</strong> University <strong>of</strong> Oxford<br />

Trinity Term, 2009


Abstract<br />

A feasibility study to measure <strong>the</strong> proprieties <strong>of</strong> <strong>the</strong> top quark at <strong>the</strong> International Linear<br />

Collider is presented. The study is performed in <strong>the</strong> fully hadronic channel e + e − → t¯t →<br />

b¯bq¯qq¯q <strong>and</strong> employs <strong>the</strong> Silicon Detector (SiD) concept to establish <strong>the</strong> precision achievable<br />

when measuring <strong>the</strong> cross section (∼ 1.3 fb), <strong>the</strong> top mass (∼ 49 MeV) <strong>and</strong> <strong>the</strong> forward<br />

backward asymmetry <strong>of</strong> <strong>the</strong> top quark (∼ 0.008) as well as <strong>of</strong> <strong>the</strong> b quark originating from<br />

<strong>the</strong> top decay (∼ 0.008).<br />

The s<strong>of</strong>tware developed for <strong>the</strong> tagging <strong>of</strong> heavy flavour jets <strong>and</strong> for reconstructing <strong>the</strong><br />

charge <strong>of</strong> <strong>the</strong> quark from which <strong>the</strong>y originate is also described. The jet tagging s<strong>of</strong>tware has<br />

been developed within <strong>the</strong> International Linear Detector framework; but tested also within<br />

SiD. The performance has been benchmarked using dijet samples at 91.2 GeV centre <strong>of</strong> mass<br />

energy. The achievable purity <strong>of</strong> b quark identification is <strong>of</strong> <strong>the</strong> order <strong>of</strong> 90% when a 70%<br />

efficiency is required. In <strong>the</strong> case <strong>of</strong> charge reconstruction a purity <strong>of</strong> up to 80% is achievable<br />

with an efficiency lower than 60% when <strong>the</strong> charges <strong>of</strong> well identified b quarks are being<br />

reconstructed. The charge reconstruction has been tested on a b¯bq¯qq¯q sample with 500 GeV<br />

centre <strong>of</strong> mass energy. Both algorithms are a fundamental component <strong>of</strong> <strong>the</strong> presented top<br />

study.<br />

A hardware study analysing <strong>the</strong> capacitance <strong>of</strong> <strong>the</strong> CPC2 sensor, a proposed technology<br />

for <strong>the</strong> SiD vertex detector, has also been performed. The analysis aims at minimizing <strong>the</strong><br />

clock current needed to operate <strong>the</strong> sensor. It has been found that <strong>the</strong> <strong>the</strong> major component<br />

<strong>of</strong> <strong>the</strong> pixel capacitance comes from <strong>the</strong> inter-gate capacitance (2.7pF/cm <strong>of</strong> gate overlap).<br />

A series <strong>of</strong> different possible designs <strong>of</strong> this region have been studied with <strong>the</strong> help <strong>of</strong> finite<br />

element modelling <strong>and</strong> <strong>of</strong> test structures.<br />

An experimental set up for <strong>the</strong> running <strong>and</strong> testing <strong>of</strong> <strong>the</strong> CPC2 sensor has been devised.<br />

The noise <strong>of</strong> <strong>the</strong> sensor has been measured as 70 e − .


Dedicated to my family<br />

i


“Poets say science takes away from <strong>the</strong> beauty <strong>of</strong> <strong>the</strong> stars - mere globs <strong>of</strong> gas atoms. I too<br />

can see <strong>the</strong> stars on a desert night <strong>and</strong> feel <strong>the</strong>m. But do I see less or more?”<br />

- Richard P. Feynman<br />

ii


Acknowledgements<br />

My past four years spent as a D.Phil. student in Oxford have been altoge<strong>the</strong>r thoroughly<br />

enjoyable <strong>and</strong> fulfilling. Although Oxford is a very pretty medieval town it has been <strong>the</strong><br />

presence <strong>of</strong> some wonderful people around me that have made my stay particularly pleasant.<br />

Firstly I would like to thank my supervisor Andrei Nomerotski. His help <strong>and</strong> patience<br />

over <strong>the</strong> past four years have been invaluable. Without his guidance <strong>the</strong> completion <strong>of</strong> this<br />

<strong>the</strong>sis would have been a much harder task <strong>and</strong> a substantially less enjoyable endeavour.<br />

I would like to extend my gratitude also to <strong>the</strong> rest <strong>of</strong> <strong>the</strong> LCFI collaboration <strong>and</strong> <strong>the</strong><br />

SiD benchmarking group, who were always ready to share <strong>the</strong>ir expertise <strong>and</strong> to help me<br />

with my ra<strong>the</strong>r continuous questions <strong>and</strong> problems.<br />

Thanks also to <strong>the</strong> fellow D.Phil. students who <strong>of</strong>ten lightened my days. A particular<br />

mention is due to all those whose pursuit <strong>of</strong> idiosyncratic <strong>of</strong>fice activities made <strong>the</strong> daily<br />

routine ever so pleasant. As it is <strong>of</strong>ten <strong>the</strong> case a lot <strong>of</strong> problems were solved outside <strong>the</strong><br />

<strong>of</strong>fice premises; thanks <strong>the</strong>refore to all <strong>the</strong> people who over lunch or in <strong>the</strong> pub have helped<br />

me with ideas <strong>and</strong> comments.<br />

During my D.Phil. I also had <strong>the</strong> opportunity to spend a few months at SLAC. A<br />

wonderful experience that has been made even more enjoyable because <strong>of</strong> <strong>the</strong> warm welcome<br />

I received from <strong>the</strong> whole SLAC SiD group. The welcome <strong>of</strong> <strong>the</strong> STFC student contingent<br />

in Palo Alto has also been very much appreciated.<br />

Without <strong>the</strong> generous founding provided by STFC, formerly PPARC, I would have never<br />

been able to pursue my D.Phil. studies; I am indeed very grateful for <strong>the</strong> opportunity <strong>the</strong>y<br />

have given me.<br />

I am also very grateful to all <strong>the</strong> Oxford support staff who <strong>of</strong>ten helped me with organisational<br />

<strong>and</strong> administrative problems that seemed more insurmountable than o<strong>the</strong>rs.<br />

Last but not least I would like to thank my friends <strong>and</strong> family whose persistent support<br />

has been a source <strong>of</strong> lasting <strong>and</strong> invaluable encouragement. It has been truly wonderful to<br />

know that I can always rely on <strong>the</strong>m.<br />

iii


Preface<br />

This <strong>the</strong>sis is <strong>the</strong> result <strong>of</strong> four years <strong>of</strong> <strong>the</strong> author’s work as a member <strong>of</strong> <strong>the</strong> LCFI <strong>and</strong><br />

SiD collaborations. As it is <strong>of</strong>ten <strong>the</strong> case in particle physics a lot <strong>of</strong> <strong>the</strong> work presented has<br />

been performed in conjunction with o<strong>the</strong>r collaborators. It is <strong>the</strong>refore important to clearly<br />

define <strong>the</strong> contribution <strong>of</strong> <strong>the</strong> author.<br />

The presented <strong>the</strong>sis can be subdivided in four areas <strong>of</strong> approximately equal lengths:<br />

• Chapters 1, 2 <strong>and</strong> 3 are <strong>the</strong> introductory chapters describing: <strong>the</strong> basis <strong>of</strong> <strong>the</strong> used<br />

particle physics <strong>the</strong>ory; <strong>the</strong> accelerator <strong>and</strong> <strong>the</strong> detector; <strong>the</strong> used simulation <strong>and</strong><br />

reconstruction s<strong>of</strong>tware. The author’s contribution in <strong>the</strong>se chapters is minimal with<br />

<strong>the</strong> exception <strong>of</strong> chapter 3 where he has helped with <strong>the</strong> testing <strong>and</strong> running <strong>of</strong> <strong>the</strong><br />

s<strong>of</strong>tware.<br />

• Chapter 4 presents <strong>the</strong> hardware work performed by <strong>the</strong> author: section 1 describes <strong>the</strong><br />

characterization <strong>of</strong> <strong>the</strong> CPC2 capacitance, section 2 describes <strong>the</strong> capacitance analysis<br />

on CPC2 test structures (work performed partially with <strong>the</strong> help <strong>of</strong> Oxford M.Phys.<br />

student Philip Coulter), section 3 describes <strong>the</strong> set-up for <strong>the</strong> testing <strong>of</strong> <strong>the</strong> CPC2<br />

sensor <strong>and</strong> <strong>the</strong> CPC2 sensor noise analysis (work performed with <strong>the</strong> help <strong>of</strong> Oxford<br />

D.Phil. students Ben Jeffery <strong>and</strong> Yambazi B<strong>and</strong>a). The overwhelming majority <strong>of</strong> <strong>the</strong><br />

work described in this chapter has been performed by <strong>the</strong> <strong>the</strong>sis author.<br />

• Chapter 5 presents <strong>the</strong> LCFI Vertex s<strong>of</strong>tware. The s<strong>of</strong>tware has been developed by <strong>the</strong><br />

author toge<strong>the</strong>r with Ben Jeffery <strong>and</strong> Mark Grimes (Ph.D. Bristol). Small contributions<br />

have been received also from some members <strong>of</strong> <strong>the</strong> LCFI collaboration. <strong>Section</strong><br />

1 describes <strong>the</strong> track selection procedures, section 2 describes <strong>the</strong> ZVTOP vertexing;<br />

section 3 describes <strong>the</strong> flavour tagging discriminating variables, section 4 describes <strong>the</strong><br />

used neural network, sections 5 <strong>and</strong> 6 describe <strong>the</strong> performances in <strong>the</strong> ILD <strong>and</strong> SiD<br />

concepts. The author has been responsible for sections 3 <strong>and</strong> 6. It has also had major<br />

contributions in sections 1 <strong>and</strong> 5, with minor contributions towards sections 2 <strong>and</strong> 4.<br />

• Chapters 6 <strong>and</strong> 7 present <strong>the</strong> performed physics analysis aimed to measure <strong>the</strong> proprieties<br />

<strong>of</strong> <strong>the</strong> top quark at <strong>the</strong> ILC. Both chapters present exclusively <strong>the</strong> work <strong>of</strong> <strong>the</strong><br />

author, including section 1 <strong>of</strong> chapter 7 which is part <strong>of</strong> <strong>the</strong> LCFI Vertex s<strong>of</strong>tware.<br />

Finally throughout <strong>the</strong> <strong>the</strong>sis <strong>the</strong> help <strong>of</strong> Dr. Andrei Nomerotski has been heavily exploited.<br />

iv


Contents<br />

1 Physics <strong>of</strong> <strong>the</strong> <strong>Top</strong> <strong>Quark</strong>: Theoretical Introduction 1<br />

1.1 The St<strong>and</strong>ard Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2<br />

1.2 The <strong>Top</strong> <strong>Quark</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7<br />

1.2.1 Effective Field Theories . . . . . . . . . . . . . . . . . . . . . . . . . 9<br />

1.2.2 Observables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11<br />

2 The ILC <strong>and</strong> <strong>the</strong> SiD Detector Concept 14<br />

2.1 The ILC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15<br />

2.2 The Silicon Detector Concept . . . . . . . . . . . . . . . . . . . . . . . . . . 18<br />

2.2.1 Vertex Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19<br />

2.2.2 Silicon Tracker . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21<br />

2.2.3 Calorimetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23<br />

2.2.4 Muon Identification System . . . . . . . . . . . . . . . . . . . . . . . 27<br />

2.2.5 Forward Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28<br />

2.3 The International Large Detector . . . . . . . . . . . . . . . . . . . . . . . . 29<br />

3 Simulation <strong>and</strong> Reconstruction 30<br />

3.1 Monte Carlo Generation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30<br />

3.2 Detector Simulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31<br />

3.3 Reconstruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32<br />

3.3.1 Tracking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32<br />

3.3.2 Particle Flow Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . 34<br />

3.3.3 Jet Finding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36<br />

3.3.4 The S<strong>of</strong>tware Framework . . . . . . . . . . . . . . . . . . . . . . . . . 37<br />

4 The Column Parallel CCD as an ILC Vertex Detector Technology 39<br />

4.1 CPC2 Capacitance Characterization . . . . . . . . . . . . . . . . . . . . . . . 42<br />

4.1.1 Substrate Capacitance . . . . . . . . . . . . . . . . . . . . . . . . . . 43<br />

4.1.2 Intergate Capacitance . . . . . . . . . . . . . . . . . . . . . . . . . . 45<br />

4.1.3 Finite Element Model <strong>of</strong> CPC2 Capacitance . . . . . . . . . . . . . . 47<br />

4.2 CPC2 Capacitance Test Structures . . . . . . . . . . . . . . . . . . . . . . . 50<br />

4.3 CPC2 Readout <strong>and</strong> Noise Analysis . . . . . . . . . . . . . . . . . . . . . . . 58<br />

4.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65<br />

v


5 The LCFI Vertex Flavour Tagging S<strong>of</strong>tware 66<br />

5.1 Track Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67<br />

5.2 The ZVRES Vertex Finding Algorithm . . . . . . . . . . . . . . . . . . . . . 69<br />

5.2.1 ZVRES Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . 73<br />

5.3 Flavour Discriminating Variables . . . . . . . . . . . . . . . . . . . . . . . . 76<br />

5.3.1 Variables for Jets without Secondary Vertices . . . . . . . . . . . . . 77<br />

5.3.2 Variables for Jets with Secondary Vertices . . . . . . . . . . . . . . . 83<br />

5.4 Neural Network Combination . . . . . . . . . . . . . . . . . . . . . . . . . . 89<br />

5.5 Performance <strong>of</strong> Flavour Tagging . . . . . . . . . . . . . . . . . . . . . . . . . 93<br />

5.6 Flavour Tagging in <strong>the</strong> SiD Detector Concept . . . . . . . . . . . . . . . . . 99<br />

5.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100<br />

6 The <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> <strong>and</strong> <strong>the</strong> t¯t <strong>Cross</strong> <strong>Section</strong> 101<br />

6.1 Preliminary Event Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . 103<br />

6.2 Kinematic Fitter <strong>and</strong> W Identification . . . . . . . . . . . . . . . . . . . . . 105<br />

6.3 b Tagging Event Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108<br />

6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111<br />

6.4.1 Curve Fitting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113<br />

6.4.2 Template Fitting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116<br />

6.5 <strong>Cross</strong> <strong>Section</strong> <strong>of</strong> <strong>the</strong> b¯bq¯qq¯q Process . . . . . . . . . . . . . . . . . . . . . . . 121<br />

6.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122<br />

7 <strong>Quark</strong> Charge <strong>and</strong> Forward Backward Asymmetries 124<br />

7.1 Vertex Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125<br />

7.2 Momentum Weighted Vertex <strong>and</strong> Jet Charge . . . . . . . . . . . . . . . . . . 126<br />

7.3 Combined Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131<br />

7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry . . . . . . . . . . . . . . . . . 133<br />

7.5 <strong>Top</strong> <strong>Quark</strong> Forward Backward Asymmetry . . . . . . . . . . . . . . . . . . . 138<br />

7.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140<br />

8 Conclusion 142<br />

A CPC2 Bias Voltages 144<br />

B LCFI Vertex Flavour Tagging S<strong>of</strong>tware Parameters 145<br />

Bibliography 146<br />

vi


List <strong>of</strong> Figures<br />

1.1 St<strong>and</strong>ard Model Interactions . . . . . . . . . . . . . . . . . . . . . . . . . . . 7<br />

1.2 Constraints to Higgs <strong>Mass</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8<br />

1.3 Feynman Diagrams <strong>of</strong> Signal <strong>and</strong> Background Events . . . . . . . . . . . . . 11<br />

2.1 The International Linear Collider . . . . . . . . . . . . . . . . . . . . . . . . 16<br />

2.2 The SiD Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19<br />

2.3 The SiD Subdetectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20<br />

2.4 The Vertex Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20<br />

2.5 The Tracking Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22<br />

2.6 Tracking Resolution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23<br />

2.7 PFA Visual Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24<br />

2.8 Calorimeter Technologies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25<br />

2.9 SiD Calorimeter Performance . . . . . . . . . . . . . . . . . . . . . . . . . . 27<br />

2.10 The Forward Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28<br />

3.1 Tracking Efficiency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34<br />

4.1 Charged Coupled Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40<br />

4.2 CCD vs. CPC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41<br />

4.3 Graphical Representation <strong>of</strong> CCD Depletion Regions . . . . . . . . . . . . . 43<br />

4.4 CPC2 Capacitance Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44<br />

4.5 Substrate Capacitance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44<br />

4.6 CPC2 One Pixel Geometry for Finite Element Analysis . . . . . . . . . . . . 47<br />

4.7 Modelled Substrate Capacitance . . . . . . . . . . . . . . . . . . . . . . . . . 48<br />

4.8 Intergate Capacitance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49<br />

4.9 Sketch <strong>of</strong> Realistic Intergate Region . . . . . . . . . . . . . . . . . . . . . . . 50<br />

4.10 Intergate Capacitance FEEM Study . . . . . . . . . . . . . . . . . . . . . . . 51<br />

4.11 Picture <strong>of</strong> a Test Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . 52<br />

4.12 Test Structures Schematics . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52<br />

4.13 Test Structure Parasitic Capacitance . . . . . . . . . . . . . . . . . . . . . . 54<br />

4.14 Test Structures Substrate Capacitance . . . . . . . . . . . . . . . . . . . . . 54<br />

4.15 Test Structures Substrate Capacitance for Different Polysilicon Layers . . . . 55<br />

4.16 Test Structures Intergate Capacitance . . . . . . . . . . . . . . . . . . . . . . 57<br />

4.17 Noise Analysis Experimental Set-up . . . . . . . . . . . . . . . . . . . . . . . 58<br />

4.18 Oscilloscope Picture <strong>of</strong> <strong>the</strong> CPC2 Readout . . . . . . . . . . . . . . . . . . . 59<br />

4.19 ADC Readout User Interface . . . . . . . . . . . . . . . . . . . . . . . . . . . 60<br />

4.20 CPC2 Noise for Different Power Supplies . . . . . . . . . . . . . . . . . . . . 61<br />

vii


4.21 Correlated Double Sampling User Interface . . . . . . . . . . . . . . . . . . . 63<br />

4.22 CPC2 Noise with New Set-up . . . . . . . . . . . . . . . . . . . . . . . . . . 64<br />

5.1 Schematic <strong>of</strong> Displaced Vertices in B Meson Decays . . . . . . . . . . . . . . 67<br />

5.2 ZVRES Graphical Representation . . . . . . . . . . . . . . . . . . . . . . . . 70<br />

5.3 ZVRES Efficiency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73<br />

5.4 ZVRES Vertexing Performance . . . . . . . . . . . . . . . . . . . . . . . . . 75<br />

5.5 Flavour Discriminating Variables - 1 Vertex Reconstructed . . . . . . . . . . 79<br />

5.6 Joint Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81<br />

5.7 Impact Parameter Distributions . . . . . . . . . . . . . . . . . . . . . . . . . 83<br />

5.8 Track Attachment Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . 85<br />

5.9 Flavour Discriminating Variables - 2 Vertices Reconstructed . . . . . . . . . 88<br />

5.10 Charm Tagging Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . 90<br />

5.11 Charm tag vs. Bottom tag . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92<br />

5.12 Flavour Tagging Performance . . . . . . . . . . . . . . . . . . . . . . . . . . 94<br />

5.13 Flavour Tagging Leakage Rates . . . . . . . . . . . . . . . . . . . . . . . . . 95<br />

5.14 Flavour Tagging Performance with K s , Λ <strong>and</strong> Conversion Tagger . . . . . . . 96<br />

5.15 Flavour Tagging Performance in SiD . . . . . . . . . . . . . . . . . . . . . . 100<br />

6.1 Kinematic <strong>and</strong> <strong>Top</strong>ological Event Selections . . . . . . . . . . . . . . . . . . 105<br />

6.2 Kinematic Fit <strong>of</strong> <strong>the</strong> W bosons . . . . . . . . . . . . . . . . . . . . . . . . . 107<br />

6.3 b Tagging in a Six Jet Environment . . . . . . . . . . . . . . . . . . . . . . . 109<br />

6.4 b − tag sum Event Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110<br />

6.5 b Tagging Single Jet Event Selection . . . . . . . . . . . . . . . . . . . . . . 110<br />

6.6 <strong>Top</strong> <strong>Quark</strong> Kinematic Fit . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112<br />

6.7 <strong>Top</strong> <strong>Quark</strong> Kinematic Fit Event Selection . . . . . . . . . . . . . . . . . . . 112<br />

6.8 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> - Curve Fitting Technique . . . . . . . . . . . . . . . . . . . 114<br />

6.9 Template Fitting Technique . . . . . . . . . . . . . . . . . . . . . . . . . . . 117<br />

6.10 Template Fitting χ 2 Minimization . . . . . . . . . . . . . . . . . . . . . . . . 118<br />

7.1 LCFI Vertex Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126<br />

7.2 Momentum Weighted Vertex Charge . . . . . . . . . . . . . . . . . . . . . . 127<br />

7.3 Momentum Weighted Jet Charge . . . . . . . . . . . . . . . . . . . . . . . . 128<br />

7.4 Momentum Weighted Vertex <strong>and</strong> Jet Charge: B + /B − <strong>and</strong> ¯B 0 /B 0 . . . . . . 129<br />

7.5 f¯b<br />

i (x i )/fi b (x i ) for Momentum Weighted Vertex Charge <strong>and</strong> Jet Charge . . . . 131<br />

7.6 Combined Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132<br />

7.7 Combined Charge b jet-1 × Combined Charge b jet-2 . . . . . . . . . . . . . 134<br />

7.8 Events for <strong>the</strong> Calculation <strong>of</strong> <strong>the</strong> b <strong>Quark</strong> A fb . . . . . . . . . . . . . . . . . 135<br />

7.9 Resolution <strong>of</strong> b <strong>Quark</strong> Angle θ . . . . . . . . . . . . . . . . . . . . . . . . . . 138<br />

7.10 Events for <strong>the</strong> Calculation <strong>of</strong> <strong>the</strong> t <strong>Quark</strong> A fb . . . . . . . . . . . . . . . . . 139<br />

7.11 Resolution <strong>of</strong> t <strong>Quark</strong> Angle θ . . . . . . . . . . . . . . . . . . . . . . . . . . 140<br />

A.1 Schematic <strong>of</strong> <strong>the</strong> CPC2 Bias Voltages . . . . . . . . . . . . . . . . . . . . . . 144<br />

viii


List <strong>of</strong> Tables<br />

1.1 Bosons in <strong>the</strong> St<strong>and</strong>ard Model . . . . . . . . . . . . . . . . . . . . . . . . . . 5<br />

1.2 Fermions in <strong>the</strong> St<strong>and</strong>ard Model . . . . . . . . . . . . . . . . . . . . . . . . . 6<br />

2.1 ILC Beam Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17<br />

3.1 PFA Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36<br />

4.1 Measured Intergate Capacitance . . . . . . . . . . . . . . . . . . . . . . . . . 46<br />

5.1 Track Selection Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69<br />

5.2 ZVRES Track Association . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76<br />

5.3 Joint Probability Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . 83<br />

5.4 Importance <strong>of</strong> Discriminating Variables Z <strong>Mass</strong> Resonance - 1 Vertex Found 97<br />

5.5 Importance <strong>of</strong> Discriminating Variables Z <strong>Mass</strong> Resonance - 2 Vertices Found 97<br />

5.6 Importance <strong>of</strong> Discriminating Variables Z <strong>Mass</strong> Resonance - 3+ Vertices Found 97<br />

5.7 Importance <strong>of</strong> Discriminating Variables 500GeV - 1 Vertex Found . . . . . . 98<br />

5.8 Importance <strong>of</strong> Discriminating Variables 500GeV - 2 Vertices Found . . . . . 98<br />

5.9 Importance <strong>of</strong> Discriminating Variables 500GeV - 3+ Vertices Found . . . . 98<br />

6.1 Kinematic <strong>and</strong> <strong>Top</strong>ological Event Selections . . . . . . . . . . . . . . . . . . 104<br />

6.2 <strong>Top</strong> Kinematic Fitting Constraints . . . . . . . . . . . . . . . . . . . . . . . 111<br />

6.3 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> - Curve Fitting Technique . . . . . . . . . . . . . . . . . . . 115<br />

6.4 Template Fitting Results - Renormalized <strong>Cross</strong> <strong>Section</strong> . . . . . . . . . . . . 119<br />

6.5 Template Fitting Results - MC <strong>Cross</strong> <strong>Section</strong> . . . . . . . . . . . . . . . . . 120<br />

6.6 b¯bq¯qq¯q <strong>Cross</strong> <strong>Section</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122<br />

7.1 LCFI Vertex Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127<br />

7.2 Reconstructed A fb for <strong>the</strong> b <strong>Quark</strong> . . . . . . . . . . . . . . . . . . . . . . . 136<br />

7.3 <strong>Anomalous</strong> W tb Coupling Theoretical Predictions for b <strong>Quark</strong> A fb . . . . . . 137<br />

7.4 Reconstructed A fb for <strong>the</strong> t <strong>Quark</strong> . . . . . . . . . . . . . . . . . . . . . . . 140<br />

B.1 LCFI Vertex Default Parameters . . . . . . . . . . . . . . . . . . . . . . . . 145<br />

ix


Chapter 1<br />

Physics <strong>of</strong> <strong>the</strong> <strong>Top</strong> <strong>Quark</strong>: Theoretical<br />

Introduction<br />

Ideally <strong>the</strong> complex objects <strong>and</strong> phenomena that we observe every day can all be described<br />

with just a h<strong>and</strong>ful <strong>of</strong> basic constituents <strong>and</strong> a matching set <strong>of</strong> laws describing <strong>the</strong>ir interactions.<br />

Hence <strong>the</strong> inherit aim <strong>of</strong> particle physics is to comprehensively <strong>and</strong> self-consistently describe<br />

<strong>the</strong> most fundamental constituents <strong>of</strong> matter in <strong>the</strong> simplest form possible. Throughout<br />

<strong>the</strong> past century a new level <strong>of</strong> precision in measurements <strong>and</strong> new analytical techniques<br />

allowed human kind to explore deeper into matter. Scientists <strong>the</strong>refore observed what are<br />

now believed to be <strong>the</strong> discrete units <strong>of</strong> matter <strong>and</strong> tried to describe <strong>the</strong>ir interactions. First<br />

came <strong>the</strong> <strong>the</strong>ory <strong>of</strong> Quantum Electro Dynamics (QED), which is essentially <strong>the</strong> quantised<br />

version <strong>of</strong> electromagnetic interactions. Far from being complete <strong>the</strong> <strong>the</strong>ory left a lot unexplained,<br />

not least <strong>the</strong> β decay <strong>of</strong> nuclei. It was to explain such phenomena that <strong>the</strong> weak<br />

force was postulated <strong>and</strong> subsequently encompassed in <strong>the</strong> Electro-Weak <strong>the</strong>ory. This successfully<br />

incorporated QED <strong>and</strong> also provided a description for <strong>the</strong> weak force by introducing<br />

<strong>the</strong> exchange <strong>of</strong> massive force carrying particles. Still a complete picture could not be built.<br />

The <strong>the</strong>ory does not predict a stable atomic nucleus; fur<strong>the</strong>rmore, it has been discovered,<br />

that both <strong>the</strong> proton <strong>and</strong> <strong>the</strong> neutron exhibit a substructure. It is only with <strong>the</strong> introduction<br />

<strong>of</strong> <strong>the</strong> strong force <strong>and</strong> <strong>the</strong>refore <strong>the</strong> field <strong>the</strong>ory <strong>of</strong> Quantum Chromo Dynamics (QCD),<br />

1


1.1 The St<strong>and</strong>ard Model 2<br />

that it is possible to underst<strong>and</strong> this substructure <strong>and</strong> at <strong>the</strong> same time predict a stable<br />

behaviour <strong>of</strong> <strong>the</strong> nucleus.<br />

1.1 The St<strong>and</strong>ard Model<br />

The St<strong>and</strong>ard Model (SM) is currently <strong>the</strong> most widely accepted approximation to a comprehensive<br />

<strong>and</strong> self-consistent <strong>the</strong>ory <strong>of</strong> <strong>the</strong> fundamental constituents <strong>of</strong> matter <strong>and</strong> <strong>the</strong>ir<br />

interactions. It consists <strong>of</strong> three Quantum Field Theories: Quantum Electro Dynamics,<br />

Electro-Weak <strong>the</strong>ory <strong>and</strong> Quantum Chromo Dynamics <strong>the</strong>ories. It is <strong>the</strong> aim <strong>of</strong> this section<br />

to give a syn<strong>the</strong>tic, experimentally backed overview <strong>of</strong> <strong>the</strong> principles <strong>of</strong> <strong>the</strong> SM.<br />

In any quantum field <strong>the</strong>ory a complex n-dimensional wave function ψ(x) represents one<br />

or more particles, where |ψ(x)| 2 is <strong>the</strong> probability <strong>of</strong> finding <strong>the</strong> system in <strong>the</strong> physical state<br />

x. The amplitude <strong>of</strong> <strong>the</strong> function is hence a measurable quantity, while <strong>the</strong> phase is not. The<br />

phase <strong>of</strong> <strong>the</strong> system cannot be restricted from physical observations <strong>and</strong> hence an arbitrary<br />

phase can be assigned. Clearly <strong>the</strong> <strong>the</strong>ory should not be dependent on this arbitrary decision<br />

<strong>and</strong> <strong>the</strong>refore it should be invariant under local gauge transformations:<br />

ψ(x) → e iθ(x) ψ(x). (1.1)<br />

The gauge requirement has a few far reaching consequences. It can be immediately seen<br />

that <strong>the</strong> classical derivative ∂ µ is not gauge invariant. The covariant derivative must hence<br />

be used, leading to <strong>the</strong> automatic introduction <strong>of</strong> <strong>the</strong> force carrying particle as a gauge<br />

field <strong>of</strong> <strong>the</strong> covariant derivative. In <strong>the</strong> QED formalism this is easily shown. The covariant<br />

derivative is:<br />

D µ = ∂ µ + ieA µ (1.2)<br />

where e is <strong>the</strong> electric charge <strong>of</strong> <strong>the</strong> considered particle. Additionally <strong>the</strong> gauge field A µ is<br />

required to transform as:<br />

A µ → A µ + ∂ µ θ(x). (1.3)


1.1 The St<strong>and</strong>ard Model 3<br />

Taking <strong>the</strong>n, as an example, <strong>the</strong> simple Lagrangian <strong>of</strong> <strong>the</strong> particle in a electromagnetic field:<br />

L = i ¯ψγ µ ∂ µ ψ − e ¯ψγ µ A µ ψ − m ¯ψψ − 1 4 F µνF µν (1.4)<br />

where γ µ are <strong>the</strong> Dirac gamma matrices <strong>and</strong> m is <strong>the</strong> mass <strong>of</strong> <strong>the</strong> particle. The field F µν is<br />

<strong>the</strong>n defined as:<br />

F µν = ∂ µ A ν − ∂ ν A µ . (1.5)<br />

By <strong>the</strong>n using <strong>the</strong> Euler-Lagrange equation on <strong>the</strong> initial particle field ψ <strong>and</strong> on <strong>the</strong> gauge<br />

field A µ one derives:<br />

iγ µ ∂ µ ψ − mψ = eγ µ A µ ψ (1.6)<br />

∂ ν F µν = e ¯ψγ µ ψ (1.7)<br />

Noticeably <strong>the</strong> left h<strong>and</strong> side (LHS) <strong>of</strong> (1.6) is <strong>the</strong> well known Dirac equation. Differently<br />

<strong>the</strong> right h<strong>and</strong> side (RHS) <strong>of</strong> <strong>the</strong> equation, which derives directly from <strong>the</strong> covariant derivative,<br />

<strong>and</strong> hence from <strong>the</strong> gauge invariance postulate, represents <strong>the</strong> particle’s interaction with<br />

<strong>the</strong> electromagnetic field. The field A µ can now be interpreted as a force mediating particle.<br />

In fact its behaviour is not at all dissimilar from <strong>the</strong> original field ψ(x). The behaviour <strong>of</strong><br />

<strong>the</strong> electromagnetic field tensor can be instead seen in (1.7).<br />

This is however not <strong>the</strong> only important result that can be derived from <strong>the</strong> axiom <strong>of</strong> gauge<br />

invariance. Noe<strong>the</strong>r’s <strong>the</strong>orem in fact postulates that for every differentiable symmetry <strong>the</strong>re<br />

must be a corresponding conservation law. In <strong>the</strong> present case it can be shown that gauge<br />

invariance directly implies <strong>the</strong> conservation <strong>of</strong> charged currents. Charge itself must <strong>the</strong>refore<br />

be conserved. It is hence clear that <strong>the</strong> importance <strong>of</strong> gauge invariance in particle physics<br />

cannot be underestimated[1, 2].<br />

The figure illustrated here deals only with <strong>the</strong> QED <strong>the</strong>ory, which is ma<strong>the</strong>matically a<br />

Unitary Group <strong>of</strong> order one (U(1) EM ). It <strong>the</strong>refore has only one generator <strong>and</strong> hence only<br />

one charge <strong>and</strong> one force carrying vector boson. Not at all dissimilar are <strong>the</strong> considerations<br />

for <strong>the</strong> weak <strong>and</strong> strong force which can be ma<strong>the</strong>matically represented by <strong>the</strong> non-abelian


1.1 The St<strong>and</strong>ard Model 4<br />

SU(2) <strong>and</strong> SU(3) groups. Given that SU(3) has 8 generators it also has <strong>the</strong> corresponding<br />

eight gluons <strong>and</strong> three conserved “colour charges“. Intuitively <strong>the</strong> same will apply for <strong>the</strong><br />

weak SU(2) <strong>the</strong>ory. The Electro-Weak description is however slightly more complex. In<br />

SU(2) L ×U(1) Y two charged bosonic particles derive from <strong>the</strong> original weak fields as naively<br />

expected <strong>and</strong> <strong>the</strong>refore are responsible for <strong>the</strong> W + <strong>and</strong> W − particles. Differently <strong>the</strong> neutral<br />

fields derive from <strong>the</strong> mixing <strong>of</strong> <strong>the</strong> U(1) Y field <strong>and</strong> <strong>the</strong> neutral SU(2) L field. The strength<br />

<strong>of</strong> <strong>the</strong> mixing is encoded in terms <strong>of</strong> <strong>the</strong> weak mixing angle (θ w )[3, 4, 5]. After <strong>the</strong> mixing <strong>the</strong><br />

<strong>the</strong>ory describes both <strong>the</strong> Electro-Magnetic field <strong>and</strong> <strong>the</strong> neutral weak field <strong>and</strong> is <strong>the</strong>refore<br />

responsible for <strong>the</strong> γ <strong>and</strong> Z 0 particles.<br />

In order to present a more detailed picture <strong>of</strong> Electro-Weak <strong>the</strong>ory <strong>the</strong> field must be<br />

broken into its right <strong>and</strong> left h<strong>and</strong>ed components.<br />

ψ R = 1 2 (1 + γ 5)ψ (1.8)<br />

ψ L = 1 2 (1 − γ 5)ψ (1.9)<br />

By construction <strong>the</strong> 1±γ 5 operators decompose <strong>the</strong> field into <strong>the</strong> right h<strong>and</strong>ed <strong>and</strong> left<br />

h<strong>and</strong>ed chiral states. In <strong>the</strong> approximation where <strong>the</strong> particle is massless this is always<br />

equivalent to <strong>the</strong> spin parallel <strong>and</strong> anti-parallel components with respect to <strong>the</strong> momentum<br />

<strong>of</strong> <strong>the</strong> particle. This assumption breaks down for a massive particle, as by definition a<br />

frame <strong>of</strong> reference exists where <strong>the</strong> particle appears to be travelling in <strong>the</strong> opposite direction<br />

to <strong>the</strong> one initially postulated. Chien-Shiung Wu et al. discovered that <strong>the</strong> weak force<br />

interacts only with <strong>the</strong> chiral left h<strong>and</strong>ed component <strong>of</strong> <strong>the</strong> SU(2) group[6]. In terms <strong>of</strong><br />

a ma<strong>the</strong>matical representation this can be interpreted as a weak isodoublet for chiral left<br />

h<strong>and</strong>ed fermions interacting with <strong>the</strong> weak force <strong>and</strong> an isosinglet for <strong>the</strong>ir right h<strong>and</strong>ed<br />

counterparts. A syn<strong>the</strong>tic summary <strong>of</strong> <strong>the</strong> present <strong>the</strong>ory describing <strong>the</strong> interaction <strong>of</strong> matter<br />

at <strong>the</strong> most fundamental level has been now described (table 1.1, fig. 1.1). All matter <strong>and</strong><br />

its interaction is however still represented by massless constituents, something that has been<br />

clearly disproved by numerous experiments <strong>and</strong> our everyday life.


1.1 The St<strong>and</strong>ard Model 5<br />

It is only with <strong>the</strong> introduction <strong>of</strong> <strong>the</strong> Higgs mechanism that <strong>the</strong> <strong>the</strong>oretical problem has<br />

been solved. The mechanism introduces four new scalar fields. Three are massless Goldsone<br />

bosons <strong>and</strong> are responsible via <strong>the</strong>ir interaction terms for <strong>the</strong> transverse polarization <strong>and</strong><br />

<strong>the</strong>refore <strong>the</strong> masses <strong>of</strong> <strong>the</strong> W <strong>and</strong> Z bosons. Hence <strong>the</strong> Higgs mechanism elegantly predicts<br />

<strong>the</strong> masses <strong>of</strong> <strong>the</strong> W <strong>and</strong> Z bosons from <strong>the</strong>ir observed couplings (or vice versa) <strong>and</strong> as a<br />

consequence explains <strong>the</strong> broken Electro-Weak symmetry. In fact it can be now proven that<br />

<strong>the</strong> masses <strong>of</strong> <strong>the</strong> Z <strong>and</strong> W bosons depend on <strong>the</strong> mixing angle <strong>of</strong> <strong>the</strong> electromagnetic <strong>and</strong><br />

<strong>the</strong> neutral weak field: cos(θ w )=M W /M Z . The remaining scalar field manifests itself as a<br />

massive particles. The interaction <strong>of</strong> <strong>the</strong> massless fermionic fields with this last scalar field,<br />

<strong>the</strong> Higgs field, gives all remaining constituents <strong>of</strong> matter <strong>the</strong>ir masses [7]. Although <strong>the</strong>re<br />

is some indirect evidence for <strong>the</strong> Higgs mechanism from <strong>the</strong> masses <strong>of</strong> <strong>the</strong> W <strong>and</strong> Z boson a<br />

conclusive experimental pro<strong>of</strong> in terms <strong>of</strong> <strong>the</strong> direct observation <strong>of</strong> <strong>the</strong> Higgs boson has not<br />

yet been achieved.<br />

St<strong>and</strong>ard Model Bosons<br />

Coupling Particle Charge (e) Spin <strong>Mass</strong> (Gev/c 2 )<br />

Z 0 0 1 91.2<br />

Weak W + +1 1 80.4<br />

W − −1 1 80.4<br />

EM γ 0 1 0<br />

Strong 8 Gluons 0 1 0<br />

H 0 0 0 > 114.4<br />

Table 1.1: Boson spectrum <strong>of</strong> <strong>the</strong> St<strong>and</strong>ard Model.<br />

What still needs to be described are <strong>the</strong> fundamental constituents that perform <strong>the</strong>se<br />

interactions. At present <strong>the</strong>re are no tested <strong>the</strong>oretical models that will predict <strong>the</strong> existence<br />

<strong>of</strong> matter constituents from axiomatic principles, all information derives from experimental<br />

evidence. It has been so found that matter can be divided in two separate groups: quarks <strong>and</strong><br />

leptons, both with spins <strong>of</strong> 1/2. The difference between <strong>the</strong> two is that quarks can interact<br />

also via <strong>the</strong> strong force while leptons cannot. Additionally <strong>the</strong> neutrinos, representing<br />

<strong>the</strong> members <strong>of</strong> <strong>the</strong> left-h<strong>and</strong>ed isodoublets interacting with <strong>the</strong> weak force have no charge<br />

<strong>and</strong> hence cannot interact with <strong>the</strong> photon. Three different generations <strong>of</strong> leptons have<br />

been found <strong>and</strong> all interaction observed conserve <strong>the</strong> generation number, with <strong>the</strong> exception


1.1 The St<strong>and</strong>ard Model 6<br />

being <strong>the</strong> oscillation <strong>of</strong> neutrinos, an area however not universally regarded as <strong>the</strong> St<strong>and</strong>ard<br />

Model <strong>and</strong> that will not be discussed in this <strong>the</strong>sis. Similarly also three generation <strong>of</strong> quarks<br />

have been found. Mixing between <strong>the</strong> quark generations is allowed only via a charged weak<br />

interaction. In such case <strong>the</strong> interaction with generation conserving particles is still <strong>the</strong> most<br />

likely process. The precise quantitative value <strong>of</strong> each mixing is derived experimentally <strong>and</strong><br />

is encoded in <strong>the</strong> Cabibbo-Kobayashi-Maskawa matrix[8].<br />

St<strong>and</strong>ard Model Fermions<br />

Group Particles Charge (e) Spin Interaction Weak eigenstates<br />

Leptons<br />

ν e ν µ ν τ 0 1/2 Weak L<br />

e µ τ −1 1/2 EM, Weak L, R<br />

<strong>Quark</strong>s<br />

d i s i b i −1/3 1/2 Strong, EM, Weak L, R<br />

u i c i t i +2/3 1/2 Strong, EM, Weak L, R<br />

Table 1.2: Fermion spectrum <strong>of</strong> <strong>the</strong> St<strong>and</strong>ard Model. The index, i, on each quark runs from<br />

0 to 2 <strong>and</strong> represents its colour.<br />

The aim <strong>of</strong> a syn<strong>the</strong>tic description <strong>of</strong> all matter <strong>and</strong> its interaction has been achieved by<br />

means <strong>of</strong> <strong>the</strong> above described framework (table 1.2, table 1.1, fig. 1.1). The existence <strong>of</strong> antimatter<br />

has however been <strong>the</strong>oretically predicted by Paul Dirac in <strong>the</strong> late nineteen-twenties<br />

<strong>and</strong> successfully discovered by Anderson shortly <strong>the</strong>reafter. In a first order approximation<br />

<strong>the</strong> laws <strong>of</strong> physics for matter <strong>and</strong> antimatter should be equivalent once also a parity conjugation<br />

has been performed. The Charge conjugation operator effectively inverts <strong>the</strong> charge<br />

<strong>of</strong> <strong>the</strong> particle while <strong>the</strong> parity operator inverts, one or more, spacial axes <strong>of</strong> <strong>the</strong> coordinate<br />

system. Hence <strong>the</strong> parity operator transforms a physical coordinate system into its mirror<br />

image; <strong>the</strong> SU(2) L group <strong>the</strong>refore transforms into SU(2) R . Clearly this raises <strong>the</strong> problem<br />

<strong>of</strong> over abundance <strong>of</strong> matter in <strong>the</strong> known universe, with respect to antimatter. A mechanism<br />

that violates <strong>the</strong> charge <strong>and</strong> parity symmetry must <strong>the</strong>refore exist. In fact such mechanism<br />

was experimentally discovered in 1964 by James Cronin, Val Fitch <strong>and</strong> collaborators in a experiment<br />

observing <strong>the</strong> decay <strong>of</strong> neutral kaons [9]. Since <strong>the</strong>n o<strong>the</strong>r CP-Violation effects have<br />

been observed in many different reactions <strong>and</strong> <strong>the</strong> mechanism has been successfully incorporated<br />

in <strong>the</strong> Cabibbo-Kobayashi-Maskawa matrix as a complex phase. However <strong>the</strong> rate <strong>of</strong><br />

asymmetry predicted by <strong>the</strong> SM is substantially too small to explain <strong>the</strong> present universe.


1.2 The <strong>Top</strong> <strong>Quark</strong> 7<br />

The over-abundace <strong>of</strong> matter in <strong>the</strong> universe is <strong>the</strong>refore still an unresolved question.<br />

Figure 1.1: Map <strong>of</strong> all interactions allowed by <strong>the</strong> St<strong>and</strong>ard Model<br />

1.2 The <strong>Top</strong> <strong>Quark</strong><br />

The top quark is substantially more massive than <strong>the</strong> rest <strong>of</strong> <strong>the</strong> observed quarks <strong>and</strong> leptons.<br />

It is <strong>the</strong>refore <strong>the</strong> logical starting point for any physical mechanism that describes <strong>the</strong> mass<br />

difference between all fermionic matter; a mechanism <strong>of</strong>ten referred to as Flavour Symmetry<br />

Breaking (FSB). If <strong>the</strong> St<strong>and</strong>ard Model postulates that mass is acquired via <strong>the</strong> Yukawa<br />

coupling [2] <strong>the</strong> reason behind <strong>the</strong> vast variety <strong>of</strong> such couplings from particle to particle<br />

<strong>and</strong> generation to generation is a completely speculative area <strong>of</strong> <strong>the</strong>oretical research. Many<br />

models exists <strong>and</strong> in most, if not all, <strong>the</strong> top quark plays a central role. As an added benefit<br />

many <strong>of</strong> <strong>the</strong> discussed <strong>the</strong>ories not only address Flavour Symmetry Breaking but also address<br />

<strong>the</strong> hierarchy between different forces.<br />

Chronologically one <strong>of</strong> <strong>the</strong> first families <strong>of</strong> such <strong>the</strong>ories is Technicolor. This postulates a<br />

new strong gauge interaction in <strong>the</strong> TeV scale region that effectively performs <strong>the</strong> symmetry<br />

breaking role <strong>of</strong> <strong>the</strong> now absent Higgs boson. From an experimental point <strong>of</strong> view <strong>the</strong> <strong>the</strong>ory<br />

manifest itself as resonances decaying, usually preferentially, into t <strong>and</strong> ¯t quarks. In <strong>the</strong> more<br />

specific <strong>Top</strong>color <strong>and</strong> <strong>Top</strong>color assisted Techincolor models <strong>the</strong> gauge boson is nothing else<br />

than a t¯t bound state or a ’top gluon’. These again decay almost exclusively into top <strong>and</strong><br />

anti-top quarks[10].


1.2 The <strong>Top</strong> <strong>Quark</strong> 8<br />

Theories based on extra dimensions are similar to Technicolour; while <strong>the</strong> origin <strong>of</strong> <strong>the</strong><br />

gauge boson is different, <strong>the</strong> observable phenomenology is exactly <strong>the</strong> same. The symmetry<br />

breaking mechanism is a boson that propagates only in <strong>the</strong> hypo<strong>the</strong>sized extra dimension<br />

<strong>and</strong> that couples to <strong>the</strong> fields on <strong>the</strong> St<strong>and</strong>ard Model brane. In most scenarios <strong>the</strong> gauge<br />

boson has preferential couplings to heavier generations <strong>and</strong> hence to <strong>the</strong> top quark[11].<br />

The top quark is <strong>of</strong> great experimental importance also for <strong>the</strong> <strong>the</strong>ory <strong>of</strong> Super-Symmetry<br />

(SUSY). This <strong>the</strong>ory predicts a new global space-time symmetry in which a bosonic superpartner<br />

is assigned to every fermion present in <strong>the</strong> St<strong>and</strong>ard Model <strong>and</strong> vice versa. Direct<br />

searches at LEP <strong>and</strong> at Tevatron have set lower limits on masses <strong>of</strong> various SUSY particles.<br />

All <strong>the</strong> limits are substantially above <strong>the</strong> mass <strong>of</strong> <strong>the</strong> bottom quark, but <strong>the</strong> possibility <strong>of</strong> a<br />

top quark decaying into a SUSY particle is not as <strong>of</strong> yet excluded <strong>and</strong> it is an active area <strong>of</strong><br />

research [11, 12].<br />

Figure 1.2: Indirect Electro-Weak constraints to <strong>the</strong> Higgs <strong>Mass</strong> derived from <strong>the</strong> mass<br />

measurements <strong>of</strong> <strong>the</strong> W ± boson <strong>and</strong> <strong>the</strong> top quark. The region below 114.4 GeV/c 2 is<br />

excluded from direct LEP searches.[13]<br />

Not to be forgotten is also <strong>the</strong> role <strong>of</strong> <strong>the</strong> top quark in constraining <strong>the</strong> searches for <strong>the</strong><br />

Higgs boson. Because <strong>of</strong> <strong>the</strong> large Yukawa coupling <strong>the</strong> mass <strong>of</strong> <strong>the</strong> top quark is one <strong>of</strong> <strong>the</strong><br />

main <strong>the</strong>oretical constraints on <strong>the</strong> allowed mass <strong>of</strong> <strong>the</strong> Higgs boson, <strong>the</strong> o<strong>the</strong>r constraint<br />

being <strong>the</strong> mass <strong>of</strong> <strong>the</strong> W. As it can be seen from fig. 1.2 <strong>the</strong>se constraints point to a SM


1.2 The <strong>Top</strong> <strong>Quark</strong> 9<br />

Higgs with mass <strong>of</strong> around or below 114 GeV or a light Higgs. A scenario clearly excluded<br />

by direct LEP searches [13]. Such constraints are <strong>the</strong>refore useful not only for restricting <strong>the</strong><br />

phase space <strong>of</strong> <strong>the</strong> searches but also for interpreting <strong>the</strong> experimental results <strong>and</strong> <strong>the</strong>refore<br />

discriminating between, for example, a SM <strong>and</strong> a SUSY Higgs. It is clear that if a Higgs<br />

like particle is found at 300 GeV <strong>of</strong> mass this is incompatible with a purely SM description<br />

<strong>and</strong> <strong>the</strong> presently measured top mass <strong>of</strong> 171.2 ± 2.1 GeV [14]. An explanation <strong>of</strong> this result<br />

would <strong>the</strong>refore need to come from a different <strong>the</strong>ory.<br />

The aforementioned <strong>the</strong>ories are not in any way <strong>the</strong> only ones that benefit from a detailed<br />

<strong>and</strong> precise study <strong>of</strong> <strong>the</strong> characteristics <strong>of</strong> <strong>the</strong> top quark. The list presented is not meant to<br />

be exhaustive but aims to give an idea <strong>of</strong> <strong>the</strong> central role that <strong>the</strong> top quark plays in <strong>the</strong><br />

St<strong>and</strong>ard Model <strong>and</strong> most Beyond <strong>the</strong> St<strong>and</strong>ard Model <strong>the</strong>ories. They must be <strong>the</strong>refore<br />

viewed as a motivation <strong>of</strong> <strong>the</strong> work presented in <strong>the</strong> following chapters.<br />

1.2.1 Effective Field Theories<br />

It is also important to mention <strong>the</strong> widespread use <strong>of</strong> effective field <strong>the</strong>ories that are generally<br />

independent <strong>of</strong> wider <strong>the</strong>oretical models. Such <strong>the</strong>ories look only at <strong>the</strong> detail <strong>of</strong> a particular<br />

coupling <strong>and</strong> infer all possible extensions to <strong>the</strong> Lagrangian given specific postulates. The<br />

obvious <strong>and</strong> most universally used postulate is <strong>the</strong> conservation <strong>of</strong> local gauge invariance.<br />

In <strong>the</strong> present <strong>the</strong>sis all effective field <strong>the</strong>ories will use this axiom as well as Charge-Parity<br />

conservation. Additionally <strong>the</strong> Lagrangian will be restricted to operators <strong>of</strong> dimension six<br />

or lower <strong>and</strong> all higher order terms will be omitted. Under this assumption one can write<br />

any interaction between any two fermions <strong>and</strong> a vector gauge boson as [15]:<br />

L V f i f j<br />

= ¯fj γ µ (A L P L + A R P R ) f i V µ<br />

+ ¯f j iσ µν q ν (B L P L + B R P R )f i V µ + H.O. (1.10)<br />

where V µ is <strong>the</strong> gauge field <strong>of</strong> <strong>the</strong> boson, q ν is <strong>the</strong> four momentum <strong>of</strong> <strong>the</strong> boson f <strong>and</strong> ¯f are<br />

<strong>the</strong> fermionic fields, A L,V <strong>and</strong> B L,V are <strong>the</strong> form factors, P L,V are <strong>the</strong> left <strong>and</strong> right h<strong>and</strong>ed


1.2 The <strong>Top</strong> <strong>Quark</strong> 10<br />

chiral projection operators, σ µν = i/2(γ µ γ ν − γ ν γ µ ) <strong>and</strong> H.O. are <strong>the</strong> higher order terms.<br />

Using this parametrization it is <strong>the</strong>n possible to derive <strong>the</strong> vertices that are relevant to this<br />

analysis. Firstly <strong>the</strong> decay vertex Wtb [15][16]:<br />

L W tb = − g √<br />

2¯bγ µ (A L P L + A R P R )tW − µ<br />

− g √ 2¯biσ µν q ν<br />

M W<br />

(B WL P L + B WR P R ) tW − µ (1.11)<br />

where <strong>the</strong> t <strong>and</strong> ¯b fermions replace <strong>the</strong> fermionic fields f <strong>of</strong> <strong>the</strong> previous equation <strong>and</strong> <strong>the</strong><br />

W µ boson replaces <strong>the</strong> gauge field V µ , M W is <strong>the</strong> mass <strong>of</strong> <strong>the</strong> gauge boson. According to <strong>the</strong><br />

Cabibbo mixing in <strong>the</strong> St<strong>and</strong>ard Model <strong>the</strong> top quark decays to a b quark via <strong>the</strong> emission<br />

<strong>of</strong> a W boson in 99.8% <strong>of</strong> <strong>the</strong> cases. For all practical purposes <strong>of</strong> this analysis <strong>the</strong> Wtb vertex<br />

will be considered as <strong>the</strong> only top quark decay mode. Experimentally <strong>the</strong> ratio <strong>of</strong> <strong>the</strong> top<br />

quark decay into <strong>the</strong> b quark divided by <strong>the</strong> top quark decays into any negatively charged<br />

quark has been measured to be 0.97 +0.09<br />

−0.08 [8]. A value that is consistent with <strong>the</strong> St<strong>and</strong>ard<br />

model prediction. This also predicts a value <strong>of</strong> 1 for A L <strong>and</strong> 0 for all <strong>the</strong> remaining form<br />

factors.<br />

Additionally also <strong>the</strong> production vertices γt¯t <strong>and</strong> Zt¯t will be used in <strong>the</strong> analysis[15]:<br />

L γ t¯t =<br />

L Z t¯t =<br />

−eQ t¯tγ µ tAµ − e¯t iσµν q ν<br />

(B γL P L + B γR P R )tA µ<br />

m t − g ¯tγ ( ) µ X L P L + X R P R − 2s 2<br />

2c<br />

wQ t tZµ<br />

w<br />

− g<br />

2c w<br />

¯t iσµν q ν<br />

m Z<br />

(B ZL P L + B ZR P R )tZ µ (1.12)<br />

where e is <strong>the</strong> electron charge, Q t is <strong>the</strong> electric charge <strong>of</strong> <strong>the</strong> top quark, A µ <strong>and</strong> Z µ are <strong>the</strong><br />

bosonic gauge fields, m t <strong>and</strong> m Z are <strong>the</strong> masses <strong>of</strong> <strong>the</strong> top quark <strong>and</strong> <strong>the</strong> Z boson, c w <strong>and</strong> s w<br />

are <strong>the</strong> cosine <strong>and</strong> sine <strong>of</strong> <strong>the</strong> Weinberg angle respectively. The slightly different format <strong>of</strong><br />

<strong>the</strong> form factors aims to highlight explicitly <strong>the</strong> Electro-Weak symmetry breaking <strong>and</strong> <strong>the</strong><br />

resulting Weinberg angle. Therefore in <strong>the</strong> SM X L = 2T 3 (t L ) = 1, where T 3 denotes <strong>the</strong>


1.2 The <strong>Top</strong> <strong>Quark</strong> 11<br />

third isospin component <strong>of</strong> SU(2) L . Similarly X R = 2T 3 (t R ) = 0, because <strong>the</strong> Weak force<br />

interacts only with <strong>the</strong> chiral left h<strong>and</strong>ed component. All <strong>the</strong> remaining, higher order form<br />

factors (B γL , B γR , B ZL , B ZR ), are also predicted to have a value <strong>of</strong> 0 in <strong>the</strong> St<strong>and</strong>ard Model.<br />

1.2.2 Observables<br />

Having presented <strong>the</strong> <strong>the</strong>ories that will be considered in this <strong>the</strong>sis it is now important to<br />

define a narrow set <strong>of</strong> observables that will be analysed. As <strong>the</strong> International Linear Collider<br />

(ILC) <strong>and</strong> <strong>the</strong> Silicon Detector (SiD) are in <strong>the</strong> design stages this choice has been driven<br />

not only by <strong>the</strong> underlying physics but also by <strong>the</strong> need for optimization <strong>of</strong> <strong>the</strong> proposed<br />

detector <strong>and</strong> <strong>the</strong> related algorithms.<br />

With such considerations in mind it has been chosen not to pursue any analysis where<br />

<strong>the</strong> top decays leptonically. It has also been chosen to analyse only <strong>the</strong> top pair production<br />

channel <strong>and</strong> disregard <strong>the</strong> single top channel. Therefore for <strong>the</strong> purposes <strong>of</strong> this analysis<br />

<strong>the</strong> process e + e − → t¯t → b¯bq¯qq¯q (representing ≈ 45% <strong>of</strong> all e + e − → t¯t) is considered as <strong>the</strong><br />

signal <strong>and</strong> all remaining St<strong>and</strong>ard Model processes are considered background; <strong>the</strong> signal<br />

process <strong>and</strong> <strong>the</strong> top mediated background processes can be seen in fig. 1.3. The reasoning<br />

behind this decision is two fold. The decision to pursue <strong>the</strong> pair production channel is<br />

based on considerations about <strong>the</strong> sample <strong>and</strong> background statistics. The pair production<br />

channel has a higher number <strong>of</strong> events produced than <strong>the</strong> single top case (at a centre-<strong>of</strong>-mass<br />

energy <strong>of</strong> 500 GeV). Additionally <strong>the</strong>re are significantly less background processes that have<br />

a topology similar to <strong>the</strong> top pair production as more particles are produced in this process.<br />

(a) (b) (c)<br />

Figure 1.3: Feynman diagrams <strong>of</strong> a) signal events, b) <strong>and</strong> c) top mediated background events.


1.2 The <strong>Top</strong> <strong>Quark</strong> 12<br />

The hadronic decay has instead been considered as more challenging for all <strong>the</strong> detector<br />

components; a quark undergoes hadronisation <strong>and</strong> fragmentation before reaching <strong>the</strong> detector<br />

in <strong>the</strong> form <strong>of</strong> a jet, a much more complex object than any lepton. The possible exception<br />

being <strong>the</strong> τ lepton, but a top decays via <strong>the</strong> tau channel in only ≈ 11% <strong>of</strong> <strong>the</strong> events. Because<br />

<strong>of</strong> jet multiplicity <strong>and</strong> <strong>the</strong> lack <strong>of</strong> missing energy it is believed that <strong>the</strong> hadronic process is<br />

<strong>the</strong> best benchmarking process for many detector components <strong>and</strong> reconstruction techniques<br />

described in <strong>the</strong> next chapters. It is also a test for any quark charge identification technique,<br />

which is not needed in any o<strong>the</strong>r <strong>of</strong> <strong>the</strong> top decay channels as it can be reconstructed by<br />

<strong>the</strong> charge <strong>of</strong> <strong>the</strong> detected lepton. These are in return very stringent benchmarks for <strong>the</strong><br />

tracking <strong>and</strong> vertexing as <strong>the</strong> loss <strong>of</strong> even <strong>the</strong> s<strong>of</strong>test track can result in a mis-reconstruction.<br />

The underlying idea behind <strong>the</strong> presented channel selection is <strong>the</strong>refore to test <strong>the</strong> detector<br />

<strong>and</strong> reconstruction as thoroughly as possible, but keep <strong>the</strong> analysis methods simple.<br />

The analysis also focuses on four specific observables: <strong>the</strong> cross section, <strong>the</strong> mass <strong>of</strong> <strong>the</strong><br />

top, <strong>the</strong> forward backward asymmetry <strong>of</strong> <strong>the</strong> top quark <strong>and</strong> <strong>the</strong> forward backward asymmetry<br />

<strong>of</strong> <strong>the</strong> b quark resulting from <strong>the</strong> top decay. In <strong>the</strong> precedent sections <strong>the</strong> importance <strong>of</strong> <strong>the</strong><br />

mass as a constraint to <strong>the</strong> SM Higgs mass has already been discussed. Also o<strong>the</strong>r models<br />

<strong>of</strong> Electro-Weak symmetry breaking have phenomenologies that are highly dependent on<br />

<strong>the</strong> mass <strong>of</strong> <strong>the</strong> top quark with <strong>the</strong> obvious example <strong>of</strong> a SUSY Higgs[12]. The paramount<br />

importance <strong>of</strong> this observable is <strong>the</strong>refore clear; hence <strong>the</strong> inclusion in this analysis. Similarly<br />

straightforward is <strong>the</strong> justification <strong>of</strong> <strong>the</strong> cross section observable. Any discrepancy between<br />

<strong>the</strong> <strong>the</strong>oretically predicted cross section <strong>and</strong> <strong>the</strong> measured one would indicate <strong>the</strong> presence<br />

<strong>of</strong> an unknown production mechanism.<br />

A bit more convoluted is <strong>the</strong> interpretation <strong>of</strong> <strong>the</strong> forward-backward asymmetry observables:<br />

A fb = σ(θ < 90◦ ) − σ(θ > 90 ◦ )<br />

σ(θ < 90 ◦ ) + σ(θ > 90 ◦ )<br />

(1.13)<br />

where σ(θ < 90 ◦ ) is <strong>the</strong> cross section <strong>of</strong> <strong>the</strong> events where <strong>the</strong> quark (t or b) has a polar<br />

angle <strong>of</strong> less than 90 ◦ in <strong>the</strong> centre <strong>of</strong> mass frame <strong>of</strong> reference. These two parameters can be<br />

interpreted as a test <strong>of</strong> <strong>the</strong> Electro-Weak <strong>the</strong>ory couplings. By considering equations 1.12 it


1.2 The <strong>Top</strong> <strong>Quark</strong> 13<br />

is possible to see how any deviation from <strong>the</strong> Electro-Weak predictions will alter <strong>the</strong> relation<br />

between <strong>the</strong> number <strong>of</strong> spin parallel <strong>and</strong> spin anti-parallel top quarks produced. This in<br />

turn will alter <strong>the</strong> stated forward backward asymmetry, which comes as a direct result <strong>of</strong><br />

<strong>the</strong> left h<strong>and</strong>edness <strong>of</strong> <strong>the</strong> SU(2) L <strong>and</strong> spin conservation. By measuring <strong>the</strong> asymmetry one<br />

<strong>the</strong>refore constrains <strong>the</strong> relationship between <strong>the</strong> form factors presented in <strong>the</strong>se equations.<br />

For <strong>the</strong> case <strong>of</strong> equation 1.11 it must be noted that <strong>the</strong> A fb for <strong>the</strong> bottom quark in <strong>the</strong><br />

centre <strong>of</strong> mass frame is really a convolution <strong>of</strong> <strong>the</strong> production t quark asymmetry <strong>and</strong> <strong>the</strong><br />

A fb <strong>of</strong> <strong>the</strong> b quark in <strong>the</strong> top quark rest frame. In order to deconvolute <strong>the</strong> two process<br />

<strong>and</strong> <strong>the</strong>refore be able to set <strong>the</strong> constraints on <strong>the</strong> W tb vertex one must ei<strong>the</strong>r measure <strong>and</strong><br />

<strong>the</strong>refore constrain <strong>the</strong> t quark asymmetry or perform <strong>the</strong> whole measurement in <strong>the</strong> top<br />

quark rest frame. This analysis follows <strong>the</strong> first method <strong>and</strong> <strong>the</strong>refore simply measures A fb<br />

<strong>of</strong> both quarks in <strong>the</strong> centre <strong>of</strong> mass frame.<br />

Finally a centre <strong>of</strong> mass energy has to be chosen. The choice for this analysis is to use<br />

<strong>the</strong> highest energy that will be available at <strong>the</strong> ILC as it is clear that this is where most<br />

<strong>of</strong> <strong>the</strong> runs will be performed. Although this is probably not <strong>the</strong> best method to measure<br />

<strong>the</strong> top mass (a threshold scan <strong>of</strong> <strong>the</strong> limit where top pair production becomes accessible<br />

probably is [17]), it is a belief <strong>of</strong> <strong>the</strong> writer that for <strong>the</strong> more wide <strong>and</strong> generic physics <strong>and</strong><br />

benchmarking purposes <strong>of</strong> this analysis <strong>the</strong> 500 GeV centre-<strong>of</strong>-mass energy is optimal. Not<br />

only this is <strong>the</strong> energy where <strong>the</strong> highest top quark statistics will be available; it is also <strong>the</strong><br />

energy where new particles <strong>and</strong> phenomena are most likely to be observed.


Chapter 2<br />

The ILC <strong>and</strong> <strong>the</strong> SiD Detector<br />

Concept<br />

In order to probe deeper <strong>and</strong> deeper into <strong>the</strong> matter surrounding us higher <strong>and</strong> higher<br />

energies need to be reached. The scales <strong>of</strong> <strong>the</strong> constituents <strong>of</strong> nature that we are probing are<br />

in fact, to a first order approximation, inversely proportional to <strong>the</strong> energies at which <strong>the</strong>y<br />

collide. Bigger energies imply, assuming no paradigm shift in <strong>the</strong> technology, ever larger sizes<br />

for accelerators <strong>and</strong> detectors. Particle physics has <strong>the</strong>refore moved from using cyclotrons<br />

that fit into a small room to modern synchrotrons with circumferences comparable to those<br />

<strong>of</strong> medium sized cities. Historically two main types <strong>of</strong> colliders exist: hadron colliders<br />

(proton-proton <strong>and</strong> proton-antiproton) <strong>and</strong> lepton colliders (electron-positron). The first<br />

ones generally reach higher energies, but suffer from a less clean environment. The proton<br />

is in fact composed <strong>of</strong> quarks <strong>and</strong> <strong>the</strong> sought for interactions occur only between <strong>the</strong>se<br />

constituents, resulting in a large amount <strong>of</strong> proton remnant <strong>and</strong> a large spread <strong>of</strong> possible<br />

interaction energies. Differently, lepton colliders produce a much cleaner environment <strong>and</strong><br />

a well defined interaction. However, as <strong>the</strong> mass <strong>of</strong> electrons is quite small when compared<br />

to <strong>the</strong> proton mass, circular e + e − colliders suffer extensively from synchrotron radiation <strong>and</strong><br />

reaching higher energy in a circular accelerator becomes more <strong>and</strong> more expensive. Above<br />

certain energies a linear collider becomes <strong>the</strong> economically most viable options. It is because<br />

14


2.1 The ILC 15<br />

<strong>of</strong> such reasoning that <strong>the</strong> next high energy collider is expected to be a linear e + e − collider.<br />

This will complement <strong>the</strong> results from <strong>the</strong> current Large Hadron Collider with precision<br />

measurements.<br />

2.1 The ILC<br />

The International Linear Collider (ILC) is a proposed electron-positron collider operating<br />

in <strong>the</strong> centre <strong>of</strong> mass energy range √ s = 200 GeV - 500 GeV, with possible calibration<br />

runs at <strong>the</strong> Z mass energy. An upgrade to <strong>the</strong> centre <strong>of</strong> mass energy <strong>of</strong> 1 TeV is also<br />

envisaged. The basic layout <strong>of</strong> <strong>the</strong> ILC, which can be seen in Fig. 2.1, consists <strong>of</strong> two<br />

almost identical arms; one being <strong>the</strong> electron accelerator, <strong>the</strong> o<strong>the</strong>r <strong>the</strong> positron one. The<br />

electron source, a laser illuminating a photocatode in a DC gun, is located in <strong>the</strong> positron<br />

arm <strong>and</strong> generates 76 MeV electrons. These are afterwards accelerated to 5 GeV as <strong>the</strong>y<br />

are transported into <strong>the</strong> damping ring. The role <strong>of</strong> <strong>the</strong> damping ring is to accumulate <strong>the</strong><br />

beam <strong>and</strong> squeeze <strong>the</strong> electrons into <strong>the</strong> required bunch structure. As <strong>the</strong> bunches circulate<br />

in <strong>the</strong> damping rings <strong>the</strong>y loose energy via synchrotron radiation <strong>and</strong> are re-accelerated by<br />

an RF cavity. The synchrotron radiation decreases <strong>the</strong> motion in any direction, while <strong>the</strong><br />

cavity acceleration is directional; as a result <strong>the</strong> beam emittance is significantly reduced.<br />

After this has been performed <strong>the</strong> electrons are ready to be injected into <strong>the</strong> main linac for<br />

acceleration, which is performed from ≈ 280 RF cavities having an accelerating gradient <strong>of</strong><br />

31.5 MV/m. Some bunches continue straight through to <strong>the</strong> interaction point, o<strong>the</strong>r instead<br />

reach <strong>the</strong> energy <strong>of</strong> 150 GeV <strong>and</strong> are <strong>the</strong>n diverted into <strong>the</strong> undulator. This acts as a source<br />

<strong>of</strong> high energy photons that are later converted into electron positron pairs. The positrons<br />

that are separated from <strong>the</strong> rest <strong>of</strong> <strong>the</strong> beam (photons <strong>and</strong> electrons) are accelerated to 5<br />

GeV <strong>and</strong> transported into <strong>the</strong> damping ring. From here on <strong>the</strong>y follow a path through <strong>the</strong><br />

linac which is identical to <strong>the</strong> one previously described for <strong>the</strong> electrons; until <strong>the</strong> interaction<br />

point (IP) [18]. Here <strong>the</strong> electrons <strong>and</strong> positrons collide with a crossinf angle <strong>of</strong> 14mrad.<br />

Because <strong>of</strong> its size <strong>and</strong> complexity <strong>the</strong> design <strong>of</strong> <strong>the</strong> ILC pushes <strong>the</strong> limits <strong>of</strong> many


2.1 The ILC 16<br />

Figure 2.1: The Proposed International Linear Collider<br />

accelerator technologies. One <strong>of</strong> <strong>the</strong> most challenging tasks is reaching <strong>and</strong> maintaining<br />

<strong>the</strong> required beam emittance. This poses constraints on <strong>the</strong> size <strong>and</strong> <strong>the</strong> technology <strong>of</strong><br />

<strong>the</strong> damping rings. In particular it is difficult to control <strong>the</strong> electron cloud effect that is<br />

generated by squeezing <strong>the</strong> positrons toge<strong>the</strong>r <strong>and</strong> <strong>the</strong> fast ion instabilities that are intrinsic<br />

in <strong>the</strong> electron beam [18]. Even if <strong>the</strong>se sources <strong>of</strong> emittance are kept under control <strong>and</strong><br />

<strong>the</strong> beam reaches <strong>the</strong> desired parameters when exiting <strong>the</strong> damping rings, one must also<br />

make sure to maintain <strong>the</strong> emittance as low as possible while transporting <strong>the</strong> beam to <strong>the</strong><br />

interaction point.<br />

It is <strong>the</strong> emittance requirements at <strong>the</strong> IP, combined with <strong>the</strong> small beam size <strong>of</strong> 640×<br />

5.7 nm, that allows <strong>the</strong> ILC to reach <strong>the</strong> high design luminosity <strong>of</strong> 2 × 10 34 cm −2 s −1 ; but<br />

to achieve <strong>the</strong>se goals <strong>the</strong> alignment <strong>of</strong> <strong>the</strong> machine must be extremely precise. The required<br />

precision is <strong>of</strong> 200 µm over a distance <strong>of</strong> 200m for <strong>the</strong> main accelerator components<br />

<strong>and</strong> slightly lower for <strong>the</strong> damping rings (150 µm over 100 m) [19]. Additionally <strong>the</strong> machine<br />

components have a rotational tolerance <strong>of</strong> only 20 µrad. At <strong>the</strong> present moment <strong>the</strong><br />

technological feasibility <strong>of</strong> <strong>the</strong>se targets is in <strong>the</strong> process <strong>of</strong> being proven.<br />

To deliver any significant luminosity, one must make sure that <strong>the</strong> beams hit precisely<br />

head on. For this purpose an on line beam alignment system will have to be developed. It<br />

will consist <strong>of</strong> a electronic feedback system that will use <strong>the</strong> first few bunches <strong>of</strong> <strong>the</strong> train<br />

to adjust <strong>the</strong> alignment <strong>of</strong> <strong>the</strong> remaining bunches. This can be achieved with <strong>the</strong> usage <strong>of</strong><br />

a a stripline kicker [20]. The problem is complicated by <strong>the</strong> structure <strong>of</strong> <strong>the</strong> ILC beam,<br />

composed from a series <strong>of</strong> trains with very tightly squeezed bunches. In <strong>the</strong> nominal beam<br />

configuration <strong>the</strong>re are 2625 bunches separated by only 369 ns giving a total active train


2.1 The ILC 17<br />

time <strong>of</strong> 0.97 ms followed by a dead time <strong>of</strong> approximately 199 ms [19]. A description <strong>of</strong><br />

<strong>the</strong> main beam parameters <strong>of</strong> <strong>the</strong> ILC can be seen in table 2.1. If <strong>the</strong> aim is to attempt a<br />

realignment in between each bunch crossing, <strong>the</strong>n, by using <strong>the</strong> fastest <strong>of</strong> modern processors,<br />

only a few ma<strong>the</strong>matical operations can be performed in <strong>the</strong> available time. The transport<br />

<strong>of</strong> information also introduces a delay that needs to be taken into account. Ideally <strong>the</strong>refore<br />

a process that will require very few calculations <strong>and</strong> allow for a hardware system that can<br />

be located close to <strong>the</strong> interaction point needs to be developed. The same beam structure,<br />

imposed by <strong>the</strong> choice <strong>of</strong> a cold superconducting technology for <strong>the</strong> RF cavities, is clearly<br />

challenging also for <strong>the</strong> vertex <strong>and</strong> tracking detectors.<br />

ILC Beam Parameters<br />

Parameter<br />

Unit<br />

Bunch Population 2 × 10 10<br />

Number <strong>of</strong> Bunches 2625<br />

Linac Bunch interval ns 369<br />

Horizontal Beam size at IP nm 640<br />

Vertical Beam size at IP nm 5.7<br />

Normalized horizontal emittance at IP mm x mrad 10<br />

Normalized vertical emittance at IP mm x mrad 0.04<br />

Table 2.1: Main beam parameters for <strong>the</strong> ILC.<br />

Technological challenges exist in o<strong>the</strong>r areas <strong>of</strong> <strong>the</strong> ILC project as well. In order to<br />

contain <strong>the</strong> main accelerator in a reasonable length <strong>of</strong> 31 km <strong>the</strong> RF cavities must have<br />

an average acceleration gradient <strong>of</strong> at least 31.5 MV/m. Presently tested RF cavities have<br />

achieved gradients <strong>of</strong> up to 50 MV/m. While this is routinely achieved with only a single<br />

accelerating cell <strong>the</strong> result is not easily scalable. Only recently a first succesful attempt <strong>of</strong><br />

adjacent cavities reaching <strong>the</strong> required acceleration has been performed [21]. Additionally<br />

<strong>the</strong> production yield <strong>of</strong> RF cavities with gradients that meet or exceed <strong>the</strong>se targets are<br />

very low <strong>and</strong> will have to rise substantially in order to make <strong>the</strong> ILC a viable project. The<br />

currently achieved production yield is in <strong>the</strong> region <strong>of</strong> ≈ 50% [22], while <strong>the</strong> aim is to increase<br />

it to 80% [19].<br />

As a broad generalization regarding <strong>the</strong> whole ILC R&D program it is safe to say that,<br />

although not all <strong>the</strong> ILC parameters can be met at <strong>the</strong> present time, all <strong>the</strong> technologies


2.2 The Silicon Detector Concept 18<br />

have been proven <strong>and</strong> all <strong>the</strong> R&D targets appear to be within sight. It is <strong>the</strong>refore expected<br />

that <strong>the</strong> goals will be met in <strong>the</strong> coming years <strong>and</strong> given <strong>the</strong> right scientific <strong>and</strong> political will<br />

<strong>the</strong> building <strong>of</strong> <strong>the</strong> ILC could <strong>the</strong>n start.<br />

2.2 The Silicon Detector Concept<br />

The Silicon Detector concept (SiD) is a general purpose linear detector concept designed<br />

to perform precision measurements <strong>and</strong> at <strong>the</strong> same time be sensitive to a wide range <strong>of</strong><br />

possible new phenomena. The distinguishing features, with respect to <strong>the</strong> o<strong>the</strong>r proposed<br />

detectors, are <strong>the</strong> combination <strong>of</strong> silicon-only tracking <strong>and</strong> highly granular calorimetry optimized<br />

for particle flow algorithms. SiD is composed <strong>of</strong> many separate subdetectors; in<br />

order <strong>of</strong> <strong>the</strong> radial distance from <strong>the</strong> interaction point <strong>the</strong>se are <strong>the</strong> pixel vertex detector,<br />

<strong>the</strong> silicon strip tracker, <strong>the</strong> electromagnetic calorimeter, <strong>the</strong> hadronic calorimeter <strong>and</strong> <strong>the</strong><br />

muon identification system. With <strong>the</strong> exclusion <strong>of</strong> <strong>the</strong> muon system all <strong>the</strong> barrel region<br />

subdetectors are encompassed by a 5 Tesla solenoid. An iron flux return after <strong>the</strong> solenoid<br />

acts as <strong>the</strong> absorbing material for <strong>the</strong> muon identification system <strong>and</strong> also helps with <strong>the</strong><br />

radiation shielding. The picture is completed with <strong>the</strong> presence at very small polar angles<br />

<strong>of</strong> a Luminosity Calorimeter (LumiCal) <strong>and</strong> a Beam Calorimeter (BeamCal) which is actually<br />

mounted inside <strong>the</strong> accelerator’s final focusing magnetic system QD0[23]. An artistic<br />

drawing <strong>and</strong> schematic model <strong>of</strong> <strong>the</strong> SiD detector can be seen in fig. 2.2 <strong>and</strong> 2.2<br />

Before continuing with <strong>the</strong> detailed description <strong>of</strong> <strong>the</strong> subdetectors present in <strong>the</strong> SiD<br />

design it is important to define <strong>the</strong> coordinate scheme that will be used. Given <strong>the</strong> physical<br />

shape <strong>of</strong> <strong>the</strong> detector it is natural to use a set <strong>of</strong> cylindrical coordinates. As it is common in<br />

e + e − colliders <strong>the</strong> z direction is defined as p⃗<br />

e − −p⃗<br />

e +, which given <strong>the</strong> very small beam crossing<br />

angle <strong>of</strong> 14 mrad envisaged at <strong>the</strong> ILC can be approximated to <strong>the</strong> direction <strong>of</strong> <strong>the</strong> electron<br />

beam. The origin is <strong>the</strong>n defined at <strong>the</strong> interaction point. The radius <strong>of</strong> <strong>the</strong> cylinder r is<br />

defined as <strong>the</strong> distance to <strong>the</strong> z-axis. Finally <strong>the</strong> angle φ is defined as <strong>the</strong> angle <strong>of</strong> rotation<br />

around <strong>the</strong> z-axis, with <strong>the</strong> 0 being defined as parallel to <strong>the</strong> ground. Additionally <strong>the</strong> angle


2.2 The Silicon Detector Concept 19<br />

Figure 2.2: An artist’s impression <strong>of</strong> <strong>the</strong> SiD detector<br />

θ is defined from <strong>the</strong> relationship: θ = arctan(r/z).<br />

2.2.1 Vertex Detector<br />

The vertex detector is <strong>the</strong> innermost subdetector <strong>and</strong> its first layer is only a few millimetres<br />

away from <strong>the</strong> beampipe. As <strong>the</strong> name itself suggests <strong>the</strong> primary aim <strong>of</strong> <strong>the</strong> detector is<br />

to enable <strong>the</strong> precise reconstruction <strong>of</strong> vertices. As such it must be able to pinpoint <strong>the</strong><br />

coordinates <strong>of</strong> <strong>the</strong> tracked particle as precisely as possible. SiD proposes a silicon based<br />

pixel detector composed <strong>of</strong> five cylindric silicon layers in <strong>the</strong> barrel region <strong>and</strong> four disks in<br />

each forward region. An additional three outer pixel disks at larger |z| are envisaged in each<br />

forward region to provide a uniform coverage in <strong>the</strong> transition between <strong>the</strong> vertex <strong>and</strong> <strong>the</strong><br />

tracking detector. The barrel layers are 125 mm long <strong>and</strong> are distributed at radii between<br />

14 mm <strong>and</strong> 60 mm. The inner endcaps are located at distances between 76 mm <strong>and</strong> 180<br />

mm from <strong>the</strong> interaction point <strong>and</strong> have an outer radius <strong>of</strong> 75 mm. The outer radius <strong>of</strong><br />

all <strong>the</strong> outer disks is 166 mm <strong>and</strong> <strong>the</strong>ir distances from <strong>the</strong> IP vary between 211 mm <strong>and</strong>


2.2 The Silicon Detector Concept 20<br />

Figure 2.3: Subdetectors’ disposition in a quadrant <strong>of</strong> <strong>the</strong> SiD detector.<br />

834 mm. The inner radius <strong>of</strong> all <strong>the</strong> endcap disks is determined from <strong>the</strong> conical shape <strong>of</strong><br />

<strong>the</strong> beampipe at <strong>the</strong> interaction region <strong>and</strong> spans between 15 mm for <strong>the</strong> closest disk up to<br />

118 mm for <strong>the</strong> fur<strong>the</strong>st one (fig. 2.4). The ultimate aim <strong>of</strong> <strong>the</strong> disposition is to achieve a<br />

hermeticity in <strong>the</strong> polar region <strong>of</strong> | cos θ| ≤ 0.984 [23].<br />

Figure 2.4: R-z view <strong>of</strong> <strong>the</strong> vertex detector. The right h<strong>and</strong> side has been drawn without<br />

<strong>the</strong> support structures.<br />

At present a clear technology for <strong>the</strong> vertex detector has not been selected; although <strong>the</strong><br />

need for a pixel detector has been established. Most <strong>of</strong> <strong>the</strong> present designs base <strong>the</strong>mselves<br />

on CCD, CMOS or DEPFET technologies [23, 24]. A lot <strong>of</strong> research is also being performed<br />

on <strong>the</strong> possibility <strong>of</strong> using vertically integrated “3D” technologies [24]. A more detailed


2.2 The Silicon Detector Concept 21<br />

description <strong>of</strong> one <strong>of</strong> <strong>the</strong> proposed CCD technologies <strong>the</strong> Column Parallel CCD will be<br />

presented in <strong>the</strong> fourth chapter. For <strong>the</strong> purposes <strong>of</strong> <strong>the</strong> simulation a generic 20 × 20 µm 2<br />

pixel is assumed, with a 3.5 µm expected point resolution <strong>of</strong> each layer for <strong>the</strong> barrel <strong>and</strong><br />

<strong>the</strong> inner endcap disks. The resolution <strong>of</strong> <strong>the</strong> outer disks is instead modelled as 7 µm. It is<br />

expected that such a detector, when combined with <strong>the</strong> tracking detector, will allow for an<br />

impact parameter resolution <strong>of</strong> better than σ rφ = σ rz = 5 ⊕ 10/(p × sin 2/3 (θ))[µm] for high<br />

momentum tracks. The final aim <strong>of</strong> having such a small impact parameter resolution is to<br />

permit a very precise determination <strong>of</strong> <strong>the</strong> IP vertex. The impact parameter <strong>of</strong> a track is<br />

defined as <strong>the</strong> distance between <strong>the</strong> track’s point <strong>of</strong> closest approach to <strong>the</strong> IP <strong>and</strong> <strong>the</strong> IP<br />

in <strong>the</strong> selected projection.<br />

Lastly a word must be spent on <strong>the</strong> material budget <strong>and</strong> <strong>the</strong>refore also <strong>the</strong> mechanical<br />

support <strong>of</strong> <strong>the</strong> detector. In <strong>the</strong> detector model used in this <strong>the</strong>sis <strong>the</strong> silicon sensors are<br />

113 µm thick <strong>of</strong> which only <strong>the</strong> outer 20 µm, <strong>the</strong> epitaxial layer, are sensitive. Each barrel<br />

<strong>and</strong> endcap layer is <strong>the</strong>n supported by a 260 µm thick carbon fibre with a 25% coverage.<br />

The combined thickness <strong>of</strong> <strong>the</strong> ladder is <strong>the</strong>refore approximately 0.15% <strong>of</strong> a radiation length<br />

(X 0 ). Different designs also exist with an all silicon support structure, <strong>the</strong>se allow X 0 0.1%<br />

<strong>and</strong> <strong>the</strong>refore an even fur<strong>the</strong>r reduced effect from multiple scattering [25].<br />

2.2.2 Silicon Tracker<br />

The aim <strong>of</strong> <strong>the</strong> silicon tracker is to provide precision track measurements in <strong>the</strong> intermediate<br />

radial region <strong>and</strong> hence contribute to <strong>the</strong> measurement <strong>of</strong> <strong>the</strong> momentum <strong>and</strong> <strong>the</strong> vertex<br />

reconstruction. The proposed design achieves <strong>the</strong> goal by using 300 µm thick strip sensors.<br />

Each strip however gives only a one dimensional measurement in <strong>the</strong> sensor plane. Hence,<br />

in order to pinpoint a precise 3D location, two layers <strong>of</strong> strips tilted at an angle with respect<br />

to each o<strong>the</strong>r are needed for each module <strong>of</strong> <strong>the</strong> detector. The third dimension for <strong>the</strong><br />

barrel region, as in <strong>the</strong> case <strong>of</strong> <strong>the</strong> vertex detector, is given from <strong>the</strong> radial location <strong>of</strong> <strong>the</strong><br />

layer itself. Similarly for <strong>the</strong> endcap region <strong>the</strong> third dimension is determined by <strong>the</strong> layer’s<br />

displacement on <strong>the</strong> z axis.


2.2 The Silicon Detector Concept 22<br />

In <strong>the</strong> currently proposed SiD configuration <strong>the</strong> strips are 10 cm long <strong>and</strong> have a strip<br />

pitch <strong>of</strong> 25 µm; however only every second strip is read out <strong>and</strong> hence <strong>the</strong> readout pitch is<br />

50 µm. The resulting modelled resolution is <strong>of</strong> 7µm in <strong>the</strong> direction <strong>of</strong> <strong>the</strong> strip axis <strong>of</strong> each<br />

layer. The stereo angle between <strong>the</strong> layers is 12 ◦ , a value chosen in order to maximize <strong>the</strong><br />

precision <strong>of</strong> <strong>the</strong> resolution in both angular directions while minimizing <strong>the</strong> reconstruction<br />

confusion in <strong>the</strong> matching <strong>of</strong> <strong>the</strong> hits in <strong>the</strong> two layers. The resulting resolution in <strong>the</strong> z<br />

direction is 7µm/ sin(12 ◦ ) or ≈ 34 µm.<br />

In terms <strong>of</strong> geometry <strong>the</strong> detector will consist <strong>of</strong> 5 barrel modules <strong>and</strong> 4 endcap modules.<br />

Each module consists <strong>of</strong> two layers <strong>of</strong> strip sensors attached back to back in order to allow<br />

for a 3 dimensional measurement. The barrel modules located at radii between 22 cm <strong>and</strong><br />

122 cm. The length <strong>of</strong> <strong>the</strong> modules is between 111.6 cm <strong>and</strong> 304.5 cm. A schematic view <strong>of</strong><br />

<strong>the</strong> tracker can be seen in fig. 2.5.<br />

Figure 2.5: R-z view <strong>of</strong> <strong>the</strong> whole tracking system.<br />

The total material budget <strong>of</strong> <strong>the</strong> combined tracking <strong>and</strong> vertexing detectors is 8% X 0 in<br />

<strong>the</strong> transverse region, rising to approximately 20% in <strong>the</strong> transition between <strong>the</strong> barell <strong>and</strong><br />

endcaps at θ = 40 ◦ <strong>and</strong> <strong>the</strong>n diminishing again to 12% in <strong>the</strong> central endcap region [23, 25].<br />

Such construction should allow for a distance <strong>of</strong> closest approach (DCA) resolution <strong>of</strong> just<br />

below 5 µm for tracks with momentum higher than 10 GeV located in <strong>the</strong> transverse region;


2.2 The Silicon Detector Concept 23<br />

clearly in agreement with <strong>the</strong> stated design aims. The results deteriorates significantly for<br />

angles lower than 20 ◦ <strong>and</strong> momenta lower than a few GeV. The performance <strong>of</strong> <strong>the</strong> tracker<br />

can be seen in fig. 2.6(a) <strong>and</strong> fig. 2.6(b).<br />

(a)<br />

(b)<br />

Figure 2.6: Tracking Resolution: a) momentum resolution with respect to <strong>the</strong> energy <strong>of</strong> <strong>the</strong><br />

track plotted for different detector regions, b) Equivalent plot for <strong>the</strong> impact parameter,<br />

Distance <strong>of</strong> Closest Approach (DCA).<br />

2.2.3 Calorimetry<br />

The SiD design uses a Particle Flow Algorithm (PFA) based approach to Calorimetry. This<br />

algorithm takes advantage <strong>of</strong> precise momentum measurements <strong>of</strong> charged particles in <strong>the</strong><br />

tracker. In practical terms this implies <strong>the</strong> possibility <strong>of</strong>: associating <strong>the</strong> energy deposited<br />

in <strong>the</strong> calorimeter to a specific neutral or charged particle; associating <strong>the</strong> energy deposited<br />

in <strong>the</strong> electromagnetic calorimeter to a specific track <strong>and</strong> separating electrons from charged<br />

hadron tracks. In order to perform <strong>the</strong>se tasks efficiently <strong>the</strong> calorimeter is required to have<br />

excellent segmentation [23, 26].<br />

The resulting PFA reconstructed particles will <strong>the</strong>refore be represented, when <strong>the</strong> particles<br />

are charged, by a track <strong>and</strong> a calorimeter cluster; if <strong>the</strong> particles are not charged by <strong>the</strong><br />

calorimeter cluster only. A visual representation <strong>of</strong> <strong>the</strong> calorimeter <strong>and</strong> <strong>the</strong> particle flow algorithm<br />

can be seen in 2.2.3. Algorithms attempting to perform such functions have already<br />

been applied in existing detectors: ALEPH <strong>and</strong> ZEUS are just two examples <strong>and</strong> have re-


2.2 The Silicon Detector Concept 24<br />

sulted in a significant improvement in <strong>the</strong> energy resolution. However, unlike <strong>the</strong>se detectors,<br />

<strong>the</strong> SiD is being designed in order to make full usage <strong>of</strong> <strong>the</strong> PFA capabilities <strong>and</strong> <strong>the</strong>refore<br />

<strong>the</strong> improvement in <strong>the</strong> performance is expected to be even greater[23]. More specifically <strong>the</strong><br />

SiD detector calorimeters should have imaging capabilities <strong>and</strong> hence be highly segmented in<br />

<strong>the</strong> longitudinal <strong>and</strong> transverse directions. The calorimeters should also be placed inside <strong>the</strong><br />

solenoid in order to minimize <strong>the</strong> amount <strong>of</strong> energy that is deposited in <strong>the</strong> passive material<br />

<strong>and</strong> maximize <strong>the</strong> successful cluster to particle assignment.<br />

Figure 2.7: Simulated ρ → π + π 0 decay in <strong>the</strong> SiD Detector, demonstrating <strong>the</strong> concept <strong>of</strong><br />

Particle Flow<br />

Electromagnetic Calorimeter<br />

In order to be able to confine <strong>the</strong> jet clusters in <strong>the</strong> calorimeter one not only needs to have a<br />

very finely segmented calorimeter, but also needs to prevent <strong>the</strong> overlap <strong>of</strong> <strong>the</strong> electromagnetic<br />

showers. The showers should <strong>the</strong>refore be confined to <strong>the</strong> smallest volume possible <strong>and</strong><br />

<strong>the</strong> lateral spread should be minimized. It is <strong>the</strong>refore essential to choose a design <strong>and</strong> a<br />

material that has a small Moliere radius.<br />

The SiD Electromagnetic Calorimeter (ECAL) has alternating layers <strong>of</strong> tungsten <strong>and</strong><br />

silicon pixel detectors. In both <strong>the</strong> barrel <strong>and</strong> <strong>the</strong> endcap regions <strong>the</strong> inner 20 layers <strong>of</strong><br />

tungsten are 2.5 mm thick, while <strong>the</strong> next 10 layers are 5 mm thick; <strong>the</strong> silicon layers are


2.2 The Silicon Detector Concept 25<br />

always 1.25 mm thick. The resulting detector has a depth <strong>of</strong> 26 radiation lengths (X 0 ) <strong>and</strong><br />

a Moliere radius <strong>of</strong> 13.5 mm. Importantly <strong>the</strong> configuration allows for a track resolution <strong>of</strong><br />

∼ 1mm from <strong>the</strong> calorimeter <strong>and</strong> it can <strong>the</strong>refore be used as a final point in any track reconstruction<br />

algorithm. Apart from <strong>the</strong> obvious advantage in <strong>the</strong> performance <strong>of</strong> <strong>the</strong> tracking<br />

this also helps immensely with <strong>the</strong> association <strong>of</strong> <strong>the</strong> track to a specific calorimeter cluster.<br />

This method <strong>of</strong> calorimetry assisted tracking is particularly well suited for reconstructing<br />

tracks that have originated outside <strong>the</strong> vertex detector as sometimes occurs with Λ <strong>and</strong> K S<br />

decays [23].<br />

The specific technology <strong>of</strong> <strong>the</strong> silicon layers has not yet been chosen <strong>and</strong> two main options<br />

are under consideration. The baseline option is to use commonly available <strong>of</strong>f <strong>the</strong> shelf large<br />

pixel sensors. Each <strong>of</strong> <strong>the</strong>se would be segmented in 1024 13 mm 2 pixels. All readout would<br />

be concentrated in a single chip (KPiX) bump bonded in <strong>the</strong> centre <strong>of</strong> <strong>the</strong> wafer (fig. 2.8(a)).<br />

An alternative option is to use CMOS Monolithic Active Pixel Sensors with pixel sizes <strong>of</strong><br />

50 µm × 50 µm [23]. In this case digital electromagnetic calorimetry would be performed<br />

<strong>and</strong> <strong>the</strong> ECAL would effectively serve as a particle counter. The advantages <strong>of</strong> <strong>the</strong> design<br />

are an excellent imaging resolution <strong>of</strong> <strong>the</strong> calorimeter <strong>and</strong> a simple design <strong>of</strong> <strong>the</strong> pixels; <strong>the</strong>y<br />

effectively act only as counters <strong>of</strong> minimum ionizing particles (MIPs). The problem however<br />

lies in <strong>the</strong> enormous number <strong>of</strong> channels: 1,000,000 for each 5 cm × 5 cm sensor. The stated<br />

aims for both technologies are an efficient (95%) reconstruction <strong>of</strong> photons in jets <strong>and</strong> an<br />

EM energy resolution <strong>of</strong> 18%/ √ E [23]. The possibility <strong>of</strong> obtaining such results has already<br />

been proven with <strong>the</strong> use <strong>of</strong> full simulation using <strong>the</strong> GEANT4 s<strong>of</strong>tware[27].<br />

(a)<br />

(b)<br />

Figure 2.8: The active layer technologies used as a baseline design in <strong>the</strong> a) ECAL: <strong>the</strong> pixel<br />

silicon sensor using <strong>the</strong> KPiX readout b) HCAL: an RPC cell


2.2 The Silicon Detector Concept 26<br />

Hadronic Calorimeter<br />

Facing very similar challenges to <strong>the</strong> ECAL is <strong>the</strong> hadronic calorimeter (HCAL), which is<br />

located just after <strong>the</strong> ECAL (r=142 cm), but still inside <strong>the</strong> solenoid magnet(r=258 cm). The<br />

geometry presented here applies to <strong>the</strong> barrel region, but it is conceptually <strong>the</strong> same also for<br />

<strong>the</strong> forward region where <strong>the</strong> HCAL outer radius is delimited from <strong>the</strong> Muon Identification<br />

System ra<strong>the</strong>r than <strong>the</strong> solenoid. Its size, dictated by <strong>the</strong> nuclear interaction length, needs<br />

to be substantially bigger than <strong>the</strong> one <strong>of</strong> <strong>the</strong> ECAL.<br />

Ideally <strong>the</strong> HCAL would have a very similar segmentation to <strong>the</strong> one just presented for<br />

<strong>the</strong> ECAL; however in order to maintain reasonable complexity <strong>and</strong> costs a much coarser<br />

readout is foreseen for this subdetector. In fact it is composed <strong>of</strong> 40 layers <strong>of</strong> 2 cm thick<br />

stainless steel, acting as absorber plates, separated by 8 mm <strong>of</strong> Resistive Plate Chambers<br />

(RPCs), acting as active detector elements (fig. 2.8(b)). RPCs are gaseous detectors limited<br />

by two resistive plates, typically ei<strong>the</strong>r glass or Bakelite. The outer surface <strong>of</strong> <strong>the</strong> plates is<br />

<strong>the</strong>n coated with a layer <strong>of</strong> resistive paint to which a high voltage is applied. In <strong>the</strong> presented<br />

detector a potential <strong>of</strong> approximately 6.3kV is envisaged. In this operational mode a charged<br />

particle crossing <strong>the</strong> <strong>the</strong> gaseous region (composed mainly <strong>of</strong> Freon R134A) will initiate an<br />

avalanche <strong>of</strong> electrically charged particles; generating a potential difference across <strong>the</strong> RPC<br />

that can be detected. The size <strong>of</strong> each RPC sensor is 1 cm 2 . Such a detector should be able<br />

to give good imaging quality, because <strong>of</strong> <strong>the</strong> small sizes <strong>of</strong> each RPC element, <strong>and</strong> an energy<br />

resolution <strong>of</strong> 65%/ √ E for neutral hadrons [23]. Again <strong>the</strong> result has been validated by a<br />

simulation using <strong>the</strong> GEANT4 s<strong>of</strong>tware [27]. The results <strong>of</strong> a small calorimeter prototype in<br />

a test-beam can be seen in fig. 2.2.3.<br />

It has to be stressed that <strong>the</strong> described technology is <strong>the</strong> one that at presents constitutes<br />

<strong>the</strong> baseline scenario, but it is by no means <strong>the</strong> only possibility being considered. For <strong>the</strong><br />

active detector elements <strong>the</strong>re is a wide variety <strong>of</strong> different proposals; all <strong>of</strong> which are still in<br />

<strong>the</strong> R&D <strong>and</strong> testing stages. These vary from scintillators with Silicon photomultipliers to<br />

Gas Electron Multiplies (GEMs) or MICRO MEsh GAseous Structures (MICROMEGAS)<br />

[23].


2.2 The Silicon Detector Concept 27<br />

Figure 2.9: Response <strong>of</strong> a small prototype hadronic calorimeter to positrons with energies <strong>of</strong><br />

1, 2, 4, 8, 16 GeV. The data points are presented in black, with a gaussian fit performed on<br />

<strong>the</strong>m, <strong>the</strong> red histograms are <strong>the</strong> GEANT4 Simulations. The yellow histograms represent<br />

<strong>the</strong> same data on a different scale. As <strong>the</strong> axis is in this case common to all <strong>the</strong> energies, it<br />

is possible to directly compare <strong>the</strong> performance <strong>of</strong> <strong>the</strong> calorimeter for <strong>the</strong> different energies.<br />

2.2.4 Muon Identification System<br />

The Muon Identification System starts outside <strong>the</strong> calorimeters <strong>and</strong> <strong>the</strong> 5 T magnet solenoid<br />

system at a radius <strong>of</strong> 3.40 m. The system is expected to consist <strong>of</strong> eleven 20 cm thick stainless<br />

steel absorber plates in <strong>the</strong> barrel <strong>and</strong> <strong>the</strong> endcap regions. These also act as a flux return<br />

structure for <strong>the</strong> solenoid magnet. In between <strong>the</strong> steel plates, in gaps <strong>of</strong> 4 cm, RPC active<br />

detector layers are inserted. An additional layer <strong>of</strong> RPC is present between <strong>the</strong> magnetic<br />

solenoid <strong>and</strong> <strong>the</strong> first barrel layer. The RPCs will use a strip detector geometry with a pitch<br />

<strong>of</strong> 3 cm <strong>and</strong> a maximum area covered by each strip <strong>of</strong> 300 cm 2 . In this way an occupancy<br />

from background hits <strong>of</strong> less than 1% is expected. In each detector layer <strong>the</strong>re will be two<br />

strips placed orthogonally to each o<strong>the</strong>r allowing for a precise two dimensional measurement.<br />

Currently investigations are in progress to determine whe<strong>the</strong>r smaller sensors are needed in<br />

<strong>the</strong> layers closest to <strong>the</strong> beampipe. The system is expected to identify more than 96% <strong>of</strong> <strong>the</strong><br />

muons with energies above 4 GeV <strong>and</strong> have extremely high purity. Hadron punch-through<br />

is <strong>the</strong> most problematic background [23].


2.2 The Silicon Detector Concept 28<br />

2.2.5 Forward Detector<br />

The so called Forward Detector represents <strong>the</strong> region with | cos θ| ≥ 0.99. Here one can find<br />

two separate subdetectors: <strong>the</strong> Beam Calorimeter (BeamCal) <strong>and</strong> <strong>the</strong> Luminosity Calorimeter<br />

(LumiCal). Their main aim is to extend <strong>the</strong> hermeticity <strong>of</strong> <strong>the</strong> calorimeter into very<br />

small angles. This is essential for a lot <strong>of</strong> new physics searches since many <strong>of</strong> <strong>the</strong>se reactions<br />

are expected to have a substantial component <strong>of</strong> missing energy. At <strong>the</strong> same time <strong>the</strong>se detectors<br />

can help monitor <strong>the</strong> luminosity produced by <strong>the</strong> ILC. More specifically <strong>the</strong> LumiCal<br />

is designed to monitor <strong>the</strong> integrated luminosity by using <strong>the</strong> small angle Bhabha scattering<br />

<strong>and</strong> <strong>the</strong> e + e − → W + W − interaction. The goal is to measure <strong>the</strong> luminosity with an<br />

accuracy better than 10 −3 . The aim <strong>of</strong> <strong>the</strong> Beamcal is instead to evaluate <strong>the</strong> instantaneous<br />

luminosity by using Bremsstrahlung pairs.<br />

In terms <strong>of</strong> design <strong>the</strong> LumiCal uses <strong>the</strong> same technology as <strong>the</strong> one used for <strong>the</strong> ECAL<br />

<strong>and</strong> it extends from 158 cm to 173 cm in <strong>the</strong> direction <strong>of</strong> <strong>the</strong> beampipe which it surrounds.<br />

The inner radius is 6 cm <strong>and</strong> <strong>the</strong> outer radius is 20 cm. The energy resolution <strong>of</strong> <strong>the</strong><br />

LumiCal for electrons with 1 GeV energy is 15%/ √ E <strong>and</strong> it rises to 20%/ √ E for <strong>the</strong> most<br />

energetic events envisaged at <strong>the</strong> ILC. The BeamCal is instead composed <strong>of</strong> 50 layers <strong>of</strong> 2.5<br />

cm tungsten intercalated with an active silicon layer. The detector is located at 295 cm from<br />

<strong>the</strong> interaction point <strong>and</strong> spans between 2 cm <strong>and</strong> 13.5 cm in radius as it can be seen in fig.<br />

2.10 [23, 25].<br />

Figure 2.10: Forward Region


2.2 The Silicon Detector Concept 29<br />

2.3 The International Large Detector<br />

The International Large Detector (ILD) is ano<strong>the</strong>r proposed multi-purpose detector for <strong>the</strong><br />

ILC [28]. Its general features are similar to <strong>the</strong> ones just presented for <strong>the</strong> SiD. The main<br />

differences lie in <strong>the</strong> presence <strong>of</strong> a Time Projection Chamber that replaces part <strong>of</strong> <strong>the</strong> silicon<br />

strip tracker, in <strong>the</strong> use <strong>of</strong> a lower magnetic field parameter (3.5 T instead <strong>of</strong> 5 T) <strong>and</strong> in<br />

<strong>the</strong> consequently resulting larger overall dimensions. The choice <strong>of</strong> magnetic field also forces<br />

<strong>the</strong> beampipe to be slightly larger than in SiD (1.45 cm) in order to filter out most <strong>of</strong> <strong>the</strong><br />

pair production background.<br />

Although <strong>the</strong> presented analysis has been ultimately performed on a model <strong>of</strong> <strong>the</strong> SiD<br />

Detector a lot <strong>of</strong> <strong>the</strong> s<strong>of</strong>tware tools have been developed <strong>and</strong> tested for <strong>the</strong> ILD <strong>and</strong> its<br />

extremely similar predecessor <strong>the</strong> Large Detector Concept (LDC). In particular <strong>the</strong> flavour<br />

tagging package that is presented in this <strong>the</strong>sis has been widely tested on <strong>the</strong> LDC detector<br />

<strong>and</strong> only afterwards used for SiD.


Chapter 3<br />

Simulation <strong>and</strong> Reconstruction<br />

Given <strong>the</strong> size, complexity <strong>and</strong> <strong>the</strong> financial commitment that modern detectors require <strong>the</strong>y<br />

need to be simulated <strong>and</strong> <strong>the</strong>ir performance analysed in a virtual environment before <strong>the</strong>ir<br />

construction is started. The purpose <strong>of</strong> such an exercise is to optimize <strong>the</strong> detector design,<br />

implement <strong>the</strong> needed algorithms <strong>and</strong> test <strong>the</strong> physics capabilities <strong>of</strong> <strong>the</strong> project. This<br />

process can be separated into three distinctive stages: Monte Carlo generation, detector<br />

simulation <strong>and</strong> event reconstruction. In a running experiment <strong>the</strong>se simulations would be<br />

compared to <strong>the</strong> collected data.<br />

3.1 Monte Carlo Generation<br />

The first step <strong>of</strong> <strong>the</strong> process is to reproduce our underst<strong>and</strong>ing <strong>of</strong> <strong>the</strong> St<strong>and</strong>ard Model. The<br />

Feynman diagrams <strong>of</strong> <strong>the</strong> desired events are hence generated <strong>and</strong> run through <strong>the</strong> hadronization,<br />

fragmentation <strong>and</strong> showering processes. It is possible at this stage to simplify <strong>the</strong> modelling<br />

by omitting any <strong>of</strong> <strong>the</strong> Initial State Radiation (ISR) or Final State Radiation processes<br />

(FSR). This is usually done in <strong>the</strong> initial stage <strong>of</strong> an analysis in order to gain an intuitive<br />

underst<strong>and</strong>ing <strong>of</strong> <strong>the</strong> event by concentrating solely on <strong>the</strong> underlying physics. Although all<br />

<strong>the</strong> results here presented incorporate <strong>the</strong>se effects many algorithms <strong>and</strong> techniques were<br />

initially studied with a simplified sample. Additionally one should also model <strong>the</strong> back-<br />

30


Chapter 3. Simulation <strong>and</strong> Reconstruction 31<br />

ground generated from <strong>the</strong> beam-beam interactions which is a source <strong>of</strong> underlying e + e −<br />

pairs. These however have not been included in <strong>the</strong> present study because <strong>of</strong> <strong>the</strong> substantial<br />

increase <strong>of</strong> computational resources required. A small study has instead been performed<br />

that shows <strong>the</strong> very limited effect that beam-beam background has on <strong>the</strong> b quark tagging<br />

efficiencies. For this specific study <strong>the</strong> GUINEA PIG [29] s<strong>of</strong>tware has been used. The<br />

programs used for <strong>the</strong> Feynman diagrams generation were: Whizard Monte Carlo [30, 31]<br />

for all <strong>the</strong> analysis presented except for <strong>the</strong> ILD/LDC flavour identification sections where<br />

PYTHIA was used. All <strong>the</strong> remaining processes were modelled in PYTHIA[32]. The output<br />

<strong>of</strong> <strong>the</strong> program is a list <strong>of</strong> stable <strong>and</strong> decayed particles with <strong>the</strong>ir points <strong>of</strong> origin, points <strong>of</strong><br />

decay (if applicable), energy <strong>and</strong> momenta.<br />

3.2 Detector Simulation<br />

The detector simulation is <strong>the</strong> subsequent step <strong>of</strong> <strong>the</strong> process. In fact, since <strong>the</strong> ILC <strong>and</strong> <strong>the</strong><br />

SiD are still in <strong>the</strong> design stages it is clear that no physical detector exists <strong>and</strong> any detector<br />

description such as <strong>the</strong> one that has been just performed in <strong>the</strong> previous chapter, is <strong>the</strong><br />

result <strong>of</strong> a computational simulation <strong>of</strong> a plausible detector. Also in quite a few areas <strong>of</strong> <strong>the</strong><br />

detector many technologies are still under investigation. In such occurrences <strong>the</strong> modelling<br />

has been performed by using as generic a technology as possible <strong>and</strong> a series <strong>of</strong> details <strong>and</strong><br />

technicalities were omitted. In this process <strong>the</strong> interaction <strong>of</strong> <strong>the</strong> generated particles with<br />

<strong>the</strong> detector material is reproduced. The product is a list <strong>of</strong> simulated detector hits which<br />

contain <strong>the</strong> location where <strong>the</strong> particles have interacted with <strong>the</strong> detector <strong>and</strong> <strong>the</strong> energy<br />

deposited by <strong>the</strong> particle at each specific location. However <strong>the</strong> information is available only<br />

for <strong>the</strong> material that has been labelled as active in <strong>the</strong> detector description.<br />

As most <strong>of</strong> <strong>the</strong> presently produced particle physics analyses based on simulation also this<br />

one uses <strong>the</strong> GEANT4 framework [27]. However, given its immense complexity <strong>the</strong> code<br />

is usually accessed via a simplified interface. The SiD detector concept uses <strong>the</strong> SLIC [33]<br />

s<strong>of</strong>tware as its wrapper <strong>of</strong> choice. The ILD collaboration uses <strong>the</strong> Mokka package [34, 35].


Chapter 3. Simulation <strong>and</strong> Reconstruction 32<br />

Digitization is <strong>the</strong> next step; <strong>the</strong> simulated hits are smeared according to <strong>the</strong> actual<br />

detector resolution capabilities. The aim is to model <strong>the</strong> physical properties <strong>of</strong> <strong>the</strong> detector<br />

(such as pixel sizes or strip pitches) <strong>and</strong> hence smear <strong>the</strong> position <strong>and</strong> <strong>the</strong> energy <strong>of</strong> <strong>the</strong><br />

simulated hits as a real detector would.<br />

3.3 Reconstruction<br />

Finally <strong>the</strong> reconstruction stage is performed. It is now assumed that all <strong>the</strong> data that<br />

would be produced from a real detector has been effectively simulated. The remaining<br />

reconstruction algorithms should <strong>the</strong>refore be <strong>the</strong> same as for <strong>the</strong> case <strong>of</strong> real data. Their aim<br />

is to reconstruct <strong>the</strong> event by using only <strong>the</strong> data that <strong>the</strong> detector would provide. Clearly<br />

in simulated data much more information is available, this information however must not be<br />

used for <strong>the</strong> physics analysis but only for <strong>the</strong> purpose <strong>of</strong> algorithm benchmarking. In <strong>the</strong><br />

following subsection we will describe <strong>the</strong> tracking, <strong>the</strong> PFA <strong>and</strong> <strong>the</strong> jet finding algorithms.<br />

The flavour tagging <strong>and</strong> <strong>the</strong> quark charge reconstruction algorithms which are more vital to<br />

this analysis will be described in more detail in separate chapters.<br />

3.3.1 Tracking<br />

The aim <strong>of</strong> <strong>the</strong> tracking algorithm is to transform a series <strong>of</strong> tracker hits into a set <strong>of</strong> tracks<br />

with <strong>the</strong> highest possible reconstruction efficiency <strong>and</strong> <strong>the</strong> lowest possible number <strong>of</strong> fake<br />

tracks. Apart from describing <strong>the</strong> path taken by <strong>the</strong> particle through <strong>the</strong> vertex <strong>and</strong> tracking<br />

detector <strong>the</strong> track object must also contain <strong>the</strong> relevant error matrix on <strong>the</strong> reconstructed<br />

path <strong>and</strong> a track momentum estimation. In <strong>the</strong> SiD detector <strong>the</strong> tracking is separated into<br />

two stages <strong>and</strong> <strong>the</strong>refore algorithms[23, 36].<br />

In <strong>the</strong> first algorithm track finding is controlled by a set <strong>of</strong> strategies. A strategy consist<br />

<strong>of</strong> <strong>the</strong> list <strong>of</strong> detector layers to be used <strong>and</strong> <strong>the</strong> role <strong>of</strong> each layer, requirements on <strong>the</strong><br />

number <strong>of</strong> hits, kinematic constraints (momentum <strong>and</strong> impact parameters) <strong>and</strong> <strong>the</strong> χ 2 cut


Chapter 3. Simulation <strong>and</strong> Reconstruction 33<br />

to be applied. The layers’ roles can be <strong>of</strong> three types: seed, confirm <strong>and</strong> extend. Given a<br />

strategy <strong>the</strong> tracking algorithm proceeds in four steps:<br />

1. Three seed layers are expected in each strategy; all <strong>the</strong> hits found in <strong>the</strong>se three layers<br />

are used to perform all possible helix fits given <strong>the</strong> kinematic constraints. If helixes<br />

can be formed that are consistent with <strong>the</strong> constraints <strong>and</strong> <strong>the</strong> χ 2 <strong>the</strong>n <strong>the</strong> helixes are<br />

considered as track c<strong>and</strong>idates, o<strong>the</strong>rwise <strong>the</strong>y are rejected.<br />

2. The algorithm <strong>the</strong>n proceeds to consider <strong>the</strong> confirmation layer(s). Each track c<strong>and</strong>idate<br />

must have a suitable hit that will allow a helix refit without exceeding <strong>the</strong> required<br />

χ 2 in each <strong>of</strong> <strong>the</strong> confirmation layers, else <strong>the</strong> track is rejected. Generally it has been<br />

found that strategies with only one confirmation layer work best.<br />

3. Then <strong>the</strong> hits <strong>of</strong> <strong>the</strong> extension layers are considered. If a hit is consistent with any <strong>of</strong><br />

<strong>the</strong> tracks that passed <strong>the</strong> previous stages it is assigned to this track, else <strong>the</strong> hit is<br />

ignored. Regardless whe<strong>the</strong>r <strong>the</strong>re has been a sucessful match in any <strong>of</strong> <strong>the</strong> extension<br />

layers <strong>the</strong> track is kept.<br />

4. In <strong>the</strong> final stage all track c<strong>and</strong>idates that do not meet <strong>the</strong> required number <strong>of</strong> hits are<br />

rejected. Additionally all ambiguities deriving from different tracks sharing hits are<br />

resolved.<br />

A complete tracking algorithm consists <strong>of</strong> multiple strategies which are designed to be exhaustive<br />

so that all possible hit combinations are considered. Any possible conflict between<br />

strategies are resolved so that all tracks can share a maximum <strong>of</strong> one hit. All <strong>the</strong> strategies<br />

used in this <strong>the</strong>sis require 1 confirmation hit, a minimum <strong>of</strong> 7 hits (6 in <strong>the</strong> barrel region),<br />

p t > 200 MeV, xy <strong>and</strong> z distances <strong>of</strong> closest approach to <strong>the</strong> IP < 10 mm.<br />

The second algorithm aims to perform calorimeter assisted tracking <strong>and</strong> it is run subsequently<br />

to <strong>the</strong> one just described. All <strong>the</strong> tracks that are initially found are propagated into<br />

<strong>the</strong> ECAL. Clusters <strong>of</strong> calorimeter hits that match <strong>the</strong> track trajectory are <strong>the</strong>n associated<br />

to <strong>the</strong> found tracks. All o<strong>the</strong>r clusters that contain hits in <strong>the</strong> inner layers <strong>of</strong> <strong>the</strong> ECAL


Chapter 3. Simulation <strong>and</strong> Reconstruction 34<br />

<strong>and</strong> that are consistent with <strong>the</strong> hypo<strong>the</strong>sis <strong>of</strong> being produced by a MIP are considered as<br />

c<strong>and</strong>idate track hits <strong>and</strong> are used as seeds. The seeds, which contain both <strong>the</strong> position <strong>and</strong><br />

<strong>the</strong> direction <strong>of</strong> <strong>the</strong> track at <strong>the</strong> entry point in <strong>the</strong> ECAL, are <strong>the</strong>n propagated back into<br />

<strong>the</strong> tracker. In this process <strong>the</strong>y can pick up any unassigned track hits. After checking for<br />

<strong>the</strong> quality <strong>of</strong> <strong>the</strong> track <strong>and</strong> resolving ambiguities a fur<strong>the</strong>r list <strong>of</strong> tracks is provided. This<br />

method is particularly powerful for reconstructing tracks from long lived particles that decay<br />

outside <strong>the</strong> vertex detector [23, 36].<br />

The final performance <strong>of</strong> <strong>the</strong> combined tracking algorithms can be seen in fig. 3.1(a) <strong>and</strong><br />

3.1(b).<br />

(a)<br />

(b)<br />

Figure 3.1: Tracking Efficiency: a) efficiency vs. cos(θ) where <strong>the</strong> gap between <strong>the</strong> barrel<br />

<strong>and</strong> <strong>the</strong> forward regions at values <strong>of</strong> ≈ cos(θ) = 0.7 can be clearly seen , b) efficiency vs.<br />

<strong>the</strong> momentum <strong>of</strong> <strong>the</strong> track that is being reconstructed, where <strong>the</strong> excellent efficiency <strong>of</strong><br />

<strong>the</strong> reconstruction above 0.2 GeV can be seen. The tracks from <strong>the</strong> t¯t sample described in<br />

chapter 5 were used as <strong>the</strong> sample.<br />

3.3.2 Particle Flow Algorithm<br />

The idea behind <strong>the</strong> PFA algorithm is to separate <strong>the</strong> calorimeter energy deposits <strong>of</strong> each<br />

individual particle allowing <strong>the</strong>refore <strong>the</strong> measurement <strong>of</strong> <strong>the</strong> energy <strong>of</strong> <strong>the</strong> particle in <strong>the</strong><br />

optimal subsystem: <strong>the</strong> ECAL for <strong>the</strong> photon, <strong>the</strong> tracker for charged particles <strong>and</strong> both<br />

calorimeters for neutral hadrons. In order to achieve <strong>the</strong> goal <strong>the</strong> PFA takes as inputs <strong>the</strong><br />

energy deposits in <strong>the</strong> calorimeters <strong>and</strong> <strong>the</strong> muon systems <strong>and</strong> <strong>the</strong> tracks derived from <strong>the</strong><br />

tracking algorithm. The algorithm can be described as observing <strong>the</strong> following steps [26, 23]:<br />

1. Validation <strong>of</strong> <strong>the</strong> Input: all calorimeter hits that are not-physical or unmeasurable are


Chapter 3. Simulation <strong>and</strong> Reconstruction 35<br />

discarded. Generally <strong>the</strong>se are hits below an energy threshold or occurring later than<br />

100 ns from <strong>the</strong> primary interaction.<br />

2. Electron, Muon <strong>and</strong> Photon Reconstruction: muons are identified by extrapolating<br />

tracks through <strong>the</strong> ECAL <strong>and</strong> HCAL <strong>and</strong> requiring <strong>the</strong>m to connect to a MIP stub in<br />

<strong>the</strong> muon ID system. Electrons <strong>and</strong> photons leave a distinctive shower in <strong>the</strong> ECAL<br />

which are reconstructed via a clustering algorithm. If a cluster momentum <strong>and</strong> position<br />

matches <strong>the</strong> parameters <strong>of</strong> one <strong>of</strong> <strong>the</strong> tracks it is declared as an electron cluster else it<br />

is defined as a photon.<br />

3. Charged Hadrons Identification: <strong>the</strong> first step is to identify <strong>the</strong> MIP stubs that derive<br />

from charged hadrons travelling through <strong>the</strong> detector, but not showering. In <strong>the</strong> process<br />

<strong>the</strong> tracking information is once again used as non-leptonic tracks are propagated into<br />

<strong>the</strong> detector <strong>and</strong> nearby MIP hits are considered being part <strong>of</strong> <strong>the</strong> stubs. At this point<br />

all <strong>the</strong> remaining hits are expected to derive from hadronic showers. A set <strong>of</strong> algorithms<br />

is <strong>the</strong>n used to separate <strong>the</strong> shower into distinctive clusters. The tracking information<br />

is used again; for each track that has not been identified as a muon or electron a seed<br />

ECAL cluster is found. The track can also be associated first to a MIP stub <strong>and</strong> <strong>the</strong>n<br />

to a cluster. Clusters are <strong>the</strong>n iteratively added to <strong>the</strong> seed by considering <strong>the</strong> point<br />

<strong>of</strong> closest approach <strong>of</strong> <strong>the</strong> track <strong>and</strong> o<strong>the</strong>r discriminating variables. The algorithm<br />

stops when ei<strong>the</strong>r <strong>the</strong>re are no more clusters that would pass our predetermined cuts<br />

or when <strong>the</strong> energy <strong>of</strong> <strong>the</strong> cluster becomes too large with respect to <strong>the</strong> energy <strong>of</strong> <strong>the</strong><br />

track. The same procedure is repeated for every non-leptonic track. A whole list <strong>of</strong><br />

ambiguities are <strong>the</strong>n resolved. For example if <strong>the</strong> energy <strong>of</strong> <strong>the</strong> track is substantially<br />

higher than <strong>the</strong> energy <strong>of</strong> <strong>the</strong> cluster <strong>the</strong> association criteria are relaxed <strong>and</strong> nearby<br />

cluster are added.<br />

4. Neutral Hadrons Identification: a similar algorithm is applied to <strong>the</strong> one previously<br />

described for <strong>the</strong> charged hadrons. However given that <strong>the</strong>re are no tracks associated<br />

with neutral hadrons <strong>the</strong> tracking information cannot be used <strong>and</strong> clusters are merged<br />

only by <strong>the</strong>ir calorimetry signatures.


Chapter 3. Simulation <strong>and</strong> Reconstruction 36<br />

5. Production <strong>of</strong> Reconstructed Particles: finally all <strong>the</strong> information is collected in a set<br />

<strong>of</strong> unambiguously reconstructed particles that are ready to be used in any physics<br />

analysis. The energies <strong>of</strong> <strong>the</strong> particles are established from <strong>the</strong> optimal subsystem.<br />

The performance <strong>of</strong> this algorithm at different energies can be seen in table 3.1 [26]. It<br />

can bee noted that <strong>the</strong> performance deteriorates with <strong>the</strong> increase in <strong>the</strong> jet energy. This<br />

is because <strong>the</strong> leakage out <strong>of</strong> <strong>the</strong> back <strong>of</strong> <strong>the</strong> calorimeter becomes an important effect <strong>and</strong><br />

because <strong>the</strong> separation <strong>of</strong> <strong>the</strong> clusters becomes more difficult.<br />

PFA Performance<br />

Resolution<br />

Process Barrel Endcap<br />

e + e − → q¯q, √ s = 100GeV 3.7% 3.8%<br />

e + e − → q¯q, √ s = 200GeV 3.0% 3.2%<br />

e + e − → q¯q, √ s = 360GeV 2.7% 2.7%<br />

e + e − → q¯q, √ s = 500GeV 3.5% 3.3%<br />

e + e − → Z(¯q)Z(ν¯ν), 4.7% 3.9%<br />

Table 3.1: PFA performance for SiD02. For <strong>the</strong> e + e → q¯q processes, <strong>the</strong> rms90 <strong>of</strong> <strong>the</strong> energy<br />

sum is quoted as a fraction <strong>of</strong> √ s, <strong>and</strong> for <strong>the</strong> e + e → ZZ process <strong>the</strong> rms90 <strong>of</strong> <strong>the</strong> dijet<br />

mass is quoted as a fraction <strong>of</strong> m Z . The term rms90 is defined as <strong>the</strong> root mean square <strong>of</strong><br />

<strong>the</strong> contiguous block <strong>of</strong> 90% <strong>of</strong> events with smallest root mean square values.<br />

3.3.3 Jet Finding<br />

It is well known that a quark cannot remain free <strong>and</strong> it hence undergoes hadronization <strong>and</strong><br />

fragmentation into a number <strong>of</strong> colourless hadronic particles. Assuming that <strong>the</strong> momenta <strong>of</strong><br />

<strong>the</strong> particles are substantially higher than <strong>the</strong>ir masses <strong>the</strong> relativistic boost will ensure that<br />

<strong>the</strong> direction <strong>of</strong> motion will be similar for all resulting particles. Typically <strong>the</strong> directions<br />

<strong>of</strong> <strong>the</strong> newly created particles form a cone around <strong>the</strong> direction <strong>of</strong> motion <strong>of</strong> <strong>the</strong> original<br />

parton. The main purpose <strong>of</strong> any jet finding algorithm is to combine <strong>the</strong>se particles into<br />

a new entity called a jet. It is clear from <strong>the</strong> definition that <strong>the</strong> concept <strong>of</strong> a jet is very<br />

variable. For example a hadronically decaying top will generate three jets if decaying at rest<br />

<strong>and</strong> only one if it is highly boosted. Never-<strong>the</strong>-less <strong>the</strong> concept is extremely powerful as it<br />

enables us to study <strong>the</strong> original jet-producing particle.


Chapter 3. Simulation <strong>and</strong> Reconstruction 37<br />

The jet algorithm used in this analysis is <strong>the</strong> y-cut algorithm with a fixed number <strong>of</strong> jets.<br />

More specifically in <strong>the</strong> flavour tagging section <strong>the</strong> number <strong>of</strong> jets is fixed to two <strong>and</strong> in <strong>the</strong><br />

hadronic t¯t analysis is fixed to 6. The reason is <strong>the</strong> topology <strong>of</strong> <strong>the</strong> events. In <strong>the</strong> flavour<br />

tagging section e + e − → Z/γ → q¯q events are considered, which naturally produce 2 jets. A<br />

hadronic t¯t event at 500 GeV is instead expected to have 6 separate jets. The following is a<br />

brief description <strong>of</strong> <strong>the</strong> algorithm [37]:<br />

1. The discriminating quantity y is calculated for each pair <strong>of</strong> reconstructed particles:<br />

y ij = 2min(E2 i ,E 2 j )(1 − cos(θ ij ))<br />

E 2 vis<br />

(3.1)<br />

where E i (E j ) is <strong>the</strong> energy <strong>of</strong> <strong>the</strong> reconstructed particle i (j), θ ij is <strong>the</strong> angle between<br />

<strong>the</strong> two particles <strong>and</strong> E vis is <strong>the</strong> total visible energy defined as E vis = Σ i E i .<br />

2. If <strong>the</strong> number <strong>of</strong> particles is more than <strong>the</strong> desired one combine <strong>the</strong> two entities that<br />

produce <strong>the</strong> lowest y ij value into a new pseudo-particle where ⃗p = ⃗p i + ⃗p j <strong>and</strong> repeat<br />

step one, else quit algorithm.<br />

Additionally to <strong>the</strong> list <strong>of</strong> jets <strong>the</strong> algorithm also outputs <strong>the</strong> value <strong>of</strong> y for <strong>the</strong> last<br />

pseudo-particle association made <strong>and</strong> <strong>the</strong> value <strong>of</strong> y for <strong>the</strong> next association that could have<br />

been made. The first value (y 56 ) represents <strong>the</strong> jet separation between an event with five<br />

<strong>and</strong> an event with six jet topology. The second value (y 67 ) is <strong>the</strong> six <strong>and</strong> seven jet topology<br />

separation. These values are used in <strong>the</strong> presented analysis.<br />

3.3.4 The S<strong>of</strong>tware Framework<br />

Although it is not an algorithm it is important to mention <strong>the</strong> framework in which <strong>the</strong><br />

analysis has been reconstructed. Two separate frameworks were used: <strong>the</strong> SiD developed<br />

org.lcsim package [36] <strong>and</strong> <strong>the</strong> ILD Marlin plus Marlin Reco package [38]. The tracking <strong>and</strong><br />

<strong>the</strong> PFA have always been done within <strong>the</strong> recommended package, so org.lcsim [36] for <strong>the</strong><br />

analysis events <strong>and</strong> Marlin for <strong>the</strong> flavour tagging events. Jet clustering <strong>and</strong> flavour tagging


Chapter 3. Simulation <strong>and</strong> Reconstruction 38<br />

have however always been performed in <strong>the</strong> Marlin environment [38, 39]. Also <strong>the</strong> physics<br />

analysis s<strong>of</strong>tware has been developed <strong>and</strong> run within <strong>the</strong> Marlin framework. As we can see<br />

for all <strong>the</strong> events produced in SiD <strong>the</strong> s<strong>of</strong>tware framework has been switched in <strong>the</strong> middle<br />

<strong>of</strong> <strong>the</strong> reconstruction process. The procedure has been facilitated by <strong>the</strong> shared data storage<br />

s<strong>of</strong>tware LCIO [40] <strong>and</strong> by <strong>the</strong> almost complete lack <strong>of</strong> dependence on <strong>the</strong> detector geometry<br />

<strong>of</strong> <strong>the</strong> flavour tagging <strong>and</strong> analysis s<strong>of</strong>tware. Although <strong>the</strong> transition has been smooth this<br />

analysis has been <strong>the</strong> first one to perform it <strong>and</strong> it has helped to underst<strong>and</strong> <strong>and</strong> resolve any<br />

remaining incompatibility.


Chapter 4<br />

The Column Parallel CCD as an ILC<br />

Vertex Detector Technology<br />

The simplest form <strong>of</strong> a Charged Coupled Device (CCD) invented in 1969 [41] is essentially<br />

an array <strong>of</strong> Metal Oxide Semiconductor (MOS) capacitors. By applying different voltages to<br />

nearby gates a potential well can be created under a specific gate <strong>and</strong> charge can <strong>the</strong>refore<br />

be stored. In order to accumulate <strong>the</strong> charge in a constrained space, but also to prevent <strong>the</strong><br />

charge from being located on <strong>the</strong> sensor’s surface, all modern imaging CCDs add a buried<br />

channel structure. The CCD can <strong>the</strong>refore be interpreted as a MOS capacitor with a p-n<br />

junction structure beneath it as seen in fig. 4.1(a).<br />

If <strong>the</strong> potential <strong>of</strong> <strong>the</strong> gates is <strong>the</strong>n altered it is possible to transfer charge from one pixel<br />

to ano<strong>the</strong>r. The charge can be transferred in <strong>the</strong> desired direction ei<strong>the</strong>r by <strong>the</strong> use <strong>of</strong> three<br />

different clock voltages (three phase CCD) or by using only two clock voltages in combination<br />

with an inbuilt barrier in <strong>the</strong> form <strong>of</strong> altered ion implantation under part <strong>of</strong> <strong>the</strong> pixel (two<br />

phase CCD). The CCD technology studied by <strong>the</strong> LCFI collaboration is <strong>of</strong> <strong>the</strong> latter type.<br />

In a traditional CCD a single readout is used for a whole matrix <strong>of</strong> pixels; charge is <strong>the</strong>refore<br />

transported first in one direction <strong>and</strong> <strong>the</strong>n perpendicularly. In <strong>the</strong> proposed Column Parallel<br />

CCD (CPC) an amplifier is instead attached at <strong>the</strong> end <strong>of</strong> each column. The number <strong>of</strong> clock<br />

cycles needed to read <strong>the</strong> whole array is hence reduced by a factor equal to <strong>the</strong> width <strong>of</strong> <strong>the</strong><br />

39


Chapter 4. The Column Parallel CCD as an ILC Vertex Detector Technology40<br />

(a)<br />

(b)<br />

Figure 4.1: Representation <strong>of</strong> Charged Coupled Devices: a) <strong>Cross</strong> section <strong>and</strong> physical<br />

concept behind charge storage. The line below <strong>the</strong> diagram represents <strong>the</strong> potential well<br />

below every gate. The plot is shown when gate one is in accumulation state. Q1, Q2 <strong>and</strong><br />

Q3 are <strong>the</strong> charges that have been accumulated due to <strong>the</strong> interaction <strong>of</strong> three Minimum<br />

Ionizing Particles (MIP), with <strong>the</strong> silicon <strong>and</strong> <strong>the</strong> subsequent creation <strong>of</strong> electron-hole pairs.<br />

b) Photograph <strong>of</strong> <strong>the</strong> 2 nd generation Column Parallel CCD by LCFI, CPC2-10<br />

sensor expressed in number <strong>of</strong> pixels. The two different concepts are illustrated in fig. 4.2(a)<br />

<strong>and</strong> fig. 4.2(b). The charge is prevented from spreading in <strong>the</strong> not desired directions by <strong>the</strong><br />

usage <strong>of</strong> doped ion implants <strong>and</strong> channel stops underneath <strong>the</strong> gates.<br />

CCDs have numerous proprieties that make <strong>the</strong>m suitable for <strong>the</strong> ILC Vertex detector<br />

[42, 43]. Their pixels can be very small; in <strong>the</strong> proposed CPC <strong>the</strong> size is 20 µm × 20 µm, but<br />

much smaller sizes have been achieved. They can also be made very thin since <strong>the</strong> sensitive<br />

epitaxial layer can be as small as 20 µm, which allows for a very low material budget. In<br />

fact <strong>the</strong> CCDs can be thinned so that only <strong>the</strong> epitaxial layer is left. Finally although <strong>the</strong>y<br />

usually collect only about 80 e − for each MIP per micrometer <strong>of</strong> track (so about 1600 e − for<br />

a 20 µm path in silicon), <strong>the</strong>y also have extremely low readout noise, as low as 10 electron<br />

equivalent, <strong>and</strong> a very high charge transfer efficiency.<br />

As a downside CCDs are vulnerable to radiation damage which tends to generate charge<br />

traps <strong>and</strong> reduce <strong>the</strong> charge transfer efficiency. Additionally electrical connections <strong>of</strong> a CCD<br />

can be problematic due to <strong>the</strong> difficulty to drive <strong>the</strong> large gate capacitance <strong>of</strong> <strong>the</strong> imaging<br />

area [44]. This leads to <strong>the</strong> need to supply large enough clock currents to <strong>the</strong> detector. The<br />

need to have large storage capacitors nearby to support <strong>the</strong> peak current may substantially


Chapter 4. The Column Parallel CCD as an ILC Vertex Detector Technology41<br />

(a)<br />

(b)<br />

Figure 4.2: Readout <strong>of</strong> a) st<strong>and</strong>ard CCD; readout time = N × M/f b) LCFI CPC; readout<br />

time = N/f, where f is <strong>the</strong> clock frequency. The factor <strong>of</strong> two derives from <strong>the</strong> fact that<br />

two phase CCDs are considered each pixel is hence composed <strong>of</strong> 2 separate gates.<br />

increase <strong>the</strong> material budget. Since <strong>the</strong> CCDs are operated at low temperatures (-10/-50 ◦ C)<br />

in order to minimize dark current <strong>and</strong> to improve charge transfer after radiation damage,<br />

an efficient cooling system is also required. It is <strong>the</strong>refore essential to underst<strong>and</strong> <strong>the</strong> clock<br />

distribution <strong>and</strong> to attempt to optimize it. The power stored in <strong>the</strong> form <strong>of</strong> capacitance<br />

(<strong>and</strong> subsequently dissipated by <strong>the</strong> resistive components in <strong>the</strong> form <strong>of</strong> heat) in a two phase<br />

CCD imaging area is [41]:<br />

P = 2C PIX ∆V 2<br />

IGf c N PIX (4.1)<br />

where C PIX is <strong>the</strong> total capacitance <strong>of</strong> each pixel, ∆V IG is <strong>the</strong> clock voltage amplitude, f c<br />

is <strong>the</strong> clock frequency <strong>and</strong> N PIX is <strong>the</strong> number <strong>of</strong> pixels present. Most <strong>of</strong> <strong>the</strong>se parameters<br />

are however constrained by <strong>the</strong> requirements <strong>of</strong> <strong>the</strong> SiD <strong>and</strong> <strong>the</strong> ILC design. The size <strong>and</strong><br />

hence <strong>the</strong> number <strong>of</strong> pixels is defined by <strong>the</strong> required resolution. The frequency <strong>of</strong> <strong>the</strong> clock<br />

is defined by <strong>the</strong> beam structure <strong>and</strong> <strong>the</strong> inner layer occupancy requirement. The ocupancy<br />

<strong>of</strong> each layer <strong>of</strong> <strong>the</strong> vertex detector should in fact be small, in this case a value <strong>of</strong> < 1%<br />

is required in <strong>the</strong> innermost layer. This is done in order to ensure that track finding is<br />

not compromised. A fast readout is <strong>the</strong>refore needed; simulation have shown that <strong>the</strong> clock<br />

frequency required to drive <strong>the</strong> CPC is 50 MHz. A lot <strong>of</strong> effort has <strong>the</strong>refore been put in


4.1 CPC2 Capacitance Characterization 42<br />

reducing <strong>the</strong> driving clock voltage <strong>and</strong> <strong>the</strong> capacitance <strong>of</strong> <strong>the</strong> CPC to minimize <strong>the</strong> stored<br />

power P. As a consequence <strong>the</strong> clock amplitude was reduced to less than 2 V; substantially<br />

less than <strong>the</strong> st<strong>and</strong>ard 10-15 V used while operating a conventional CCD. Never <strong>the</strong> less<br />

given <strong>the</strong> size <strong>of</strong> ≈ 10 cm 2 for each sensor, a total capacitance <strong>of</strong> <strong>the</strong> sensor <strong>of</strong> 50 nF <strong>and</strong> a<br />

clock voltage <strong>of</strong> 2 V <strong>the</strong> total required power per sensor is still ≈ 20 Watts; corresponding<br />

to a clock current <strong>of</strong> 10 A. It was <strong>the</strong>refore reputed important to try to underst<strong>and</strong> <strong>and</strong><br />

minimize <strong>the</strong> capacitance [45].<br />

In all <strong>the</strong> fur<strong>the</strong>r sections <strong>the</strong> analysis have been performed on a second generation CPC<br />

sensor, CPC2, produced at e2V [5]. The CPC2 pixel size is 20 µm square with 750 pixels per<br />

row. Several length variations are available achieved by a stitching process. The used sensor,<br />

CPC2-40, has an overall imaging area <strong>of</strong> 1.5 cm × 5.28 cm. The pixel is two-phase clocked<br />

with <strong>the</strong> active lengths <strong>of</strong> <strong>the</strong> barrier <strong>and</strong> storage parts <strong>of</strong> each phase being correspondingly<br />

3.5 µm/6.5 µm. The inter-column isolation region is 6.0 µm wide. The silicon wafers used<br />

for CPC2 production are 500 µm thick <strong>and</strong> have a p-type epitaxial layer <strong>of</strong> about 20 µm.<br />

4.1 CPC2 Capacitance Characterization<br />

It is in this light that a characterization <strong>of</strong> <strong>the</strong> CPC2 capacitances was undertaken. The first<br />

step towards <strong>the</strong> measurement <strong>of</strong> <strong>the</strong> capacitance is to create a simple model <strong>of</strong> <strong>the</strong> CCD<br />

<strong>and</strong> <strong>the</strong>refore also distinguish all <strong>the</strong> capacitances embedded in <strong>the</strong> C PIX term (equation<br />

4.1). For this purpose we need to underst<strong>and</strong> <strong>the</strong> behaviour <strong>of</strong> <strong>the</strong> CCD when a gate (V G )<br />

<strong>and</strong>/or a reference voltage (V Ref ) are applied. V Ref <strong>and</strong> V G are defined as voltage across<br />

<strong>the</strong> p-n junction <strong>and</strong> a voltage differential between <strong>the</strong> gate <strong>and</strong> <strong>the</strong> substrate. In particular<br />

<strong>the</strong>re is a need <strong>of</strong> underst<strong>and</strong>ing <strong>the</strong> behaviour <strong>of</strong> <strong>the</strong> depletion regions <strong>and</strong> <strong>the</strong>refore <strong>of</strong> <strong>the</strong><br />

capacitances associated. From <strong>the</strong>oretical considerations we expect that an increase <strong>of</strong> <strong>the</strong><br />

reference voltage will increase <strong>the</strong> junction depletion. The application <strong>of</strong> a reference voltage<br />

effectively reverse biases <strong>the</strong> p-n junction. Similarly an increase in <strong>the</strong> gate voltage will<br />

increase <strong>the</strong> gate depletion region <strong>and</strong> shift <strong>the</strong> depletion <strong>of</strong> <strong>the</strong> p-n junction away from <strong>the</strong>


4.1 CPC2 Capacitance Characterization 43<br />

gate region. Because <strong>of</strong> this shift <strong>the</strong> total depletion distance will be larger at saturation<br />

when V G is larger. A graphical representation <strong>of</strong> this effect can be seen in fig.4.3. Saturation<br />

is considered as <strong>the</strong> state where <strong>the</strong> gate <strong>and</strong> <strong>the</strong> p-n junction depletion regions merge; hence<br />

<strong>the</strong> buried channel is completely depleted <strong>and</strong> any increase <strong>of</strong> <strong>the</strong> reference voltage will have<br />

no fur<strong>the</strong>r effect[41].<br />

Figure 4.3: Graphical representation <strong>of</strong> a CCD, in particular interesting for <strong>the</strong> purpose<br />

<strong>of</strong> this analysis are <strong>the</strong> p-n junction depletion regions <strong>and</strong> <strong>the</strong> gate depletion regions. The<br />

picture presents a CCD with both a V Ref <strong>and</strong> a V G being applied. Note that <strong>the</strong> gate<br />

depletion region is different between <strong>the</strong> gates labelled OTG <strong>and</strong> OSW. This is due to an<br />

intergate voltage differential being applied as when <strong>the</strong> CCD is being run.<br />

Finally it must be remarked that in order for a potential well to be created under <strong>the</strong> CCD<br />

gates <strong>the</strong> device must be in a saturated regime. It is now necessary to make a capacitance<br />

model <strong>of</strong> each pixel 4.4(a) <strong>and</strong> <strong>of</strong> <strong>the</strong> whole device 4.4(b).<br />

4.1.1 Substrate Capacitance<br />

The simplest measurement that can be performed is <strong>the</strong> determinations <strong>of</strong> <strong>the</strong> substrate<br />

capacitance for different V Ref <strong>and</strong> V G . By shorting points A <strong>and</strong> B <strong>and</strong> <strong>the</strong>n applying a<br />

potential with respect to point C all <strong>the</strong> C S are in parallel <strong>and</strong> each pixel can be hence<br />

treated independently; see fig. 4.4(b). The results <strong>of</strong> this measurement can be seen in fig.<br />

4.5. All <strong>the</strong> results presented in this section have been retrieved by using <strong>the</strong> Hewlett-


4.1 CPC2 Capacitance Characterization 44<br />

V−A<br />

G1<br />

C−IG<br />

G2<br />

V−A<br />

V−B<br />

G1 C−IG G2 C−IG G3 C−IG G4 C−IG G5<br />

V−B<br />

C−S<br />

C−S<br />

C−S<br />

C−S<br />

C−S<br />

C−S<br />

C−S<br />

V−C<br />

V−C<br />

(a)<br />

(b)<br />

Figure 4.4: Circuit model <strong>of</strong> a) one pixel, b) series <strong>of</strong> interconnected pixels or <strong>the</strong> CCD<br />

imaging area. GN is <strong>the</strong> gate number N, C-IG is <strong>the</strong> intergate capacitance, C-S is <strong>the</strong><br />

substrate capacitance, V is <strong>the</strong> <strong>the</strong> potential at different points in <strong>the</strong> circuit.<br />

Packard LCR 4284A. Various different frequencies <strong>and</strong> amplitudes for <strong>the</strong> AC current have<br />

been used in <strong>the</strong> capacitance measurement, no substantial dependence on ei<strong>the</strong>r <strong>of</strong> <strong>the</strong>se<br />

parameters was found. The AC frequency at which <strong>the</strong> measurement is performed has been<br />

varied between 5 kHz <strong>and</strong> 80 kHz <strong>and</strong> <strong>the</strong> measurement sinusoidal voltage has been varied<br />

between RMS <strong>of</strong> 0.2 V <strong>and</strong> 1 V. In this <strong>and</strong> <strong>the</strong> following sections all V Ref <strong>and</strong> V G are to be<br />

interpreted as a DC voltage.<br />

1/Csub (1/nF)<br />

0.11<br />

0.10<br />

0.09<br />

0.08<br />

0V Vgate<br />

1V Vgate<br />

2V Vgate<br />

3V Vgate<br />

4V Vgate<br />

5V Vgate<br />

6V Vgate<br />

7V Vgate<br />

8V Vgate<br />

9V Vgate<br />

10V Vgate<br />

11V Vgate<br />

12V Vgate<br />

13V Vgate<br />

14V Vgate<br />

15V Vgate<br />

0.07<br />

6 8 10 12 14 16<br />

Vref (V)<br />

Figure 4.5: Total substrate capacitance C Sub = N PIX × 2 × C S plotted as 1/C Sub (since this<br />

is a value proportional to <strong>the</strong> depletion region x d ) <strong>of</strong> <strong>the</strong> CPC2 for different values <strong>of</strong> V Ref<br />

<strong>and</strong> V G . The graph is presented without uncertainties for clarity purposes. However all C Sub<br />

measurements should be interpreted as having an uncertainty <strong>of</strong> ± 0.5 nF<br />

The results are consistent with <strong>the</strong> expectations regarding <strong>the</strong> behaviour <strong>of</strong> <strong>the</strong> depletion<br />

presented in <strong>the</strong> previous paragraphs. In fact under <strong>the</strong> assumption that <strong>the</strong> C Sub is an<br />

infinite plate capacitor <strong>the</strong> depletion distance is inversely proportional to C Sub . It is also<br />

possible to conclude that <strong>the</strong> minimum reference voltage at which <strong>the</strong> CPC2 can be operated


4.1 CPC2 Capacitance Characterization 45<br />

is larger than 10 V. It is only at this potential that <strong>the</strong> buried channel is fully depleted as<br />

seen in fig. 4.5. The V Ref value at which <strong>the</strong> depletion occurs has only a weak dependence<br />

on V G unless this potential is comparable to V Ref where <strong>the</strong> dependence is accentuated. The<br />

relationship can be interpreted as <strong>the</strong> previously described shift <strong>of</strong> <strong>the</strong> p-n junction depletion<br />

region. The capacitance dependence on <strong>the</strong> V G once <strong>the</strong> buried channel is depleted is clearly<br />

visible, in particular for larger V G . Before moving to <strong>the</strong> next topic it must be noted that<br />

this is not <strong>the</strong> regime in which <strong>the</strong> CPC2 is expected to be operated. The running <strong>of</strong> <strong>the</strong><br />

CPC2 is performed with a substantially lower gate potential, about 2 V, <strong>and</strong> <strong>the</strong> buried<br />

channel completely depleted.<br />

By assuming that <strong>the</strong> substrate capacitance behaves as a uniform infinite plate capacitor<br />

<strong>the</strong> thickness <strong>of</strong> <strong>the</strong> depletion region can also be evaluated: x d = ǫ r ǫ 0 A/C Sub , where A is <strong>the</strong><br />

imaging area <strong>and</strong> ǫ r is <strong>the</strong> dielectric constant <strong>of</strong> silicon. It can be <strong>the</strong>refore seen that when<br />

V G =0 V x d ≈ 6 µm while by applying a gate voltage <strong>of</strong> V G =10 V due to <strong>the</strong> extension <strong>of</strong> <strong>the</strong><br />

depletion region x d changes to 8 µm, rising up to 9 µm at higher gate voltages. This value<br />

is substantially below <strong>the</strong> stated thickness <strong>of</strong> <strong>the</strong> epitaxial layer <strong>of</strong> ≈ 20 µm which would<br />

imply that not all <strong>the</strong> epitaxial layer is depleted. A different interpretation <strong>of</strong> <strong>the</strong> result<br />

can also be made. In fact almost half <strong>of</strong> <strong>the</strong> area <strong>of</strong> <strong>the</strong> CPC has implants serving ei<strong>the</strong>r<br />

as channel stops or as barriers enabling <strong>the</strong> two-phase operation. Because <strong>of</strong> <strong>the</strong> different<br />

doping concentration this areas will not deplete as quickly, if at all, as <strong>the</strong> remaining part<br />

<strong>of</strong> <strong>the</strong> CPC. The estimation <strong>of</strong> <strong>the</strong> capacitance <strong>of</strong> <strong>the</strong>se elements is difficult as it depends<br />

strongly on <strong>the</strong>ir geometry <strong>and</strong> on assumptions on whe<strong>the</strong>r <strong>the</strong>y are floating or grounded.<br />

However it is reasonably expected that C Sub will be at least partially (if not largely) driven<br />

by <strong>the</strong>se areas. As a consequence <strong>the</strong> o<strong>the</strong>r regions may have an x d that is substantially<br />

larger than 9 µm.<br />

4.1.2 Intergate Capacitance<br />

Once <strong>the</strong> substrate capacitance has been understood also <strong>the</strong> intergate capacitance has to<br />

be measured. This is however a more challenging task as each single pixel cannot be treated


4.1 CPC2 Capacitance Characterization 46<br />

independently. It is now important to reason in terms <strong>of</strong> voltages. Considering an infinite<br />

number <strong>of</strong> pixels, see fig. 4.4(b), a fair assumption given <strong>the</strong> considerable size <strong>of</strong> <strong>the</strong> CPC2,<br />

<strong>the</strong> AC voltage amplitude between points A <strong>and</strong> C is half <strong>of</strong> <strong>the</strong> amplitude between <strong>the</strong><br />

points A <strong>and</strong> B: ∆V A−C = ∆V B−C = 1/2∆V A−B . The total energy <strong>of</strong> <strong>the</strong> circuit is <strong>the</strong>refore:<br />

{<br />

E T = 1 2 ∆V A−BC 2 Mes = N PIX × ∆VA−BC 2 IG +<br />

(<br />

∆VA−B<br />

2<br />

) 2<br />

C S<br />

}<br />

(4.2)<br />

This allows <strong>the</strong> evaluation <strong>of</strong> <strong>the</strong> intergate capacitance as:<br />

C Mes = N PIX ×<br />

(<br />

2 × C IG + 1 )<br />

2 C S<br />

(4.3)<br />

Which is exactly equivalent to considering <strong>the</strong> CPC as being simply composed <strong>of</strong> N independent<br />

pixels, fig. 4.4(a), but with a doubled intergate capacitance component. This is<br />

due to <strong>the</strong> fact that between each pixel <strong>the</strong>re is an additional gate <strong>and</strong> hence an additional<br />

C IG in parallel. In terms <strong>of</strong> measurable quantities <strong>the</strong> total contribution <strong>of</strong> <strong>the</strong> intergate<br />

capacitance C IG,Tot to <strong>the</strong> CPC2 is:<br />

C IG,Tot = C Mes − 1 4 C Sub (4.4)<br />

where C Mes was measured by applying a sinusoidal potential with RMS Voltage <strong>of</strong> 1V between<br />

V A <strong>and</strong> V B ; <strong>the</strong> V Ref was instead varied, but always kept above 10V where <strong>the</strong> buried<br />

channel is completely depleted. In this region <strong>the</strong>re should be almost no dependence <strong>of</strong> C Mes<br />

on V Ref . In fact to first order C IG does not depend on V Ref ; any dependence derives only<br />

from <strong>the</strong> edge effect caused by <strong>the</strong> change in <strong>the</strong> depletion distance below <strong>the</strong> gates, which<br />

at V Ref = 10 is essentially fixed. The measurements performed are summarized in table 4.1;<br />

all measurements having an error <strong>of</strong> 0.5nF.<br />

Intergate Capacitance<br />

V Ref 11 V 12 V 13 V 14 V 15 V<br />

C Mes 25.7 nF 24.9 nF 24.7 nF 24.6 nF 24.5 nF<br />

Table 4.1: Measured intergate capacitance at different V Ref levels.


4.1 CPC2 Capacitance Characterization 47<br />

A graph <strong>of</strong> <strong>the</strong> capacitance measurements after <strong>the</strong>y have been renormalized with respect<br />

to <strong>the</strong> gate overlap length can be seen in in fig. 4.8. Clearly some dependence on V Ref can<br />

be seen, but this is <strong>of</strong> <strong>the</strong> order <strong>of</strong> a percent once <strong>the</strong> 11V point is discounted.<br />

4.1.3 Finite Element Model <strong>of</strong> CPC2 Capacitance<br />

In order to confirm that we correctly underst<strong>and</strong> <strong>the</strong> capacitance <strong>of</strong> <strong>the</strong> CPC2 imaging area<br />

a finite element model <strong>of</strong> a single pixel was generated (see fig. 4.6) inside <strong>the</strong> Finite Element<br />

Method Magnetics (FEMM) s<strong>of</strong>tware [46]. The C IG <strong>and</strong> C S capacitances have <strong>the</strong>n been<br />

estimated <strong>and</strong> compared to <strong>the</strong> ones previously measured.<br />

Figure 4.6: Geometrical model used in <strong>the</strong> FEMM s<strong>of</strong>tware to represent a single CPC2 pixel.<br />

The presented structure is for <strong>the</strong> C IG calculations. In <strong>the</strong> calculations for C S <strong>the</strong> potential<br />

is applied across <strong>the</strong> depleted region <strong>and</strong> both gates are at <strong>the</strong> same voltage.<br />

To a first approximation <strong>the</strong> modelling <strong>of</strong> <strong>the</strong> intergate capacitance is simple; all one<br />

needs to know is <strong>the</strong> specifics <strong>of</strong> <strong>the</strong> CPC2 geometry <strong>and</strong> assume that it is operated in a<br />

regime where <strong>the</strong> buried channel is fully depleted. Differently certain assumptions must be<br />

made for <strong>the</strong> C S modelling. It is in fact impossible to know <strong>the</strong> depletion depths under<br />

<strong>the</strong> gates directly. One measurement point has <strong>the</strong>refore been used to determine <strong>the</strong> total<br />

depletion depth at saturation; under <strong>the</strong> assumption <strong>of</strong> an infinite parallel plate capacitor.<br />

Subsequently all <strong>the</strong> non saturated depletion depths have been calculated by using <strong>the</strong> relationship<br />

x d ∝ √ V Ref ,where x d is <strong>the</strong> depleted depth. In order to compare <strong>the</strong> results both<br />

<strong>the</strong> measured <strong>and</strong> <strong>the</strong> modelled values have been converted into units <strong>of</strong> nF/cm 2 instead <strong>of</strong><br />

being expressed as capacitances <strong>of</strong> <strong>the</strong> whole sensor. As previously mentioned <strong>the</strong> imaging<br />

area <strong>of</strong> <strong>the</strong> CPC2-40 sensor is 1.5 cm × 5.28 cm; so all that this conversion implies for <strong>the</strong>


4.1 CPC2 Capacitance Characterization 48<br />

measured data is <strong>the</strong> division by <strong>the</strong> constant value <strong>of</strong> 7.92 cm 2 . If one would <strong>the</strong>n like to<br />

quote <strong>the</strong> capacitance in terms <strong>of</strong> <strong>the</strong> length <strong>of</strong> <strong>the</strong> gate overlaps one needs to know that<br />

<strong>the</strong>re are 10 m <strong>of</strong> gate overlap in each square centimetre <strong>of</strong> imaging area.<br />

Csub(nF/cm^2)<br />

3.4 FEMM Model<br />

3.2<br />

Data<br />

3.0<br />

2.8<br />

2.6<br />

2.4<br />

2.2<br />

2.0<br />

1.8<br />

1.6<br />

1.4<br />

1.2<br />

1.0<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0.0<br />

0 2 4 6 8 10 12 14 16<br />

Vref (V)<br />

Figure 4.7: Modelled (red curve) <strong>and</strong> measured (black data points) substrate capacitance<br />

normalized to an area <strong>of</strong> 1 cm 2 plotted vs. <strong>the</strong> applied V Ref .<br />

It can be immediately seen in fig. 4.7 that <strong>the</strong> modelled <strong>and</strong> measured C S agree very well;<br />

<strong>the</strong> only exception being at V Ref < 3V . Because <strong>of</strong> <strong>the</strong> small depletion region this area is <strong>the</strong><br />

most vulnerable to idiosyncratic features <strong>of</strong> <strong>the</strong> CPC2 that have not been included in <strong>the</strong><br />

simple geometrical model. It <strong>the</strong>refore seems to validate <strong>the</strong> previously explained <strong>the</strong>ory that<br />

<strong>the</strong> actual depletion depth is deeper than <strong>the</strong> one calculated <strong>and</strong> that a partial capacitance<br />

contribution does derive from <strong>the</strong> fixed capacitance to <strong>the</strong>se un-modelled features. The<br />

regime at V Ref < 3 V is <strong>the</strong> most difficult to model. On <strong>the</strong> o<strong>the</strong>r h<strong>and</strong> it is also <strong>the</strong> one<br />

least interesting for <strong>the</strong> running <strong>of</strong> <strong>the</strong> CPC2.<br />

Slightly different is <strong>the</strong> situation for <strong>the</strong> intergate capacitance. Although <strong>the</strong> model <strong>and</strong><br />

<strong>the</strong> data agree in predicting an essentially flat capacitance in <strong>the</strong> depleted buried channel<br />

regime, <strong>the</strong> only one here analysed, <strong>the</strong>re is a 20%-25% disagreement in <strong>the</strong> actual value (fig.<br />

4.8). The model predicts a capacitance <strong>of</strong> 2.24 nF/cm 2 while <strong>the</strong> calculated C IG is <strong>of</strong> 2.74<br />

± 0.08 nF/cm 2 .<br />

It is <strong>the</strong>n possible to decompose <strong>the</strong> C PIX capacitance as presented in equation 4.1. Since<br />

<strong>the</strong> voltage is driven across <strong>the</strong> gates, C PIX will follow <strong>the</strong> same circuit diagram described


4.1 CPC2 Capacitance Characterization 49<br />

3.0<br />

FEMM Model<br />

Data<br />

2.5<br />

2.0<br />

Cig(nF/cm^2)<br />

1.5<br />

1.0<br />

0.5<br />

0.0<br />

11 12 13 14 15<br />

Vref (V)<br />

Figure 4.8: Comparison <strong>of</strong> <strong>the</strong> modelled (red curve) <strong>and</strong> measured (black data points) intergate<br />

capacitance normalized to an area <strong>of</strong> 1 cm 2 plotted vs. <strong>the</strong> applied V Ref .<br />

in fig. 4.4(b). Hence for <strong>the</strong> purposes <strong>of</strong> driving <strong>the</strong> CPC2 C PIX is equal to C Mes (once<br />

both have been normalized to <strong>the</strong> same area) as defined in equation 4.1 <strong>and</strong> equation 4.2<br />

respectively. Additionally C IG as expressed in eq. 4.3 can be decomposed into <strong>the</strong> part<br />

modelled with <strong>the</strong> FEMM s<strong>of</strong>tware <strong>and</strong> a part that we hypo<strong>the</strong>size being due to <strong>the</strong> presence<br />

<strong>of</strong> additional CPC2 features. Once this is done it is easy to see how C PIX = 3.11 ± 0.05<br />

nF/cm 2 <strong>of</strong> which 0.39 ± 0.02 nF/cm 2 derives from <strong>the</strong> C S , 2.24 ± 0.08 nF/cm 2 derives from<br />

C IG <strong>and</strong> was modelled in FEMM, 0.48 ± 0.09 nF/cm 2 is <strong>the</strong> C IG contribution not modelled<br />

by FEMM <strong>and</strong> most likely associated to <strong>the</strong> coupling <strong>of</strong> <strong>the</strong> gate to o<strong>the</strong>r elements <strong>of</strong> <strong>the</strong><br />

CPC that have not been modelled. These are <strong>the</strong> channel stops that occupy roughly a third<br />

<strong>of</strong> <strong>the</strong> area <strong>and</strong> <strong>the</strong> ion implants needed for <strong>the</strong> two phase running, but also <strong>the</strong> shape <strong>of</strong><br />

<strong>the</strong> intergate region. In fact while we have modelled <strong>the</strong> gates as regular parallelepipeds, in<br />

reality <strong>the</strong>y always have irregular edges that are <strong>of</strong>ten substantially higher than <strong>the</strong> rest <strong>of</strong><br />

<strong>the</strong> gate. A more realistic model can be seen in 4.9. No attempt was performed to model<br />

this effect because <strong>the</strong> precise geometry <strong>of</strong> both gates would have to be known.<br />

The effect <strong>of</strong> channel stops <strong>and</strong> ion implants is not easily evaluated. Their presence<br />

reduces <strong>the</strong> edge effect <strong>of</strong> <strong>the</strong> intergate capacitance <strong>and</strong> hence has a, relatively small, negative<br />

effect on <strong>the</strong> intergate capacitance. However <strong>the</strong>ir coupling to each o<strong>the</strong>r might also add an<br />

additional capacitance in parallel to <strong>the</strong> intergate capacitance <strong>and</strong> hence add to <strong>the</strong> total<br />

measured capacitance. The effect due to <strong>the</strong> presence <strong>of</strong> irregular gate edges, expected to


4.2 CPC2 Capacitance Test Structures 50<br />

Figure 4.9: Sketch <strong>of</strong> realistic intergate region: pink is polysilicon, light pink is depleted<br />

silicon, red is silicon oxide <strong>and</strong> yellow is nitride.<br />

be significantly larger, would instead increase <strong>the</strong> area <strong>and</strong> <strong>the</strong>refore proportionally increase<br />

<strong>the</strong> intergate capacitance. Already a height variation <strong>of</strong> ≈ 0.1 µ would explain <strong>the</strong> seen<br />

discrepancy. The discrepancy can also depend on <strong>the</strong> way that <strong>the</strong> C PIX was modelled or<br />

on external parasitic capacitances.<br />

In any event <strong>the</strong> most important conclusion <strong>of</strong> this study is that <strong>the</strong> capacitance is<br />

mainly driven by <strong>the</strong> intergate contribution. Any effort to substantially decrease <strong>the</strong> power<br />

consumption <strong>of</strong> <strong>the</strong> CPC will <strong>the</strong>refore start by addressing this area. The FEMM s<strong>of</strong>tware<br />

was used to do just that. Three separate parameters to which <strong>the</strong> C IG is expected to be<br />

most sensitive have been identified <strong>and</strong> <strong>the</strong> behaviour <strong>of</strong> <strong>the</strong> capacitance while <strong>the</strong>ir values<br />

are varied was analysed. It has <strong>the</strong>refore been seen that <strong>the</strong> thickness <strong>of</strong> <strong>the</strong> nitride layer<br />

has only a small influence on <strong>the</strong> capacitance (fig. 4.10(c)). A bit more significant is <strong>the</strong><br />

capacitance relation with <strong>the</strong> thickness <strong>of</strong> <strong>the</strong> oxide (fig. 4.10(b)). Ultimately however <strong>the</strong><br />

C IG depends strongly only on <strong>the</strong> gap between subsequent gates; fig. 4.10(a).<br />

4.2 CPC2 Capacitance Test Structures<br />

Once <strong>the</strong> FEMM modelling results have been analysed it was decided to produce small test<br />

structures, as seen in fig. 4.11, in order to study fur<strong>the</strong>r <strong>the</strong> dependence <strong>of</strong> <strong>the</strong> intergate


4.2 CPC2 Capacitance Test Structures 51<br />

Cig(nF/cm^2)<br />

3.6<br />

3.4<br />

3.2<br />

3.0<br />

2.8<br />

2.6<br />

2.4<br />

2.2<br />

2.0<br />

1.8<br />

0.0 0.1 0.2 0.3 0.4 0.5 0.6<br />

Gap between neighbouring gates ( m)<br />

FEMM Model<br />

Currently used value<br />

Cig(nF/cm^2)<br />

2.35<br />

2.30<br />

2.25<br />

2.20<br />

2.15<br />

2.10<br />

FEMM Model<br />

Currently used value<br />

2.05<br />

0.00 0.05 0.10 0.15 0.20 0.25<br />

Thickness <strong>of</strong> oxide layer ( m)<br />

(a)<br />

(b)<br />

2.27<br />

FEMM Model<br />

Currently used value<br />

2.26<br />

Cig(nF/cm^2)<br />

2.25<br />

2.24<br />

2.23<br />

0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 0.16 0.18 0.20<br />

Thickness <strong>of</strong> nitride layer ( m)<br />

(c)<br />

Figure 4.10: Study <strong>of</strong> <strong>the</strong> intergate capacitance a) FEMM study <strong>of</strong> C IG vs. gap between two<br />

adjacent gates. b) FEMM study <strong>of</strong> C IG vs. thickness <strong>of</strong> <strong>the</strong> oxide layer present under <strong>the</strong><br />

gates. c) FEMM study <strong>of</strong> C IG vs. thickness <strong>of</strong> <strong>the</strong> nitride layer present under <strong>the</strong> gates. In<br />

all figures <strong>the</strong> currently used parameter values are highlighted by a black data point.<br />

capacitance on <strong>the</strong> gap/overlap between <strong>the</strong> gates. Ideally <strong>the</strong>se test structures should<br />

reproduce <strong>the</strong> CPC2 as closely as possible <strong>and</strong> <strong>the</strong>refore have a buried channel. However due<br />

to some production issues at e2V this has not been <strong>the</strong> case. The delivered test structures<br />

are in fact an array <strong>of</strong> gates on field oxide <strong>and</strong> highly doped p + silicon. The substrate<br />

<strong>the</strong>refore cannot be depleted. Therefore in all <strong>of</strong> <strong>the</strong> following measurements <strong>the</strong> edge effect<br />

<strong>of</strong> <strong>the</strong> intergate capacitance was underestimated since <strong>the</strong>re was no layer <strong>of</strong> depleted silicon<br />

present underneath <strong>the</strong> gates. Never <strong>the</strong> less <strong>the</strong>re is a lot <strong>of</strong> useful information that can<br />

be deduced from studying <strong>the</strong> test structures. The test structures that have been produced<br />

have 4 different polysilicon layers, P0, P1, P2 <strong>and</strong> P3 acting as conducting material. In<br />

<strong>the</strong> production process flow first <strong>the</strong> P0 layer will be placed on <strong>the</strong> oxide, followed by a


4.2 CPC2 Capacitance Test Structures 52<br />

layer <strong>of</strong> oxide, <strong>and</strong> <strong>the</strong>n P1 will be placed followed by a layer <strong>of</strong> oxide, <strong>and</strong> so on. This<br />

means that different polysilicon layers are separated by different thicknesses <strong>of</strong> oxide. It<br />

also means that upper <strong>and</strong> lower gates, as in fig. 4.12, are produced from different layers<br />

<strong>of</strong> polysilicon. A thin layer <strong>of</strong> oxide is applied not only below but also around <strong>the</strong> gates<br />

produced with <strong>the</strong> lowest layer <strong>of</strong> polysilicon (<strong>the</strong> overlapped gates). This oxide layer was<br />

estimated to be approximately 0.5 µm. In all <strong>the</strong> future work <strong>the</strong> capacitance structures<br />

will be labelled according to <strong>the</strong> gate overlap (a negative value representing a gap) as in fig.<br />

4.12(a), fig.4.12(b) <strong>and</strong> <strong>the</strong> layers <strong>of</strong> polysilicon used for creating <strong>the</strong> gates.<br />

Figure 4.11: Picture <strong>of</strong> a test structure taken while a measurement is being performed on<br />

<strong>the</strong> probe station.<br />

(a)<br />

(b)<br />

Figure 4.12: Schematic representation <strong>of</strong> <strong>the</strong> test structures for: a) negative overlap values;<br />

b) positive overlap values.<br />

In <strong>the</strong> first instance <strong>the</strong> intergate capacitance was to be analysed. The same ma<strong>the</strong>matical<br />

<strong>and</strong> circuit considerations as <strong>the</strong> ones presented in <strong>the</strong> previous sections were used. It has<br />

to be however noted that <strong>the</strong> approximation that <strong>the</strong> substrate capacitances are <strong>the</strong> same


4.2 CPC2 Capacitance Test Structures 53<br />

under both gates has been dropped. The assumption that <strong>the</strong> CPC2 can be modelled as<br />

N independent pixels with double intergate capacity has however still been used, as it was<br />

explained when interpreting eq. 4.2. The measurement performed <strong>and</strong> <strong>the</strong> equation relating<br />

<strong>the</strong>m to <strong>the</strong> individual capacitance components are <strong>the</strong>refore:<br />

C A−B = N PIX ×<br />

C A−C = N PIX ×<br />

C B−C = N PIX ×<br />

(<br />

(<br />

(<br />

2 × C IG +<br />

C S1 +<br />

C S2 +<br />

)<br />

1<br />

1<br />

C S1<br />

+ 1<br />

C S2<br />

)<br />

1<br />

1<br />

2×C IG<br />

+ 1<br />

C S2<br />

)<br />

1<br />

1<br />

2×C IG<br />

+ 1<br />

C S1<br />

(4.5)<br />

where C A−B is <strong>the</strong> measurement between voltage points A <strong>and</strong> B in fig. 4.4(a) <strong>and</strong> C S1<br />

<strong>and</strong> C S2 are <strong>the</strong> substrate capacitance below gates one <strong>and</strong> two. While using this method it<br />

was discovered that by placing <strong>the</strong> test structure directly onto <strong>the</strong> probe station <strong>the</strong> results<br />

for C IG become negative due to <strong>the</strong> parasitic coupling to <strong>the</strong> metal probe base which has<br />

not been included in <strong>the</strong> model. In order to confirm this different thicknesses <strong>of</strong> insulating<br />

material have been inserted between <strong>the</strong> test structure <strong>and</strong> <strong>the</strong> base <strong>of</strong> <strong>the</strong> probe station.<br />

Given <strong>the</strong> formula: C = ǫ r ǫ 0 A/d this should substantially decrease <strong>the</strong> parasitic capacitance<br />

as confirmed by fig. 4.13. The effect <strong>of</strong> <strong>the</strong> coupling is negligible for large separations<br />

between <strong>the</strong> base <strong>and</strong> <strong>the</strong> test structure; corresponding to small values <strong>of</strong> ǫ r /d. All fur<strong>the</strong>r<br />

measurements have been conducted using a large enough separation.<br />

In order to underst<strong>and</strong> fur<strong>the</strong>r <strong>the</strong> capacitance behaviour <strong>of</strong> <strong>the</strong> test structures <strong>the</strong> substrate<br />

capacitance was analysed (fig. 4.14). As it is clear from <strong>the</strong> results <strong>the</strong> lower <strong>and</strong><br />

upper gates have a significantly different substrate capacitance. It has subsequently been<br />

found that <strong>the</strong> reason is due to different gate areas. With <strong>the</strong> increase or decrease <strong>of</strong> <strong>the</strong><br />

overlap <strong>the</strong> gate area <strong>of</strong> <strong>the</strong> lower gates is kept constant while <strong>the</strong> gate area <strong>of</strong> <strong>the</strong> upper<br />

gates is increased/decreased. It is hence expected that, as it can be seen in <strong>the</strong> graph, <strong>the</strong><br />

upper gates will have lower capacitances at negative overlaps. Because <strong>of</strong> <strong>the</strong> increase in<br />

oxide thickness in between <strong>the</strong> gates one would expect that even at similar gate areas <strong>the</strong><br />

upper gates should still have slightly lower capacitances. This effect is however obscured by


4.2 CPC2 Capacitance Test Structures 54<br />

0<br />

Cig<br />

-2<br />

-4<br />

Cig (pF)<br />

-6<br />

-8<br />

-10<br />

-12<br />

0.1 1 10<br />

epsilon/distance(1/mm)<br />

Figure 4.13: The dependence <strong>of</strong> <strong>the</strong> C IG capacitance measurement on <strong>the</strong> insulation from<br />

<strong>the</strong> test probe st<strong>and</strong>.<br />

16<br />

15<br />

14<br />

Csub (pF)<br />

13<br />

12<br />

Total substrate capacitance lower gates<br />

Total substrate capacitance upper gates<br />

11<br />

-2 -1 0 1 2 3<br />

Overlap ( m)<br />

Figure 4.14: The substrate capacitances ot <strong>the</strong> lower gates <strong>and</strong> upper gates for <strong>the</strong> p3onp0<br />

structures.<br />

<strong>the</strong> fact that <strong>the</strong> oxide layer between <strong>the</strong> gates counts towards <strong>the</strong> area <strong>of</strong> <strong>the</strong> lower gates;<br />

hence at nominally equal areas <strong>the</strong> lower gates are actually slightly smaller. Assuming no effect<br />

from <strong>the</strong> different oxide thickness it is now possible to roughly calculate <strong>the</strong> thickness <strong>of</strong><br />

<strong>the</strong> oxide coating <strong>of</strong> <strong>the</strong> lower gates. This is, to a rough approximation, <strong>the</strong> point where <strong>the</strong><br />

curves <strong>of</strong> <strong>the</strong> lower <strong>and</strong> <strong>the</strong> upper gates intersect. The thickness has hence been estimated<br />

to be approximately 0.5 µm in agreement with <strong>the</strong> factory data sheet.<br />

In order to better underst<strong>and</strong> <strong>the</strong> influence <strong>of</strong> different layer selections on <strong>the</strong> substrate<br />

capacitance <strong>the</strong> value for each polysilicon combination <strong>and</strong> for each overlap was plotted for<br />

<strong>the</strong> lower gates in fig. 4.15(a) <strong>and</strong> for <strong>the</strong> upper gates in fig. 4.15(b).


4.2 CPC2 Capacitance Test Structures 55<br />

18<br />

17<br />

16<br />

P3onP0<br />

P2onP1<br />

P2onP0<br />

P3onP1<br />

P3onP2<br />

18<br />

17<br />

16<br />

P3onP0<br />

P2onP1<br />

P2onP0<br />

P3onP1<br />

P3onP2<br />

15<br />

15<br />

Csub (pF)<br />

14<br />

13<br />

Csub (pF)<br />

14<br />

13<br />

12<br />

12<br />

11<br />

11<br />

10<br />

-2 0 2 4<br />

Overlap ( m)<br />

10<br />

-2 0 2 4<br />

Overlap ( m)<br />

(a)<br />

(b)<br />

Figure 4.15: The total calculated substrate capacitances for <strong>the</strong> lower gates a) <strong>and</strong> upper<br />

gates b) in all <strong>the</strong> available test structures. All data points have uncertainties <strong>of</strong> 0.3 pF.<br />

Although <strong>the</strong> differences between <strong>the</strong> various manufacturing processes are small, between<br />

0.1 <strong>and</strong> 1.1pF, many are still significant when compared to our experimental error <strong>of</strong> 0.3 pF<br />

<strong>and</strong> an analysis is <strong>the</strong>refore possible. Starting from <strong>the</strong> lower gate (fig. 4.15(a)) it can<br />

immediately be seen how <strong>the</strong> bigger <strong>the</strong> difference between polysilicon layer numbers <strong>the</strong><br />

smaller is <strong>the</strong> substrate capacitance. Hence <strong>the</strong> P3onP0 sample is <strong>the</strong> one with <strong>the</strong> smallest<br />

capacitance. Differently <strong>the</strong> largest capacitance is obtained from <strong>the</strong> P3onP2 layer. These<br />

results suggest that <strong>the</strong> increased oxide thickness <strong>of</strong> <strong>the</strong> different production methods are<br />

only present in between <strong>the</strong> polysilicon layers. In <strong>the</strong> areas where <strong>the</strong> polysilicon is not<br />

present no additional layer <strong>of</strong> oxide seems to be added. If this was not <strong>the</strong> case C S,Lower<br />

should be minimal when <strong>the</strong> value <strong>of</strong> <strong>the</strong> lower gate is maximal.<br />

Interesting enough for <strong>the</strong> upper gates, described in fig. 4.15(a), <strong>the</strong> pattern is reversed<br />

with P3onP0 now having <strong>the</strong> highest capacitance <strong>and</strong> P3onP2 <strong>the</strong> lowest. Once again <strong>the</strong><br />

results suggest that <strong>the</strong> additional layer <strong>of</strong> oxides are deposited only in between polysilicon<br />

layers. If this were not <strong>the</strong> case <strong>the</strong> capacitance should be minimized when <strong>the</strong> value <strong>of</strong> <strong>the</strong><br />

upper gate is greatest. Given <strong>the</strong> extremely small differences between <strong>the</strong> C S <strong>of</strong> different<br />

manufacturing methods it is difficult to pinpoint <strong>the</strong> precise reason behind <strong>the</strong>m. Tiny<br />

variations <strong>of</strong> <strong>the</strong> area or <strong>the</strong> shape <strong>of</strong> <strong>the</strong> gate can indeed account for <strong>the</strong> seen results.


4.2 CPC2 Capacitance Test Structures 56<br />

Finally if one considers <strong>the</strong> sum <strong>of</strong> <strong>the</strong> capacitiance <strong>of</strong> <strong>the</strong> upper <strong>and</strong> lower gates one<br />

reaches a ra<strong>the</strong>r unsurprising conclusion: <strong>the</strong> different manufacturing processes do not substantially<br />

change <strong>the</strong> total substrate capacitances. If a pattern can be inferred it is very<br />

minimal. In fact if <strong>the</strong> capacitances <strong>of</strong> both gates are added toge<strong>the</strong>r all <strong>the</strong> points with<br />

overlap values <strong>of</strong> 0 or above are within 0.4 pF or 3% from each o<strong>the</strong>r which is actually<br />

also our experimental error. Therefore <strong>the</strong> total substrate capacitance <strong>of</strong> <strong>the</strong> test structures<br />

seems to depend almost exclusively on <strong>the</strong> total area in contact with <strong>the</strong> oxide layer present<br />

below <strong>the</strong> gates.<br />

It is now possible to perform <strong>the</strong> main task <strong>of</strong> <strong>the</strong> analysis <strong>and</strong> measure C IG . This was<br />

done with two methods: <strong>the</strong> one already presented <strong>and</strong> by shorting <strong>the</strong> substrate voltage<br />

to <strong>the</strong> virtual ground <strong>of</strong> <strong>the</strong> Hewlett-Packard LCR 4284A. In this case <strong>the</strong> capacitance<br />

across <strong>the</strong> gates, C IG , is directly measured <strong>and</strong> <strong>the</strong>re is no model dependence in <strong>the</strong> final<br />

results. The results <strong>of</strong> <strong>the</strong> exercise can be seen in fig. 4.16(a), which shows that <strong>the</strong> two<br />

methods yield similar yet slightly different capacitance values. In general <strong>the</strong> method that<br />

does not ground <strong>the</strong> substrate underestimates <strong>the</strong> total capacitance by 0.6 pF. This shift<br />

is constant <strong>and</strong> independent on <strong>the</strong> overlap. Incidentally <strong>the</strong> consistency <strong>of</strong> <strong>the</strong> results also<br />

validates <strong>the</strong> method used in performing <strong>the</strong> measurements on <strong>the</strong> actual CPC2. The most<br />

probable reason hypo<strong>the</strong>sized for this shift is in fact <strong>the</strong> presence <strong>of</strong> some residual parasitic<br />

capacitance. Because <strong>of</strong> <strong>the</strong> symmetry <strong>of</strong> <strong>the</strong> substrate calculation in eq. 4.5 we expect<br />

a similar constant shift to be present also for <strong>the</strong> previously presented substrate results.<br />

A similar correction to <strong>the</strong> absolute values <strong>of</strong> <strong>the</strong> measured capacitance should <strong>the</strong>refore<br />

be applied. This is however not <strong>the</strong> aim <strong>of</strong> <strong>the</strong> analysis, which is focused on <strong>the</strong> study<br />

<strong>of</strong> intergate capacitance <strong>and</strong> aims to underst<strong>and</strong> <strong>the</strong> substrate capacitance only as a tool<br />

for underst<strong>and</strong>ing <strong>the</strong> intergate component. The substrate capacitance is in fact a smaller<br />

component <strong>of</strong> <strong>the</strong> total load capacitance which determines <strong>the</strong> clock driving conditions.<br />

Fig. 4.16(b) presents <strong>the</strong> final results <strong>of</strong> this analysis as it shows <strong>the</strong> C IG dependence<br />

on <strong>the</strong> gate production method <strong>and</strong> on <strong>the</strong> gate overlap with <strong>the</strong> shorted substrate method<br />

being used. Both are shown to behave as expected; in fact <strong>the</strong> capacitance increases with <strong>the</strong>


4.2 CPC2 Capacitance Test Structures 57<br />

Capacitance (pF)<br />

10<br />

8<br />

6<br />

4<br />

2<br />

0<br />

Cig - With Shorted Substrate<br />

Cig - Using Calculation Method<br />

-2 -1 0 1 2<br />

Overlap ( m)<br />

Capacitance / Overlap (pF/cm)<br />

5.0<br />

4.5<br />

4.0<br />

3.5<br />

3.0<br />

2.5<br />

2.0<br />

1.5<br />

1.0<br />

0.5<br />

0.0<br />

p3onp0<br />

p3onp2<br />

p2onp0<br />

p3onp1<br />

p2onp1<br />

-2 -1 0 1 2 3<br />

Overlap ( m)<br />

(a)<br />

(b)<br />

Figure 4.16: The intergate capacitance <strong>of</strong> <strong>the</strong> test structures: a) for <strong>the</strong> p3onp0 test structures<br />

by using both <strong>the</strong> shorted substrate <strong>and</strong> <strong>the</strong> calculated method; b) only <strong>the</strong> shorted<br />

substrate method is used but <strong>the</strong> results are presented in <strong>the</strong> form <strong>of</strong> pF/cm <strong>of</strong> overlap for<br />

all <strong>the</strong> test structures. The errors in plot b) are 0.05 pF/cm for all <strong>the</strong> data points.<br />

overlap <strong>and</strong> decreases with <strong>the</strong> gap between <strong>the</strong> polysilicon layers used. Also as expected <strong>the</strong><br />

difference between <strong>the</strong> different polysilicon arrangements is minimal at small gate overlaps<br />

<strong>and</strong> it becomes substantial once <strong>the</strong>y become positive. Ideally one would like to compare<br />

<strong>the</strong>se values to <strong>the</strong> ones measured <strong>and</strong> simulated in <strong>the</strong> previous sections <strong>and</strong> because <strong>of</strong> this<br />

reason <strong>the</strong> results are quoted in pF per centimetre <strong>of</strong> gate overlap; <strong>the</strong>re are 4.8706 cm <strong>of</strong><br />

overlap in each test structure. It can be immediately seen that all <strong>the</strong> test structures predict<br />

a significantly smaller C IG for similar overlap values. The capacitance measured in <strong>the</strong> test<br />

structures is in <strong>the</strong> region <strong>of</strong> 1 pF/cm <strong>of</strong> gate overlap while in <strong>the</strong> actual CPC2 sensor<br />

<strong>the</strong> comparable value is 2.7pF/cm <strong>of</strong> gate overlap. This is not very surprizing given <strong>the</strong><br />

previously mentioned observation that <strong>the</strong> test structure geometry is substantially different<br />

from <strong>the</strong> imaging area <strong>of</strong> <strong>the</strong> CPC2. The explanation comes mainly from <strong>the</strong> mentioned<br />

reduced edge effect due to <strong>the</strong> lack <strong>of</strong> depleted substrate, which is very significant when <strong>the</strong><br />

depletion region is very small or as in this case non existent. In fact by using <strong>the</strong> FEMM<br />

s<strong>of</strong>tware one finds out that <strong>the</strong> C IG <strong>of</strong> <strong>the</strong> CPC2 without any depletion is 0.9 pF/cm <strong>of</strong> gate<br />

overlap. A result consistent with <strong>the</strong> one measured in <strong>the</strong> test capacitance structures. We<br />

have <strong>the</strong>refore been able to characterized <strong>the</strong> relative behaviour <strong>of</strong> <strong>the</strong> CPC2 capacitance<br />

when <strong>the</strong> gate overlap <strong>and</strong> <strong>the</strong> used polysilicon layers are varied.


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 58<br />

4.3 CPC2 Readout <strong>and</strong> Noise Analysis<br />

In addition to <strong>the</strong> CPC2 capacitance characterization a separate analysis has been performed<br />

in order to determine <strong>the</strong> noise performance <strong>of</strong> <strong>the</strong> sensor. The aim <strong>of</strong> this project was to<br />

develop an experimental set-up toge<strong>the</strong>r with <strong>the</strong> associated s<strong>of</strong>tware for <strong>the</strong> use <strong>of</strong> future<br />

studies <strong>and</strong> to concurrently analyse <strong>the</strong> noise sources. The initial hardware set-up used is<br />

described in fig. 4.17.<br />

Figure 4.17: Schematic representation <strong>of</strong> <strong>the</strong> experimental set-up used in <strong>the</strong> following noise<br />

analysis<br />

In this configurations <strong>the</strong> two power supplies provide <strong>the</strong> necessary bias voltages for <strong>the</strong><br />

CPC2 to run; a schematic <strong>of</strong> <strong>the</strong> role <strong>of</strong> <strong>the</strong>se voltages as well as <strong>the</strong>ir values can be found in<br />

Appendix A. The function generator acts as an input to <strong>the</strong> BVM2 module which produces<br />

all <strong>the</strong> clock <strong>and</strong> trigger timing sequences for <strong>the</strong> CPC2 <strong>and</strong> when <strong>the</strong> ADC is in an external<br />

triggering mode also for <strong>the</strong> ADC (not drawn). A separate pulse generator is used for <strong>the</strong><br />

Reset Gate (RG) in order to provide a fast signal with a well defined timing. The RG resets<br />

<strong>the</strong> output node to <strong>the</strong> default state after every readout <strong>and</strong> has a large influence on <strong>the</strong><br />

achievable noise performance. The output signal <strong>of</strong> <strong>the</strong> CPC2 is diverted via a low-pass<br />

filter <strong>and</strong> an amplifier before reaching <strong>the</strong> ADC. The ADC is triggered <strong>and</strong> clocked from<br />

<strong>the</strong> BVM2 <strong>and</strong> it can record amplitudes for up to four columns in parallel with a maximum


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 59<br />

frequency <strong>of</strong> 100 MHz. In order to visually monitor <strong>the</strong> output <strong>the</strong> channels can also be<br />

directed to an oscilloscope. An example <strong>of</strong> <strong>the</strong> oscilloscope display can be seen in fig. 4.18.<br />

In order to prove that <strong>the</strong> sensor is capable <strong>of</strong> acting as a particle detector <strong>and</strong> also for<br />

<strong>the</strong> calibration <strong>of</strong> <strong>the</strong> ADC output units a radioactive source ( 55 Fe) was used. The iron<br />

isotope decays to manganese via <strong>the</strong> absorption <strong>of</strong> an electron in <strong>the</strong> innermost K shell. In<br />

this process an electron from <strong>the</strong> outer shells L or M falls inwards to replace <strong>the</strong> absorbed<br />

electrons <strong>and</strong> emits a photon <strong>of</strong> 5.89 keV (L shell-K α ) or 6.49 keV (M shell-K β ). The X-rays<br />

can <strong>the</strong>n travel through <strong>the</strong> silicon detector <strong>and</strong> can interact through a photoelectric process.<br />

If this happens <strong>the</strong> signal is generated via <strong>the</strong> production <strong>of</strong> electron-hole pairs. Charge is<br />

<strong>the</strong>refore collected in <strong>the</strong> well structure <strong>of</strong> <strong>the</strong> gate potential. Given that in silicon it takes<br />

3.6 eV for each created electron-positron pair <strong>the</strong> signal is equivalent to 1620 electrons for<br />

K α decays <strong>and</strong> to 1778 electrons for <strong>the</strong> seven times less frequent K β decays [41]. X-rays <strong>of</strong><br />

different energies are also possible, but much less likely. Assuming that <strong>the</strong> two peaks cannot<br />

be resolved <strong>the</strong> average iron radioactive decay is expected to produce a signal <strong>of</strong> 1640 e − in<br />

<strong>the</strong> potential wells <strong>of</strong> <strong>the</strong> CPC2. By detecting <strong>the</strong> peak <strong>of</strong> <strong>the</strong> iron decay it is hence possible<br />

to calibrate <strong>the</strong> detector <strong>and</strong> to estimate <strong>the</strong> noise <strong>of</strong> <strong>the</strong> detector in terms <strong>of</strong> electrons.<br />

Figure 4.18: Oscilloscope picture <strong>of</strong> <strong>the</strong> CPC2 readout showing a series <strong>of</strong> pixels. The picture<br />

displayed shows <strong>the</strong> status <strong>of</strong> <strong>the</strong> pixels intgrated over time, each pixel is hence sampled<br />

multiple times. The wide b<strong>and</strong>s are hence <strong>the</strong> noise <strong>of</strong> <strong>the</strong> sensor. The lines directly below<br />

<strong>the</strong>m are instead <strong>the</strong> signal from <strong>the</strong> 55 Fe.<br />

Having discussed <strong>the</strong> hardware set-up it is important to briefly comment on <strong>the</strong> readout


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 60<br />

sequencing <strong>and</strong> s<strong>of</strong>tware. It should be mentioned that <strong>the</strong> CPC2 is not read out constantly,<br />

instead an integration time t int is allowed between <strong>the</strong> sequences to accumulate events with<br />

55 Fe interactions or to integrate some leakage current. In <strong>the</strong> active readout period <strong>the</strong> CPC2<br />

is clocked at <strong>the</strong> desired frequency <strong>and</strong> <strong>the</strong> output node is reset at <strong>the</strong> beginning <strong>of</strong> every<br />

clock cycle. The control s<strong>of</strong>tware for <strong>the</strong> BVM2 Module, <strong>the</strong> biases, <strong>the</strong> low pass filter <strong>and</strong><br />

<strong>the</strong> amplifier was developed. Additionally an on-line monitoring tool that is an integral part<br />

<strong>of</strong> <strong>the</strong> ADC control program has been coded. The user interface can be seen in fig. 4.19.<br />

At <strong>the</strong> bottom <strong>of</strong> <strong>the</strong> screen <strong>the</strong> raw data collected in <strong>the</strong> current run is displayed. Here<br />

<strong>the</strong> noise peak (on <strong>the</strong> left) <strong>and</strong> <strong>the</strong> 55 Fe calibration peak (on <strong>the</strong> right) can be clearly seen.<br />

Both peaks are <strong>the</strong>n automatically fitted with a Gaussian. The parameters <strong>of</strong> both Gaussian<br />

fit are shown at <strong>the</strong> top left. These parameters are <strong>the</strong>n used to calculate <strong>the</strong> noise in terms<br />

<strong>of</strong> electrons which is displayed on <strong>the</strong> top right graph.<br />

Figure 4.19: The user interface <strong>of</strong> <strong>the</strong> on-line noise monitoring tool.<br />

The first measurements performed were aimed at analysing <strong>the</strong> dependence <strong>of</strong> <strong>the</strong> noise<br />

on <strong>the</strong> time over which <strong>the</strong> 55 Fe signal is integrated between readouts <strong>and</strong> on <strong>the</strong> clock<br />

frequency. If <strong>the</strong> noise is dominated by dark current due to production <strong>of</strong> electron-hole pairs


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 61<br />

from <strong>the</strong>rmal excitation, <strong>the</strong>n a clear dependence on <strong>the</strong> integration time should be seen:<br />

σ ≈ √ N = √ I × t int /e (4.6)<br />

where N is <strong>the</strong> number <strong>of</strong> electron collected during <strong>the</strong> integration time t int , e is <strong>the</strong> electron<br />

charge <strong>and</strong> I is <strong>the</strong> dark current. If instead <strong>the</strong> noise is dominated by a clock feed-through<br />

effect <strong>the</strong>n <strong>the</strong> noise should increase with <strong>the</strong> increase <strong>of</strong> <strong>the</strong> clock frequency. The effect in<br />

fact arises from <strong>the</strong> capacitive coupling <strong>of</strong> <strong>the</strong> clock distributive network with <strong>the</strong> imaging<br />

area. The higher <strong>the</strong> clock frequency <strong>the</strong> larger is <strong>the</strong> current passing through <strong>the</strong> clock <strong>and</strong><br />

<strong>the</strong> sharper is <strong>the</strong> clock rising time, resulting in a higher noise from <strong>the</strong> feed-through effect.<br />

The clock network capacitative coupling can change substantially <strong>the</strong> charge stored in <strong>the</strong><br />

immaging area <strong>and</strong> <strong>the</strong>refore alter <strong>the</strong> signal. A fixed pattern noise term can derive from<br />

<strong>the</strong> different strenght <strong>of</strong> <strong>the</strong> coupling to different pixels, due to <strong>the</strong> topology <strong>of</strong> <strong>the</strong> clock<br />

network. A higher frequencies slight time variations in <strong>the</strong> clock periods, clock jitter, also<br />

induce a significant source <strong>of</strong> noise due to <strong>the</strong> clock capacitative coupling. However <strong>the</strong> noise<br />

was found to be ≈ 280 electrons at room temperature <strong>and</strong> independent <strong>of</strong> <strong>the</strong> integration<br />

time or <strong>the</strong> clock frequency, disproving both hypo<strong>the</strong>ses (fig. 4.20). It was decided to check<br />

<strong>the</strong> experimental set-up external to <strong>the</strong> chip <strong>and</strong> mo<strong>the</strong>rboard for possible noise sources.<br />

300<br />

Noise (e - )<br />

280<br />

260<br />

240<br />

220<br />

Original (34C)<br />

Original (-13C)<br />

Keithley (-13C)<br />

Agilent (-13C)<br />

Agilent (34C)<br />

200<br />

180<br />

0 100 200 300 400 500<br />

Integration Time (ms)<br />

Figure 4.20: The calculated noise <strong>of</strong> <strong>the</strong> CPC2 is presented for different power supplies at<br />

different temperatures <strong>and</strong> integration times. All data points to be regarded as having an<br />

error <strong>of</strong> ± 3 electrons.<br />

It has been hence discovered that <strong>the</strong> power supply used was delivering an unstable


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 62<br />

voltage <strong>and</strong> was <strong>the</strong>refore a major source <strong>of</strong> noise. In order to eliminate this effect <strong>and</strong> start<br />

studying <strong>the</strong> intrinsic noise <strong>of</strong> <strong>the</strong> CPC2 a new power supply was used; <strong>the</strong> Oxford designed<br />

ad-hoc supply was replaced with <strong>the</strong> Agilent N6700B. Fig. 4.20 shows <strong>the</strong> result <strong>of</strong> <strong>the</strong><br />

study performed regarding <strong>the</strong> effect <strong>of</strong> different power supplies on <strong>the</strong> measured CPC noise.<br />

In this way <strong>the</strong> noise was reduced to ≈ 180 electrons. However a clear dependence on <strong>the</strong><br />

integration time still cannot be seen.<br />

It was found that <strong>the</strong> noise was in fact dominated by its reset component. Since <strong>the</strong><br />

output node is reset after every cycle <strong>the</strong> reference level varies slightly for each pixel which<br />

translates into a source <strong>of</strong> noise. The common way <strong>of</strong> dispensing with <strong>the</strong> reset noise is to<br />

implement a correlated double sampling algorithm. In such algorithm, instead <strong>of</strong> reading<br />

out <strong>the</strong> absolute value <strong>of</strong> <strong>the</strong> signal, <strong>the</strong> voltage difference between <strong>the</strong> reference level, before<br />

<strong>the</strong> charge has been transferred to <strong>the</strong> output node, <strong>and</strong> <strong>the</strong> signal level, after <strong>the</strong> charge<br />

has been transferred to <strong>the</strong> output node, is taken. Since <strong>the</strong> node is not reset in between <strong>the</strong><br />

two measurements <strong>the</strong> two levels should have exactly <strong>the</strong> same contribution from <strong>the</strong> reset<br />

noise. The noise is hence cancelled by <strong>the</strong> algorithm. Fig. 4.21 shows <strong>the</strong> user interface <strong>of</strong><br />

<strong>the</strong> implemented correlated double sampling algorithm. On <strong>the</strong> top right <strong>the</strong> full readout<br />

<strong>of</strong> <strong>the</strong> CPC is presented. On <strong>the</strong> bottom a zoom <strong>of</strong> only two clock cycles is shown. On<br />

each cycle <strong>the</strong> reset signal can be seen (<strong>the</strong> high spike). The reference level is <strong>the</strong>n <strong>the</strong><br />

following plateau, <strong>the</strong> signal level is <strong>the</strong> lower flat plateau. In <strong>the</strong> implemented correlated<br />

double sampling <strong>the</strong> value <strong>of</strong> <strong>the</strong> signal between <strong>the</strong> blue square <strong>and</strong> <strong>the</strong> yellow square is<br />

averaged <strong>and</strong> subtracted from <strong>the</strong> average value between <strong>the</strong> red <strong>and</strong> <strong>the</strong> green box. The<br />

exact location <strong>of</strong> <strong>the</strong> squares can be determined by using <strong>the</strong> controls on <strong>the</strong> top left <strong>of</strong> <strong>the</strong><br />

user interface.<br />

The noise performance was greatly improved after <strong>the</strong>se amendments to <strong>the</strong> set up <strong>and</strong><br />

<strong>the</strong>refore <strong>the</strong> dependence <strong>of</strong> <strong>the</strong> noise on <strong>the</strong> integration time <strong>and</strong> <strong>the</strong> temperature has been<br />

re-examined. While <strong>the</strong> behaviour <strong>of</strong> <strong>the</strong> dark current with respect to <strong>the</strong> integration time<br />

has already been explained, <strong>the</strong> behaviour with respect to <strong>the</strong> temperature still needs to<br />

be addressed. The <strong>the</strong>ory states that with a higher temperature <strong>the</strong> number <strong>of</strong> electron-


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 63<br />

Figure 4.21: The user interface <strong>of</strong> <strong>the</strong> correlated double sampling program. The program<br />

calculates <strong>the</strong> average between a number <strong>of</strong> user defined points in <strong>the</strong> reference <strong>and</strong> <strong>the</strong><br />

signal levels <strong>and</strong> <strong>the</strong>n calculates <strong>the</strong> difference between <strong>the</strong>se two averages. On <strong>the</strong> picture<br />

<strong>the</strong> points between which <strong>the</strong> averaging is performed are graphically represented by <strong>the</strong> red<br />

<strong>and</strong> green dots for <strong>the</strong> reference level <strong>and</strong> <strong>the</strong> blue <strong>and</strong> yellow dots for <strong>the</strong> signal level.<br />

hole pairs present at <strong>the</strong>rmal equilibrium increases <strong>and</strong> with it <strong>the</strong> resulting dark current<br />

generated noise. More specifically <strong>the</strong> dark current relationship is [41]:<br />

I ∝ T 3/2 e −Eg/kT (4.7)<br />

where I is <strong>the</strong> dark current expressed in electrons per second, T is <strong>the</strong> temperature, E g is<br />

<strong>the</strong> silicon b<strong>and</strong> gap <strong>and</strong> k is <strong>the</strong> Boltzmann constant. Fig. 4.22 shows <strong>the</strong> results <strong>of</strong> a study<br />

<strong>of</strong> <strong>the</strong> noise dependence on <strong>the</strong> temperature <strong>and</strong> <strong>the</strong> integration time.<br />

It can be immediately seen that above <strong>the</strong> temperature <strong>of</strong> 0 ◦ C <strong>the</strong> noise increases with<br />

<strong>the</strong> temperature <strong>and</strong> with <strong>the</strong> integration time. This is a good indication that most external<br />

noise sources have been eliminated <strong>and</strong> that <strong>the</strong> main source <strong>of</strong> noise is <strong>the</strong> dark current. The<br />

best level <strong>of</strong> measured noise, achieved at room temperature <strong>and</strong> with very short integration


4.3 CPC2 Readout <strong>and</strong> Noise Analysis 64<br />

Noise (e - )<br />

200<br />

190<br />

180<br />

170<br />

160<br />

150<br />

140<br />

130<br />

120<br />

110<br />

100<br />

90<br />

80<br />

70<br />

60<br />

3ms<br />

12ms<br />

24ms<br />

48ms<br />

-10 0 10 20 30 40<br />

Temperature C<br />

Figure 4.22: The CPC noise is displayed for <strong>the</strong> new improved set-up. The noise is displayed<br />

vs. <strong>the</strong> temperature <strong>of</strong> <strong>the</strong> mo<strong>the</strong>rboard <strong>and</strong> for different integration times. It has to be<br />

noted that <strong>the</strong> temperature <strong>of</strong> <strong>the</strong> mo<strong>the</strong>rboard is only a rough indication <strong>of</strong> <strong>the</strong> temperature<br />

<strong>of</strong> <strong>the</strong> CCD. All <strong>the</strong> data is taken with a clock frequency <strong>of</strong> 0.375 MHz.<br />

times, decreased to 70 ± 3 e − , from <strong>the</strong> previously presented ≈ 280e − . Incidentally this level<br />

<strong>of</strong> noise can be also achieved by decreasing <strong>the</strong> temperature <strong>and</strong> prolonging <strong>the</strong> integration<br />

time. The similarity with <strong>the</strong> result <strong>of</strong> ∼ 65 e − noise that has been achieved at <strong>the</strong> Ru<strong>the</strong>rford<br />

Laboratory with a different set-up, but using <strong>the</strong> same sensor, also seems to confirm that all<br />

major sources <strong>of</strong> external noise deriving from <strong>the</strong> hardware <strong>and</strong> s<strong>of</strong>tware set-up have been<br />

eliminated [47]. Of course some element <strong>of</strong> set-up noise is inevitable. The remaining noise is<br />

<strong>the</strong>refore constant <strong>and</strong> mainly intrinsic in <strong>the</strong> CPC2 design. This is expected to be deriving<br />

primarily from <strong>the</strong> ultimately unavoidable <strong>the</strong>rmal (kTC) noise generated by <strong>the</strong> <strong>the</strong>rmal<br />

agitation <strong>of</strong> <strong>the</strong> charge carriers. This is in particular significant in <strong>the</strong> readout stages; in<br />

<strong>the</strong> first stage <strong>of</strong> internal amplification in <strong>the</strong> CPC2 (<strong>of</strong> <strong>the</strong> order <strong>of</strong> a few tens <strong>of</strong> electrons)<br />

<strong>and</strong> also from <strong>the</strong> mo<strong>the</strong>rboard on which <strong>the</strong> CPC2 is placed. Both <strong>of</strong> <strong>the</strong>se have been<br />

investigated later in different studies <strong>and</strong> <strong>the</strong> noise was fur<strong>the</strong>r reduced to ∼ 50 e − . The set<br />

up <strong>of</strong> <strong>the</strong> s<strong>of</strong>tware <strong>and</strong> hardware for future experiment <strong>and</strong> its debugging by <strong>the</strong> use <strong>of</strong> <strong>the</strong><br />

simple noise study has hence been successfully completed. In fact <strong>the</strong> experimental set up<br />

has since been used for many different measurements <strong>and</strong> also for different applications [48].


4.4 Summary 65<br />

4.4 Summary<br />

The first part <strong>of</strong> this chapter described a study <strong>of</strong> <strong>the</strong> CPC2 sensor capacitance, <strong>the</strong> final<br />

aim being <strong>the</strong> minimization <strong>of</strong> <strong>the</strong> clock current needed to operate <strong>the</strong> sensor; currently 10 A<br />

for a 10 cm 2 sensor. The sensor capacitance has been found to be 3.11 ± 0.05 nF/cm 2 . The<br />

main contribution has been found to come from <strong>the</strong> intergate capacitance which accounts<br />

for 87% <strong>of</strong> <strong>the</strong> total pixel capacitance. Subsequently a series <strong>of</strong> different designs have been<br />

investigated with <strong>the</strong> help <strong>of</strong> finite element modelling <strong>and</strong> <strong>of</strong> test structures.<br />

The chapter also described <strong>the</strong> experimental set-up developed for <strong>the</strong> testing <strong>of</strong> <strong>the</strong> CPC2<br />

sensor. A brief description has been given <strong>of</strong> <strong>the</strong> <strong>the</strong> hardware set-up as well as <strong>of</strong> <strong>the</strong> s<strong>of</strong>tware<br />

data aquisition. Finally a short study <strong>of</strong> <strong>the</strong> CPC2 noise has been performed. The noise <strong>of</strong><br />

<strong>the</strong> sensor has been measured as 70 e − .


Chapter 5<br />

The LCFI Vertex Flavour Tagging<br />

S<strong>of</strong>tware<br />

As has been already mentioned <strong>the</strong> role <strong>of</strong> <strong>the</strong> vertex detector is to facilitate <strong>the</strong> finding<br />

<strong>of</strong> vertices <strong>and</strong> <strong>the</strong>refore also to provide crucial information for tagging <strong>the</strong> flavour <strong>of</strong> <strong>the</strong><br />

original quarks that initiated <strong>the</strong> reconstructed jets. However, as is <strong>the</strong> case with any<br />

tracking particle detector, all it actually outputs is <strong>the</strong> location <strong>of</strong> <strong>the</strong> interaction with a<br />

c<strong>and</strong>idate particle. It is up to <strong>the</strong> s<strong>of</strong>tware to reconstruct <strong>the</strong> event. The first task, for <strong>the</strong><br />

tracking s<strong>of</strong>tware, is to produce a list <strong>of</strong> tracks. Tracks are <strong>the</strong>n merged into jets by <strong>the</strong> jet<br />

clustering. These jets are <strong>the</strong> inputs to <strong>the</strong> vertexing s<strong>of</strong>tware, which tries to determine if<br />

<strong>and</strong> how many displaced vertices are present in <strong>the</strong> reconstructed jets. A primary vertex,<br />

deriving from <strong>the</strong> e + e − interaction is always present in any event <strong>and</strong> is event specific, this<br />

is where all jets originate. All fur<strong>the</strong>r reconstructed vertices are jet specific; <strong>the</strong>se vertices<br />

are called secondary. There are a series <strong>of</strong> possible reasons behind <strong>the</strong> creation <strong>of</strong> secondary<br />

vertices. They can originate from <strong>the</strong> material interaction <strong>of</strong> <strong>the</strong> particles passing through<br />

<strong>the</strong> detector, <strong>the</strong>y can originate from γ → e + e − processes, <strong>the</strong>y can originate from τ leptons,<br />

<strong>the</strong>y can originate from decaying K S /Λ or <strong>the</strong>y can originate from decaying heavy b <strong>and</strong><br />

c quarks, fig. 5. The aim <strong>of</strong> <strong>the</strong> flavour tagging s<strong>of</strong>tware is to identify <strong>the</strong> later two cases.<br />

It <strong>the</strong>refore aims to separate jets originated from b quarks from jets created from c quarks<br />

66


5.1 Track Selection 67<br />

from all o<strong>the</strong>r light jets. The issue <strong>of</strong> <strong>the</strong> contamination from <strong>the</strong> τ originated jets has not<br />

been directly dealt with in <strong>the</strong> present s<strong>of</strong>tware.<br />

Figure 5.1: Schematic showing <strong>the</strong> creation <strong>of</strong> secondary <strong>and</strong> tertiary displaced vertices in<br />

<strong>the</strong> case <strong>of</strong> a B meson decay.<br />

In general <strong>the</strong> s<strong>of</strong>tware described below deals with separating <strong>the</strong> c <strong>and</strong> b quarks from <strong>the</strong><br />

uds quarks; c <strong>and</strong> uds quarks are <strong>the</strong>refore considered as a background when <strong>the</strong> b tagging<br />

is performed, while b <strong>and</strong> uds quarks are used as a background when c tagging is performed.<br />

A separate algorithm that aims at eliminating <strong>the</strong> tracks from all <strong>the</strong> K S /Λ <strong>and</strong> photon<br />

conversions a priori has also been implemented. All <strong>the</strong> plots presented in this chapter have<br />

been tested on <strong>the</strong> LDC/ILD framework <strong>and</strong> use <strong>the</strong> LDCPrime 02Sc detector model. The<br />

data sample used is 10000 e + e − → Z/γ → q¯q events <strong>and</strong> unless differently stated <strong>the</strong>y are<br />

at a center <strong>of</strong> mass energy <strong>of</strong> 91.2 GeV. An equivalent sample with a centre <strong>of</strong> mass energy<br />

<strong>of</strong> 500 GeV has also been used. In both cases <strong>the</strong> final quark states represent <strong>the</strong> natural<br />

production at <strong>the</strong> corresponding energies. The events have been processed as described in<br />

<strong>the</strong> simulation chapter <strong>and</strong> forced into a dijet topology at jet clustering level.<br />

Please note that this chapter is derived from a paper published by <strong>the</strong> LCFI Collaboration<br />

[49], to which <strong>the</strong> author <strong>of</strong> <strong>the</strong> <strong>the</strong>sis has made major contributions.<br />

5.1 Track Selection<br />

The first step in <strong>the</strong> process is to remove all <strong>the</strong> spurious tracks <strong>and</strong> all tracks deriving from<br />

photon conversions <strong>and</strong> from K S /Λ decays. Initially all two-track combinations present in


5.1 Track Selection 68<br />

<strong>the</strong> whole event are considered. The two track combination is considered as a K S /Λ decay<br />

or photon conversion if it passes all <strong>of</strong> <strong>the</strong> following criteria:<br />

• The tracks must have opposite charge sign.<br />

• The distance <strong>of</strong> closest approach between <strong>the</strong> two track helices must not exceed 1 mm.<br />

• The distance between <strong>the</strong> point <strong>of</strong> closest approach <strong>and</strong> <strong>the</strong> interaction point (IP) must<br />

be larger than 1 mm in order to reduce <strong>the</strong> risk <strong>of</strong> fake tags consisting <strong>of</strong> combinations<br />

<strong>of</strong> primary vertex tracks.<br />

• The invariant mass <strong>of</strong> <strong>the</strong> two track combination has to be compatible with that <strong>of</strong> <strong>the</strong><br />

K S ,Λ or <strong>the</strong> photon.<br />

To check <strong>the</strong> mass compatibility, <strong>the</strong> rest mass <strong>of</strong> <strong>the</strong> combination is calculated using<br />

three mass hypo<strong>the</strong>ses, choosing <strong>the</strong> masses <strong>of</strong> <strong>the</strong> decay products accordingly: both tracks<br />

are assumed to be electrons in <strong>the</strong> case <strong>of</strong> conversions, or pions for <strong>the</strong> K S hypo<strong>the</strong>sis; for <strong>the</strong><br />

Λ hypo<strong>the</strong>sis <strong>the</strong> track with larger momentum is assigned <strong>the</strong> proton mass <strong>and</strong> <strong>the</strong> o<strong>the</strong>r <strong>the</strong><br />

pion mass. The resulting rest mass <strong>of</strong> <strong>the</strong> combination is considered to be compatible with<br />

<strong>the</strong> hypo<strong>the</strong>sis if it differs from <strong>the</strong> PDG value [8] by not more than 5 MeV for conversions<br />

<strong>and</strong> kaons <strong>and</strong> by not more than 2 MeV for Lambdas. The efficiencies <strong>of</strong> <strong>the</strong> devised<br />

procedure are 25%, 72% <strong>and</strong> 70% for <strong>the</strong> photon conversion, <strong>the</strong> K S <strong>and</strong> Λ respectively. The<br />

equivalent fake rates are 4%, 33% <strong>and</strong> 38%. The increased performance from <strong>the</strong> usage <strong>of</strong><br />

this additional preliminary track selection algorithm will be analysed in <strong>the</strong> result section at<br />

<strong>the</strong> end <strong>of</strong> this chapter.<br />

All objects that are not identified as stemming from <strong>the</strong> above sources are passed on<br />

to <strong>the</strong> track selection processor preceding <strong>the</strong> subsequent steps such as vertex finding <strong>and</strong><br />

flavour tagging. In order to eliminate <strong>the</strong> presence <strong>of</strong> spurious tracks a set <strong>of</strong> cuts has also<br />

been developed. These tracks can be tuned independently for <strong>the</strong> various sections <strong>of</strong> <strong>the</strong><br />

flavour tagging s<strong>of</strong>tware. It is <strong>the</strong>refore possible to implement a set <strong>of</strong> cuts while fitting <strong>the</strong><br />

interaction point, a completely different set when searching for secondary vertices <strong>and</strong> yet a


5.2 The ZVRES Vertex Finding Algorithm 69<br />

different one when calculating <strong>the</strong> flavour discriminating variables. In order to calculate <strong>the</strong><br />

optimal cut values an extensive optimization analysis has been performed. Different values<br />

for cut parameters have been implemented <strong>and</strong> compared by looking at <strong>the</strong>ir flavour tagging<br />

performance; cross correlation between parameters has however not been considered in order<br />

to reduce <strong>the</strong> number <strong>of</strong> required iterations. The values used in <strong>the</strong> presented analysis are<br />

shown in table 5.1. Please note that this table only presents <strong>the</strong> track selection parameters.<br />

A detailed list <strong>of</strong> all remaining parameters <strong>of</strong> <strong>the</strong> LCFI Vertex s<strong>of</strong>tware can be found in<br />

Appendix B.<br />

Track Selection Parameters<br />

Parameter ≷ IP Fit Vertexing Flavour Tagging<br />

χ 2 / ndf <strong>of</strong> track fit < 5 4 -<br />

R − φ impact parameter d 0 (mm) < 20 2 20<br />

R − z impact parameter z 0 (mm) < 20 5 20<br />

d 0 uncertainty (mm) < - 0.007 -<br />

z 0 uncertainty (mm) < - 0.025 -<br />

track p T (GeV) > 0.1 0.2 0.1<br />

Table 5.1: Track selection parameters for all <strong>the</strong> LCFI Vertex algorithms. Please note that<br />

in <strong>the</strong> case <strong>of</strong> flavour tagging <strong>the</strong> presented track selections are only <strong>the</strong> ones applied to <strong>the</strong><br />

whole process. Additional track selection procedures can apply for specific discriminatory<br />

variables.<br />

5.2 The ZVRES Vertex Finding Algorithm<br />

The ZVTOP vertex finder, developed at <strong>the</strong> SLD experiment [50], provides two complementary<br />

algorithms which use topological information to identify track combinations that are<br />

likely to have <strong>the</strong>ir origin at a common vertex. The first <strong>of</strong> <strong>the</strong>se, <strong>the</strong> ZVRES algorithm, can<br />

be used to find multi-pronged secondary vertices with an arbitrary geometrical distribution<br />

<strong>and</strong> hence is most generally applicable, provided <strong>the</strong> detector system has a sufficiently high<br />

spatial resolution. The second algorithm, ZVKIN, has a more specialistic function <strong>and</strong> aims<br />

to address <strong>the</strong> case <strong>of</strong> one prong vertices originating from decays with only one charged track.<br />

Both algorithms have been completely re-coded <strong>and</strong> re-developed inside <strong>the</strong> ILC s<strong>of</strong>tware<br />

framework by <strong>the</strong> LCFI collaboration [49].


5.2 The ZVRES Vertex Finding Algorithm 70<br />

Since only <strong>the</strong> ZVRES algorithm has been used throughout <strong>the</strong> presented <strong>the</strong>sis this is<br />

<strong>the</strong> only one described here. A description <strong>of</strong> <strong>the</strong> ZVKIN algorithm can be found in [49, 51].<br />

The central idea <strong>of</strong> <strong>the</strong> ZVRES algorithm is to describe each track i by a probability density<br />

function f i (⃗r) in three-dimensional space <strong>and</strong> to use <strong>the</strong>se to define a vertex function V (⃗r)<br />

that yields higher values in <strong>the</strong> vicinity <strong>of</strong> true vertex locations <strong>and</strong> lower values elsewhere,<br />

as well as providing a criterion for when two vertex c<strong>and</strong>idates are resolved from each o<strong>the</strong>r.<br />

The track functions have a Gaussian pr<strong>of</strong>ile in <strong>the</strong> plane normal to <strong>the</strong> trajectory. With<br />

⃗p <strong>the</strong> point <strong>of</strong> closest approach <strong>of</strong> <strong>the</strong> track i to <strong>the</strong> space point ⃗r, <strong>the</strong> track function f i (⃗r) is<br />

defined as:<br />

{<br />

f i (⃗r) = exp − 1 }<br />

(⃗r − ⃗p)V−1 i (⃗r − ⃗p) T<br />

2<br />

(5.1)<br />

where V −1<br />

i<br />

is <strong>the</strong> inverse <strong>of</strong> <strong>the</strong> position covariance matrix <strong>of</strong> track ⃗p. A graphical representation<br />

<strong>of</strong> <strong>the</strong> track functions can be seen in 5.2(a). The vertex function, in its most basic<br />

form, is defined as:<br />

V (⃗r) =<br />

N∑<br />

∑ N<br />

i=1<br />

f i (⃗r) −<br />

f2 i (⃗r)<br />

∑ N<br />

i=1 f i(⃗r)<br />

i=1<br />

(5.2)<br />

with <strong>the</strong> second term ensuring that V (⃗r) approaches zero in spatial regions in which only<br />

one track contributes significantly to <strong>the</strong> first term <strong>and</strong> where hence no vertex should be<br />

found. A graphical representation <strong>of</strong> <strong>the</strong> track functions can be seen in 5.2(b).<br />

(a)<br />

(b)<br />

Figure 5.2: A graphical representation <strong>of</strong> <strong>the</strong> ZVRES algorithm. In a) <strong>the</strong> track function<br />

stage can be seen, in b) <strong>the</strong> vertex function stage is displayed.<br />

Optionally, fur<strong>the</strong>r knowledge on where vertices are more likely to be found can be used


5.2 The ZVRES Vertex Finding Algorithm 71<br />

to weight <strong>the</strong> vertex function, <strong>the</strong>reby suppressing fake vertices <strong>and</strong> increasing <strong>the</strong> purity <strong>of</strong><br />

<strong>the</strong> vertices found (i.e. <strong>the</strong> fraction <strong>of</strong> correctly assigned tracks). The knowledge <strong>of</strong> <strong>the</strong> IP<br />

position can be used to suppress fake vertices from tracks passing close to each o<strong>the</strong>r in <strong>the</strong><br />

vicinity <strong>of</strong> <strong>the</strong> IP. This is accomplished by representing <strong>the</strong> IP by a contribution:<br />

{<br />

f 0 (⃗r) = exp − 1 }<br />

(⃗r − ⃗p)V−1<br />

IP<br />

2 (⃗r − ⃗p)T<br />

(5.3)<br />

where ⃗p is now <strong>the</strong> position <strong>of</strong> <strong>the</strong> IP <strong>and</strong> V IP is its covariance matrix. With this inclusion<br />

<strong>the</strong> vertex function becomes:<br />

V (⃗r) = w IP f 0 (⃗r) +<br />

N∑<br />

i=1<br />

f i (⃗r) − w0 IP f 0(⃗r) + ∑ N<br />

i=1 f2 i (⃗r)<br />

w IP f 0 (⃗r) + ∑ N<br />

i=1 f i(⃗r)<br />

(5.4)<br />

where w IP is a weighting factor. Values <strong>of</strong> w IP > (


5.2 The ZVRES Vertex Finding Algorithm 72<br />

than at large angle from it, which is taken into account by weighting <strong>the</strong> vertex function<br />

outside a cylinder <strong>of</strong> radius 50 µm by:<br />

V ′ (⃗r) = V (⃗r) exp(−kE Jet α 2 ) (5.5)<br />

where k is a user defined parameter <strong>and</strong> α is <strong>the</strong> angle formed between <strong>the</strong> location r <strong>and</strong><br />

<strong>the</strong> edge <strong>of</strong> this cylinder. Once <strong>the</strong> vertex function has been calculated it is still important<br />

to define <strong>the</strong> parameters that will resolve <strong>the</strong> vertices. In order for two vertices at different<br />

positions to be declared resolved <strong>the</strong>y must satisfy <strong>the</strong> following condition:<br />

min{V ′ (⃗r) : ⃗r ∈ ⃗r 1 + α(⃗r 2 − ⃗r 1 ), 0 ≤ α ≤ 1}<br />

min{V ′ (⃗r 1 ),V ′ (⃗r 2 )}<br />

< R 0 (5.6)<br />

where V ′ (⃗r 1 ),V ′ (⃗r 2 ) are <strong>the</strong> values <strong>of</strong> <strong>the</strong> vertex function at its local maxima; min{V ′ (⃗r 1 ),V (⃗r 2 )}<br />

is <strong>the</strong>refore <strong>the</strong> lower <strong>of</strong> <strong>the</strong> two maxima. min{V (⃗r) : ⃗r ∈ ⃗r 1 + α(⃗r 2 − ⃗r 1 ), 0 ≤ α ≤ 1} is <strong>the</strong><br />

minimum <strong>of</strong> <strong>the</strong> vertex function on <strong>the</strong> straight line connecting <strong>the</strong> maxima V ′ (⃗r 1 ),V ′ (⃗r 2 ); R 0<br />

is instead an externally defined parameter. Having defined <strong>the</strong> ma<strong>the</strong>matical functionality<br />

<strong>of</strong> ZVRES we can now describe <strong>the</strong> process algorithmically:<br />

• Initially all track-track <strong>and</strong> track-IP combinations are fitted. Only fits with χ 2 < χ 0<br />

<strong>and</strong> that satisfy a minimum value <strong>of</strong> <strong>the</strong> vertex function are kept. χ 0 <strong>and</strong> <strong>the</strong> minimum<br />

value <strong>of</strong> <strong>the</strong> vertex function are user defined parameters.<br />

• Tracks that are associated with two or more vertices are removed from all <strong>the</strong> vertices<br />

that have V ′ (⃗r V ert ) < V ′ (⃗r MAX ). Where V ′ (⃗r MAX ) is <strong>the</strong> maximum vertex function <strong>of</strong><br />

all <strong>the</strong> vertices that <strong>the</strong> track is associated with.<br />

• All <strong>the</strong> resulting V ′ vertex functions are ordered from <strong>the</strong> highest to <strong>the</strong> lowest. The<br />

resolution criterion, equation 5.6, is used to attempt in combining vertices starting<br />

from <strong>the</strong> one with <strong>the</strong> highest V ij . The process stops when all remaining vertices are<br />

considered as being resolved.<br />

• Tracks with a high χ 2 contribution are removed from <strong>the</strong> resulting c<strong>and</strong>idate vertices:


5.2 The ZVRES Vertex Finding Algorithm 73<br />

iteratively, <strong>the</strong> track with <strong>the</strong> highest χ 2 contribution is removed if its contribution to<br />

χ 2 is above a threshold χ 2 TRIM . The vertex is <strong>the</strong>n refit.<br />

• If a track is still associated with more than two vertices <strong>the</strong> ambiguity is solved by<br />

removing it from <strong>the</strong> vertex with <strong>the</strong> lowest vertex function maximum.<br />

Throughout this process <strong>the</strong> vertices that are not associated with at least two tracks (or that<br />

do not represent <strong>the</strong> IP) are removed. The default parameters <strong>of</strong> <strong>the</strong> ZVRES algorithm are<br />

given in Appendix B.<br />

5.2.1 ZVRES Performance<br />

For <strong>the</strong> purposes <strong>of</strong> benchmarking a pure e + e − → Z/γ → b¯b sample produced at <strong>the</strong> centre<br />

<strong>of</strong> mass energy <strong>of</strong> 91.2 GeV has been used. The energy has been selected to allow for direct<br />

comparison with earlier studies <strong>and</strong> because <strong>the</strong> ILC calibration run will be operated at<br />

this energy. First <strong>of</strong> all <strong>the</strong> efficiency <strong>of</strong> secondary vertex finding has been analysed in fig.<br />

5.3. It can be immediately seen that <strong>the</strong> vertex finding efficiency is approximately constant<br />

∈(sec. vertex)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

0 2 4 6 8 10 12 14<br />

Monte Carlo B decay length/mm<br />

Figure 5.3: Efficiency for finding a secondary vertex, for a pure sample <strong>of</strong> B mesons<br />

for all vertices that are displaced by more than 1 mm at MC level. In total 89% <strong>of</strong> <strong>the</strong><br />

vertices that are displaced by more than 1 mm at MC level are found by <strong>the</strong> algorithm. It<br />

is clear that at very small decay lengths is it very difficult, if not impossible, to distinguish<br />

<strong>the</strong> secondary vertex from <strong>the</strong> IP, hence <strong>the</strong> visible drop <strong>of</strong> efficiency. This result can be<br />

compared with <strong>the</strong> performance <strong>of</strong> previous detectors <strong>and</strong> implementations <strong>of</strong> <strong>the</strong> ZVRES


5.2 The ZVRES Vertex Finding Algorithm 74<br />

algorithm. In <strong>the</strong> case <strong>of</strong> <strong>the</strong> SLD-VDX3 detector <strong>the</strong> efficiency was 80% <strong>and</strong> <strong>the</strong> plateau <strong>of</strong><br />

constant efficiency started only at 2 mm [53]; <strong>the</strong> vertex finding efficiency for <strong>the</strong> ILC vertex<br />

detector model is clearly improved. The improvement is mostly due to <strong>the</strong> larger angular<br />

coverage, better impact parameter resolution <strong>and</strong> reduced material budget <strong>of</strong> <strong>the</strong> proposed<br />

ILC vertex detectors.<br />

Using <strong>the</strong> same sub-sample <strong>of</strong> b¯b jets, <strong>the</strong> reconstructed decay lengths <strong>of</strong> <strong>the</strong> B <strong>and</strong> D<br />

mesons were compared to <strong>the</strong> corresponding MC decay lengths on a jet-by-jet basis. In<br />

fig. 5.4(a), <strong>the</strong> B decay length comparison is shown for jets in which at least two vertices<br />

(<strong>the</strong> primary, or IP vertex, plus at least one secondary vertex) were found. This class <strong>of</strong><br />

jets includes cases in which <strong>the</strong> D meson decayed so close to <strong>the</strong> B decay vertex that <strong>the</strong><br />

D meson generated vertex could not be resolved. For <strong>the</strong>se cases <strong>the</strong> reconstructed decay<br />

length is shifted to larger values, if compared to <strong>the</strong> MC B decay length. Short B decay<br />

lengths, where <strong>the</strong> B meson secondary vertex cannot be resolved, but <strong>the</strong> D meson generated<br />

one can, also result in reconstructed secondary vertex decay length that are larger than <strong>the</strong><br />

MC truth value. In fig. 5.4(b), <strong>the</strong> decay length <strong>of</strong> <strong>the</strong> D meson is compared to <strong>the</strong> MC<br />

value for jets in which exactly three vertices were reconstructed by ZVRES. As expected for<br />

this category <strong>of</strong> jets, <strong>the</strong> correlation between reconstructed <strong>and</strong> MC values is better. The<br />

efficiencies for <strong>the</strong> finding <strong>of</strong> <strong>the</strong> tertiary vertex is 73%; 972 vertices are reconstructed <strong>of</strong> <strong>the</strong><br />

1331 present at MC level.<br />

In addition to <strong>the</strong> precision with which <strong>the</strong> decay lengths can be reconstructed, clearly an<br />

important parameter for many physics studies, also <strong>the</strong> track content <strong>of</strong> <strong>the</strong> reconstructed<br />

vertices has been investigated. For each type <strong>of</strong> reconstructed vertex: <strong>the</strong> primary, <strong>the</strong> secondary<br />

<strong>and</strong>, if available, <strong>the</strong> tertiary vertex, <strong>the</strong> average percentage <strong>of</strong> tracks that originated<br />

from <strong>the</strong> corresponding MC decay vertex was determined, yielding <strong>the</strong> purity <strong>of</strong> <strong>the</strong> track<br />

content <strong>of</strong> each type <strong>of</strong> vertex. Table 5.2 gives <strong>the</strong> purities for b jets at centre <strong>of</strong> mass energies<br />

<strong>of</strong> 91.2 GeV <strong>and</strong>, most importantly for this analysis, at 500 GeV. These purities are particularly<br />

important for <strong>the</strong> following determination <strong>of</strong> <strong>the</strong> flavour <strong>of</strong> <strong>the</strong> jet <strong>and</strong> <strong>of</strong> <strong>the</strong> charge<br />

<strong>of</strong> <strong>the</strong> vertex. The tracks associated to <strong>the</strong> vertex are widely used for <strong>the</strong>se determinations


5.2 The ZVRES Vertex Finding Algorithm 75<br />

rec. sec. vertex decay length / mm<br />

20<br />

18<br />

16<br />

14<br />

12<br />

10<br />

8<br />

6<br />

4<br />

2<br />

at least 2 ZVTOP vertices found<br />

rec. ter. - sec. vertex decay length / mm<br />

20<br />

18<br />

16<br />

14<br />

12<br />

10<br />

8<br />

6<br />

4<br />

2<br />

exactly 3 ZVTOP vertices found<br />

0<br />

0 2 4 6 8 10 12 14 16 18 20<br />

Monte Carlo B decay length / mm<br />

0<br />

0 2 4 6 8 10 12 14 16 18 20<br />

Monte Carlo D decay length / mm<br />

(a)<br />

(b)<br />

Figure 5.4: Reconstructed decay length vs. MC decay length for: a) B meson decays in b<br />

jets in which at least two vertices have been found, b) D meson decays in b jets with three<br />

vertices. The secondary vertex is defined as <strong>the</strong> non-primary vertex with <strong>the</strong> shortest decay<br />

length, distance to <strong>the</strong> IP, <strong>the</strong> tertiary vertex is instead <strong>the</strong> vertex with <strong>the</strong> second shortest<br />

distance to <strong>the</strong> IP. The secondary is <strong>the</strong>refore associated with <strong>the</strong> B meson decay <strong>and</strong> <strong>the</strong><br />

tertiary is associated with <strong>the</strong> D meson decay. The case where more than two vertices are<br />

present in <strong>the</strong> same jet is very rare.<br />

<strong>and</strong> <strong>the</strong> mis-association <strong>of</strong> even a single track can vary <strong>the</strong> output <strong>of</strong> <strong>the</strong> algorithms.<br />

The best assignment <strong>of</strong> tracks to vertices, corresponding to <strong>the</strong> highest purities, was<br />

obtained for b jets with three reconstructed vertices. This is underst<strong>and</strong>able given <strong>the</strong> fact<br />

that if both <strong>the</strong> bottom <strong>and</strong> <strong>the</strong> subsequent charm decay result in multi-prong vertices, <strong>the</strong><br />

result is a cleaner topology that can be more easily reconstructed. If a smaller number <strong>of</strong><br />

vertices is found in b jets, this can indicate ei<strong>the</strong>r that one <strong>of</strong> <strong>the</strong> decay vertices is onepronged<br />

<strong>and</strong> thus cannot be found by ZVRES, or that <strong>the</strong> decay length <strong>of</strong> one <strong>of</strong> <strong>the</strong> heavy<br />

flavour hadrons is so short that its decay vertex cannot be resolved from <strong>the</strong> preceding vertex<br />

in <strong>the</strong> decay chain, <strong>and</strong> that, for example, <strong>the</strong> secondary vertex found by ZVRES contains<br />

some tracks that actually originated from <strong>the</strong> D decay <strong>and</strong> some from <strong>the</strong> B decay. Both<br />

effects can result in a mis-assignment <strong>of</strong> tracks by ZVRES <strong>and</strong> hence to a reduced purity<br />

for vertices in this category. While comparing <strong>the</strong> results at 91.2GeV <strong>and</strong> 500GeV one must<br />

bear in mind that <strong>the</strong> increase <strong>of</strong> <strong>the</strong> centre <strong>of</strong> mass energy results in an increased collimation<br />

<strong>of</strong> jets as well as in <strong>the</strong> increased multiplicity <strong>of</strong> <strong>the</strong> tracks originated from <strong>the</strong> IP. As a result


5.3 Flavour Discriminating Variables 76<br />

Monte Carlo Reconstructed track-vertex association<br />

track origin Two vertex case Three vertex case<br />

pri sec iso pri sec ter iso<br />

91.2GeV Primary 91.5 1.4 36.2 95.2 3.1 1.9 49.9<br />

B Decay 6.7 46.7 29.6 3.1 75.3 10.6 22.8<br />

D Decay 1.8 51.9 34.2 1.7 21.6 87.5 27.2<br />

500 GeV Primary 93.7 2.6 35.3 97.4 4.9 4.0 48.5<br />

B Decay 4.6 47.3 29.8 1.8 72.3 13.5 24.5<br />

D Decay 1.7 50.1 34.9 0.8 22.9 82.5 27.0<br />

Table 5.2: Percentages <strong>of</strong> tracks assigned to <strong>the</strong> reconstructed primary, secondary <strong>and</strong> tertiary<br />

vertex <strong>and</strong> <strong>of</strong> tracks not associated to any vertex (labelled iso) which originate from<br />

<strong>the</strong> IP, <strong>the</strong> B or <strong>the</strong> D decay at MC level, for b jets.<br />

vertex finding is more challenging. This effect is at least partially counterbalanced by <strong>the</strong><br />

increased decay lengths <strong>of</strong> <strong>the</strong> hadronic particles due to <strong>the</strong> increased boost. As it can be<br />

seen <strong>the</strong> net result is a slightly lower purity in <strong>the</strong> track assignments when trying to separate<br />

between <strong>the</strong> secondary <strong>and</strong> <strong>the</strong> tertiary vertex.<br />

At last it has to be pointed out that <strong>the</strong> number <strong>of</strong> tracks not assigned to any vertex in<br />

<strong>the</strong> present configuration <strong>of</strong> <strong>the</strong> algorithm is very low. For <strong>the</strong> case <strong>of</strong> b quark jets in <strong>the</strong> 91.2<br />

GeV centre <strong>of</strong> mass simulation this is only 8.0% if <strong>the</strong> tertiary vertex is reconstructed <strong>and</strong><br />

13.9% if only <strong>the</strong> secondary is reconstructed. For c quark jets <strong>the</strong>se numbers are 7.0% <strong>and</strong><br />

7.6% respectively. In <strong>the</strong> case <strong>of</strong> <strong>the</strong> 500 GeV simulation <strong>the</strong> numbers rise by a percentage<br />

point or two.<br />

5.3 Flavour Discriminating Variables<br />

Once <strong>the</strong> vertices <strong>of</strong> <strong>the</strong> jet have been found it is important to attempt to distinguish between<br />

<strong>the</strong> jets originated from b, c <strong>and</strong> light quarks. In order to perform this procedure a series <strong>of</strong><br />

discriminating parameters based on <strong>the</strong> quark proprieties as well as kinematic <strong>and</strong> topological<br />

considerations are used. The variables presented in this analysis have been used in a previous<br />

study done on <strong>the</strong> TESLA experiment [54]. Most if not all <strong>of</strong> <strong>the</strong>m have however been<br />

developed <strong>and</strong> tested in previous experiments such as ALEPH, OPAL, SLD <strong>and</strong> DELPHI<br />

[55, 56, 57, 58]. Many <strong>of</strong> <strong>the</strong> variables most sensitive to <strong>the</strong> jet flavour are only defined for


5.3 Flavour Discriminating Variables 77<br />

jets in which non-IP vertices have been found, different procedures are <strong>the</strong>refore used for jets<br />

containing one vertex <strong>and</strong> jets containing more than one vertex. This process is ideal also<br />

because <strong>the</strong> number <strong>of</strong> vertices already provides a very good indication <strong>of</strong> <strong>the</strong> original quark.<br />

In fact in <strong>the</strong> events where two or more secondary vertices are found at reconstruction level<br />

91.7% <strong>of</strong> events have been found to be from b quark jets at MC level, 7.7% from c jets at<br />

MC level <strong>and</strong> <strong>the</strong> remaining from light jets at MC level. If one secondary vertex has been<br />

found 54.3% <strong>of</strong> events are from b quarks, 34.4% from c quarks <strong>and</strong> 11.3% from uds quarks.<br />

In <strong>the</strong> case that only <strong>the</strong> primary vertex is found <strong>the</strong>se values are 6.4%, 13.2% <strong>and</strong> 80.4%.<br />

As usual this study has been done on <strong>the</strong> 91.2 GeV sample.<br />

It must be also noted that, as <strong>the</strong> ZVTOP calculation, <strong>the</strong> flavour algorithm is done on<br />

a jet by jet basis, no information from o<strong>the</strong>r jets is <strong>the</strong>refore used for <strong>the</strong> determination <strong>of</strong><br />

<strong>the</strong> jet flavour.<br />

5.3.1 Variables for Jets without Secondary Vertices<br />

In <strong>the</strong> case that only one vertex, <strong>the</strong> IP vertex, has been found, <strong>the</strong> input jet is searched<br />

for <strong>the</strong> two tracks <strong>of</strong> highest impact parameter significance in <strong>the</strong> R-φ plane. These are<br />

referred to as <strong>the</strong> most significant <strong>and</strong> <strong>the</strong> second-most significant track in what follows.<br />

For finding <strong>the</strong>se two tracks, separate minimum momentum cuts p trk,NL ,min <strong>and</strong> p trk,NL −1,min<br />

are applied for tracks with hits on all N L vertex detector layers or with hits on only N L -<br />

1 layers. The momenta |p trk | <strong>and</strong> <strong>the</strong> impact parameter significances <strong>of</strong> <strong>the</strong>se tracks in <strong>the</strong><br />

R-φ plane <strong>and</strong> <strong>the</strong> R-z plane are used as flavour discriminating variables. A single track <strong>of</strong><br />

high impact parameter significance may indicate that <strong>the</strong> jet under consideration is a charm<br />

jet with <strong>the</strong> leading D ± having decayed to a single charged track (one-prong decay), which<br />

is expected for ≈ 40% <strong>of</strong> all D ± decays. The observables obtained from <strong>the</strong> second most<br />

significant track help distinguish between c <strong>and</strong> b jets, for which it is more likely that two<br />

tracks <strong>of</strong> high impact parameter significance are found, typically with one resulting from <strong>the</strong><br />

decay <strong>of</strong> <strong>the</strong> leading hadron <strong>and</strong> one from <strong>the</strong> decay <strong>of</strong> <strong>the</strong> charmed hadron produced in that<br />

decay. Fig. 5.5 shows <strong>the</strong> distributions <strong>of</strong> <strong>the</strong> inputs, used for <strong>the</strong> flavour tagging, for <strong>the</strong>


5.3 Flavour Discriminating Variables 78<br />

most significant <strong>and</strong> second-most significant tracks, separately for b, c <strong>and</strong> light flavour jets.<br />

Please note that <strong>the</strong> distributions plotted are cumulative; hence <strong>the</strong> b distribution includes<br />

<strong>the</strong> plotted c <strong>and</strong> uds events <strong>and</strong> <strong>the</strong> c distribution includes <strong>the</strong> plotted uds events. The<br />

sample used for <strong>the</strong> distribution is 10000 e + e − → Z/γ → q¯q events, with natural branching<br />

ratios at 91.2 GeV centre <strong>of</strong> mass energy.


5.3 Flavour Discriminating Variables 79<br />

N jets<br />

4<br />

10<br />

N jets<br />

4<br />

10<br />

3<br />

10<br />

3<br />

10<br />

2<br />

10<br />

2<br />

10<br />

10<br />

10<br />

1<br />

1<br />

-10 -5 0 5 10 15 20<br />

most signif. trk, d<br />

/σ(d<br />

)<br />

0<br />

0<br />

-10 -5 0 5 10 15 20<br />

2nd most signif. trk, d<br />

/σ(d<br />

)<br />

0<br />

0<br />

(a)<br />

(b)<br />

N jets<br />

N jets<br />

3<br />

10<br />

3<br />

10<br />

2<br />

10<br />

2<br />

10<br />

10<br />

10<br />

-10 -8 -6 -4 -2 0 2 4 6 8 10<br />

most signif. trk, z<br />

/σ(z<br />

)<br />

0<br />

0<br />

1<br />

-10 -8 -6 -4 -2 0 2 4 6 8 10<br />

2nd most signif. trk, z<br />

/σ(z<br />

)<br />

0<br />

0<br />

(c)<br />

(d)<br />

N jets<br />

1000<br />

800<br />

600<br />

N jets<br />

800<br />

700<br />

600<br />

500<br />

b-jets<br />

c-jets<br />

u,d,s-jets<br />

400<br />

400<br />

300<br />

200<br />

200<br />

100<br />

0<br />

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2<br />

most signif. trk, log|p<br />

(e)<br />

trk<br />

| / GeV<br />

0<br />

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2<br />

2nd most signif. trk, log|p<br />

(f)<br />

trk<br />

| / GeV<br />

Figure 5.5: Flavour discriminating variables used when only <strong>the</strong> IP vertex is reconstructed;<br />

based on <strong>the</strong> most [plots a), c), e)] <strong>and</strong> second-most [plots b), d), f)] significant track in <strong>the</strong><br />

jet. Shown are <strong>the</strong> impact parameter significance in R-φ in a), b), <strong>the</strong> impact parameter<br />

significance in R-z in c), d) <strong>and</strong> <strong>the</strong> track momentum in e), f).


5.3 Flavour Discriminating Variables 80<br />

Fur<strong>the</strong>r information is contained in <strong>the</strong> joint probability for all tracks to originate from<br />

<strong>the</strong> primary vertex, as introduced by ALEPH [55]. Two joint probability variables are<br />

calculated from <strong>the</strong> impact parameter significances in R-φ <strong>and</strong> in R-z <strong>of</strong> all <strong>the</strong> tracks in<br />

<strong>the</strong> jet that pass <strong>the</strong> specific selection criteria. The distribution f(x) <strong>of</strong> unsigned impact<br />

parameter significances for tracks originating from <strong>the</strong> IP is assumed to be known; it can<br />

be determined from <strong>the</strong> data, as described in <strong>the</strong> next subsection. The probability <strong>of</strong> an IP<br />

track having an impact parameter significance <strong>of</strong> b/σ b or larger is <strong>the</strong>n given by:<br />

P i =<br />

∫ ∞<br />

b/σ b<br />

f(x)dx<br />

∫ ∞<br />

0<br />

f(x)dx . (5.7)<br />

For a set <strong>of</strong> N tracks <strong>the</strong> probability,P J , that all N tracks originate from <strong>the</strong> IP is:<br />

N−1<br />

∑<br />

P J = y<br />

k=0<br />

−ln(y) k<br />

k!<br />

y = Π N−1<br />

i=0 P i (5.8)<br />

y is <strong>the</strong> product <strong>of</strong> <strong>the</strong> N individual track probabilities <strong>and</strong> P J is <strong>the</strong> probability that <strong>the</strong><br />

product <strong>of</strong> N r<strong>and</strong>om numbers uniformly distributed from 0 to 1 is y or smaller. The joint<br />

probability observable, P J , is calculated for <strong>the</strong> set <strong>of</strong> tracks that pass <strong>the</strong> track selection<br />

cuts described in table 5.1 as well as an upper cut on <strong>the</strong> impact parameter <strong>of</strong> 5 mm <strong>and</strong><br />

on <strong>the</strong> impact parameter significance <strong>of</strong> 200. As mentioned it is calculated separately for<br />

<strong>the</strong> R-φ <strong>and</strong> <strong>the</strong> R-z impact parameter significances. It can be seen from <strong>the</strong> resulting P J<br />

distributions shown in fig. 5.6(a) <strong>and</strong> fig. 5.6(b) that light quark jets tend to have values <strong>of</strong><br />

P J closer to 1, while <strong>the</strong> distributions for b <strong>and</strong> c jets peak at zero.<br />

It can be noted that, given <strong>the</strong> above definition, <strong>the</strong> joint probability distribution <strong>of</strong> all<br />

uds jet should be flat. However this is only true under <strong>the</strong> condition that <strong>the</strong> track selection<br />

used for calculating <strong>the</strong> impact parameter significance distribution <strong>and</strong> <strong>the</strong> track selection<br />

used when calculating <strong>the</strong> joint probability are equivalent. This has not been <strong>the</strong> case in this<br />

study. The track selections applied are in fact complex <strong>and</strong> still in <strong>the</strong> process <strong>of</strong> being fully<br />

optimized, with <strong>the</strong> consequence <strong>of</strong> time consuming recalculation <strong>of</strong> <strong>the</strong> impact parameter<br />

distribution being needed every time any parameter has been changed. Instead <strong>the</strong> impact


5.3 Flavour Discriminating Variables 81<br />

parameter significance distribution has been calculated prior to <strong>the</strong> track selections <strong>and</strong><br />

prior to <strong>the</strong> K S , Λ <strong>and</strong> photon conversions. Additionally, for similar reasons, <strong>the</strong> default IP<br />

value has been used when calculating <strong>the</strong> impact parameter significance distributions. The<br />

observed distribution is <strong>the</strong>refore not flat <strong>and</strong>, as it can be seen in fig. 5.6(a) <strong>and</strong> fig. 5.6(b),<br />

<strong>the</strong> probability <strong>of</strong> quarks originating from <strong>the</strong> IP is shifted to slightly higher values. However<br />

it is clear from <strong>the</strong> plotted distributions that <strong>the</strong> discriminating power <strong>of</strong> this method is still<br />

present.<br />

N jets<br />

N jets<br />

b-jets<br />

3<br />

10<br />

3<br />

10<br />

c-jets<br />

u,d,s-jets<br />

2<br />

10<br />

2<br />

10<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

joint probability, R-φ<br />

(a)<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

joint probability, R-z<br />

(b)<br />

Figure 5.6: Joint probability for all tracks in <strong>the</strong> jet passing <strong>the</strong> track selection cuts to<br />

originate from <strong>the</strong> primary vertex. This probability is calculated separately from a) <strong>the</strong> R-φ<br />

impact parameter significances <strong>and</strong> b) <strong>the</strong> impact parameter significances in R-z. Please note<br />

that <strong>the</strong> distributions plotted are cumulative; hence <strong>the</strong> b distribution includes <strong>the</strong> plotted c<br />

<strong>and</strong> uds events <strong>and</strong> <strong>the</strong> c distribution includes <strong>the</strong> plotted uds events. The sample used for<br />

<strong>the</strong> distribution is 10000 e + e − → Z/γ → q¯q events, with natural branching ratios, at 91.2<br />

GeV centre <strong>of</strong> mass energy.<br />

Impact Parameter Significance Distribution<br />

It follows from <strong>the</strong> definition <strong>of</strong> <strong>the</strong> joint probability P J , that it depends heavily on <strong>the</strong><br />

distribution <strong>of</strong> impact parameter significances <strong>of</strong> IP tracks. The function f(x) that approximates<br />

this distribution is determined from <strong>the</strong> data as follows: for IP tracks, <strong>the</strong> impact<br />

parameter significance distributions are symmetric. The shape <strong>of</strong> <strong>the</strong> distribution <strong>of</strong> positive<br />

impact parameters, used to find P J , can <strong>the</strong>refore be determined by fitting <strong>the</strong> distribution


5.3 Flavour Discriminating Variables 82<br />

<strong>of</strong> absolute values for tracks with negative impact parameter, corresponding to a very pure<br />

IP track sample. A negative impact parameter is defined as an impact parameter that is in<br />

<strong>the</strong> opposite hemisphere with respect to <strong>the</strong> jet direction. The distribution is approximated<br />

by fitting <strong>the</strong> sum <strong>of</strong> a Gaussian <strong>and</strong> two exponentials, i.e. by finding parameters p f0 , . . .,<br />

p f6 such that:<br />

f(x) = p f0 × exp(−0.5( x − p f1<br />

p f2<br />

) 2 ) + p f4 exp(p f3 x) + p f6 exp(p f5 x) (5.9)<br />

best describes <strong>the</strong> measured distribution. Within this model <strong>the</strong> integral ∫ ∞<br />

b/σ b<br />

f(x)dx can be<br />

written as:<br />

∫ ∞<br />

b/σ b<br />

f(x)dx ≈<br />

(∫<br />

2<br />

∞<br />

∫ ∞<br />

)<br />

√ × exp (−r 2 )dr −<br />

exp (−r 2 )dr<br />

π (b/σ b )/( √ 2p I0 )<br />

(b/σ b ) √ cut/( 2p I0 )<br />

+(p I1 exp (−p I2 )(b/σ b ) − p I1 exp (−p I2 )(b/σ b ) cut )<br />

+(p I3 exp (−p I4 )(b/σ b ) − p I1 exp (−p I4 )(b/σ b ) cut ) (5.10)<br />

where <strong>the</strong> integral over <strong>the</strong> Gaussian <strong>and</strong> <strong>the</strong> exponential functions is cut <strong>of</strong>f at (b/σ b ) cut .<br />

These are also <strong>the</strong> user definable significance cuts that have been mentioned in <strong>the</strong> previous<br />

section. When defining <strong>the</strong> parameters <strong>the</strong> following substitutions are used:<br />

p I0 = p f2<br />

p I1 = − 2 √ π<br />

× p f3 /(p f0 p f2 p f4 )<br />

p I2 = −p f4<br />

p I3 = − 2 √ π<br />

× p f5 /(p f0 p f2 p f6 )<br />

p I4 = −p f6 (5.11)<br />

As it can be seen in <strong>the</strong>se equations <strong>the</strong> parameter p f1 disappears. The assumption that<br />

<strong>the</strong> mean <strong>of</strong> <strong>the</strong> Gaussian is equal to zero has been applied. This should be a fair assumption<br />

for tracks that originate from <strong>the</strong> IP. The values calculated for <strong>the</strong> detector geometry<br />

LDCPrime 02Sc which has been used in this chapter are displayed in table 5.3.


5.3 Flavour Discriminating Variables 83<br />

Joint Probability Resolution Parameters<br />

Parameter R-φ projection R-z projection<br />

p I0 0.843 0.911<br />

p I1 0.365 0.306<br />

p I2 0.620 0.423<br />

p I3 0.150 0.139<br />

p I4 0.029 0.028<br />

Table 5.3: Parameters used in <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> joint probability in R-φ <strong>and</strong> in R-z,<br />

respectively, obtained from a fit <strong>of</strong> negative impact parameter distributions.<br />

Fig. 5.7 instead displays <strong>the</strong> automatically generated visual output that is shown from <strong>the</strong><br />

program that calculates <strong>the</strong> parameters <strong>of</strong> <strong>the</strong> impact parameter significance distributions.<br />

The impact parameter distribution can be seen, as well as <strong>the</strong> Gaussian component <strong>of</strong> <strong>the</strong><br />

fit, for both <strong>the</strong> d 0 <strong>and</strong> <strong>the</strong> z 0 impact parameters. The fit is a preliminary fit, using only a<br />

simple Gaussian, that is used to seed <strong>the</strong> parameters for <strong>the</strong> final fit.<br />

(a)<br />

(b)<br />

Figure 5.7: Graphical output <strong>of</strong> <strong>the</strong> program fitting <strong>the</strong> impact parameter distributions.<br />

Both <strong>the</strong> distribution <strong>and</strong> <strong>the</strong> preliminary Gaussian fit can be seen for <strong>the</strong>: a) d 0 impact<br />

parameter significance <strong>and</strong> b) z 0 impact parameter significance.<br />

5.3.2 Variables for Jets with Secondary Vertices<br />

In <strong>the</strong> case that more than one vertex is found, observables derived from <strong>the</strong>se additional<br />

vertices provide more powerful means to distinguish between b, c <strong>and</strong> light quark jets. The


5.3 Flavour Discriminating Variables 84<br />

following set <strong>of</strong> eight variables is used in such cases:<br />

• The decay length <strong>and</strong> decay length significance <strong>of</strong> <strong>the</strong> vertex with <strong>the</strong> largest decay<br />

length significance in three dimensions with respect to <strong>the</strong> IP;<br />

• The total momentum |p| <strong>of</strong> <strong>the</strong> set <strong>of</strong> tracks assigned to <strong>the</strong> non-primary vertices after<br />

<strong>the</strong> track attachment has been performed (see below);<br />

• The p T corrected vertex mass, calculated as described below;<br />

• The number N trk,vtx <strong>of</strong> tracks in all non-primary vertices;<br />

• The secondary vertex probability <strong>of</strong> <strong>the</strong> tracks assigned to <strong>the</strong> non primary vertices<br />

after <strong>the</strong> track attachment has been performed: a new vertex fit is performed using<br />

<strong>the</strong>se tracks <strong>and</strong> <strong>the</strong> probability that <strong>the</strong>y all originate from <strong>the</strong> same secondary vertex<br />

is calculated from <strong>the</strong> fit’s χ 2 ;<br />

• The already described joint probability variables.<br />

In addition to <strong>the</strong> general track selection used for <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> flavour tag variables,<br />

a special track selection is applied for <strong>the</strong> variables M PT<br />

, |p| <strong>and</strong> <strong>the</strong> secondary vertex<br />

probability. The seed vertex, i.e. <strong>the</strong> vertex fur<strong>the</strong>st away from <strong>the</strong> IP, is used to define<br />

<strong>the</strong> distances D between this vertex <strong>and</strong> <strong>the</strong> IP, <strong>and</strong> L as shown in fig. 5.8 The vertex<br />

axis is <strong>the</strong> straight line connecting <strong>the</strong> seed vertex with <strong>the</strong> IP. For each track, <strong>the</strong> point <strong>of</strong><br />

closest approach to <strong>the</strong> vertex axis is projected onto <strong>the</strong> vertex axis <strong>and</strong> L defined as <strong>the</strong><br />

distance <strong>of</strong> <strong>the</strong> resulting point on <strong>the</strong> vertex axis from <strong>the</strong> IP. Tracks with 0.18 < L/D <<br />

2.5 <strong>and</strong> a transverse distance T <strong>of</strong> <strong>the</strong> point <strong>of</strong> closest approach from <strong>the</strong> vertex axis below<br />

1.0 mm are attached to <strong>the</strong> decay chain. The real purpose <strong>of</strong> this process is to include<br />

tracks from possible one prong tertiary vertices <strong>and</strong> hence increase <strong>the</strong> precision <strong>of</strong> <strong>the</strong> mass<br />

calculation. A choice now has to be made with respect to whe<strong>the</strong>r <strong>the</strong> tracks from <strong>the</strong><br />

secondary vertices are automatically included. If <strong>the</strong>y are included <strong>the</strong>n we automatically<br />

select all tracks originating from <strong>the</strong> B <strong>and</strong> <strong>the</strong> subsequent D meson decay. Clearly this<br />

is <strong>the</strong> ideal case assuming perfect track assignment <strong>and</strong> that <strong>the</strong>re are no fake vertices or


5.3 Flavour Discriminating Variables 85<br />

vertices deriving from o<strong>the</strong>r decays. These however should already have been rejected by <strong>the</strong><br />

conversion tagger. Therefore in <strong>the</strong> presented algorithm all tracks from secondary vertices<br />

have been included regardless on whe<strong>the</strong>r <strong>the</strong> tracks have or have not passed <strong>the</strong> additional<br />

selection criteria. In <strong>the</strong> code <strong>the</strong> possibility not to automatically include <strong>the</strong>m also exists.<br />

The difference in <strong>the</strong> result between <strong>the</strong>se two selection processes is however minimal.<br />

Track<br />

Vertex Axis<br />

Secondary<br />

T<br />

L<br />

D<br />

IP<br />

Beam Axis<br />

Figure 5.8: Schematic diagram showing <strong>the</strong> definition <strong>of</strong> distances L, D <strong>and</strong> T used in <strong>the</strong><br />

track attachment algorithm.<br />

The momentum |p| is <strong>the</strong> modulus <strong>of</strong> <strong>the</strong> vector sum <strong>of</strong> all decay chain track momenta.<br />

The secondary vertex probability is found by fitting a common vertex to <strong>the</strong>se tracks <strong>and</strong><br />

calculating <strong>the</strong> probability from <strong>the</strong> χ 2 value <strong>of</strong> this fit in <strong>the</strong> same way as for <strong>the</strong> ZVKIN<br />

vertex finder, see <strong>Section</strong> 3.2. For <strong>the</strong> secondary vertex probability <strong>the</strong> number <strong>of</strong> tracks in<br />

<strong>the</strong> decay chain is required to exceed <strong>the</strong> value N trks,min <strong>and</strong> <strong>the</strong> normalized χ 2 is required to<br />

be below a user-settable value. For jets that do not meet <strong>the</strong>se requirements, <strong>the</strong> probability<br />

is set to 0, to lower <strong>the</strong> risk <strong>of</strong> such jets leaking into <strong>the</strong> heavy flavour samples. For <strong>the</strong><br />

calculation <strong>of</strong> <strong>the</strong> p T corrected vertex mass M Pt , first <strong>the</strong> vertex mass before correction M V tx ,


5.3 Flavour Discriminating Variables 86<br />

is obtained from <strong>the</strong> decay chain tracks, assigning a pion mass to each track. With θ V tx <strong>the</strong><br />

angle between <strong>the</strong> seed vertex axis as given by <strong>the</strong> vertex position <strong>and</strong> <strong>the</strong> vertex momentum<br />

with respect to <strong>the</strong> IP, it is required that p 2 V tx (1−cos2 θ V tx ) ≤ w PT ,maxMV 2 tx , where <strong>the</strong> factor<br />

w PT ,max is a user-defined LCFI Vertex parameter <strong>of</strong> default value 3. This cut ensures that<br />

cases in which both θ V tx <strong>and</strong> |p| are large are excluded from <strong>the</strong> correction procedure to<br />

reduce <strong>the</strong> risk <strong>of</strong> fake vertices being assigned a large correction <strong>and</strong> subsequently affecting<br />

<strong>the</strong> flavour tag. Jets failing this cut are assigned an M Pt value <strong>of</strong> 0. A conservative estimate<br />

<strong>of</strong> <strong>the</strong> transverse momentum p V tx<br />

T<br />

corresponding to θ V tx <strong>and</strong> taking <strong>the</strong> error matrices <strong>of</strong> <strong>the</strong><br />

seed axis <strong>and</strong> <strong>the</strong> IP into account, is obtained by iteratively minimizing <strong>the</strong> correction term<br />

p V tx<br />

T<br />

<strong>and</strong> recalculating <strong>the</strong> seed axis direction. For this minimization, <strong>the</strong> parameter N σ,max<br />

determines <strong>the</strong> permitted extent <strong>of</strong> <strong>the</strong> seed axis correction in units <strong>of</strong> its uncertainty. The<br />

p T corrected vertex mass M Pt is <strong>the</strong>n defined as:<br />

M Pt =<br />

√<br />

M 2 V tx<br />

tx<br />

+ |pV<br />

T<br />

| 2 + |p V T tx |. (5.12)<br />

Finally, it is required that <strong>the</strong> correction does not exceed <strong>the</strong> uncorrected value by a large<br />

factor, M Pt ≤ w corr,max × M V tx , where w corr,max is a code parameter.<br />

Figure 5.9 shows <strong>the</strong> flavour tag input variables used for jets for which more than one<br />

vertex was found, for b, c <strong>and</strong> light quark jets separately. Please note that <strong>the</strong> distributions<br />

plotted are cumulative; hence <strong>the</strong> b distribution includes <strong>the</strong> plotted c <strong>and</strong> uds events <strong>and</strong><br />

<strong>the</strong> c distribution includes <strong>the</strong> plotted uds events. The sample used for <strong>the</strong> distribution is<br />

10000 e + e − → Z/γ → q¯q events, with natural branching ratios, at 91.2 GeV centre <strong>of</strong> mass<br />

energy. Some <strong>of</strong> <strong>the</strong>se variables, such as M Pt , already provide a very good separation <strong>of</strong> <strong>the</strong><br />

different jet flavours on <strong>the</strong>ir own, with <strong>the</strong> correlations between <strong>the</strong> observables, exploited<br />

by <strong>the</strong> neural network approach, fur<strong>the</strong>r improving <strong>the</strong> tagging performance.<br />

Finally please note that, as <strong>the</strong> momenta <strong>of</strong> <strong>the</strong> most <strong>and</strong> second-most significant track<br />

(presented in <strong>the</strong> previous section), <strong>the</strong> decay length significance <strong>and</strong> <strong>the</strong> seed vertex momentum<br />

depend on <strong>the</strong> energy <strong>of</strong> <strong>the</strong> input jet, in order to be able to use <strong>the</strong> neural networks<br />

that are trained with jets from <strong>the</strong> e + e → Z/γ → q¯q events at <strong>the</strong> Z resonance for arbitrary


5.3 Flavour Discriminating Variables 87<br />

energy, <strong>the</strong>se observables are normalized to <strong>the</strong> jet energy before being fed into <strong>the</strong> neural<br />

networks described in <strong>the</strong> next section.


5.3 Flavour Discriminating Variables 88<br />

N jets<br />

300<br />

N jets<br />

3<br />

10<br />

250<br />

200<br />

2<br />

10<br />

150<br />

100<br />

10<br />

50<br />

0<br />

0 1 2 3 4 5 6 7 8 9 10<br />

1<br />

0 20 40 60 80 100 120 140 160 180 200<br />

decay length / mm<br />

decay length significance<br />

(a)<br />

(b)<br />

N jets<br />

500<br />

N jets<br />

400<br />

400<br />

350<br />

300<br />

300<br />

250<br />

200<br />

200<br />

150<br />

100<br />

100<br />

50<br />

0<br />

0 5 10 15 20 25 30 35 40 45 50<br />

0<br />

0 1 2 3 4 5 6 7<br />

raw momentum |p| / GeV<br />

M Pt<br />

/ GeV<br />

(c)<br />

(d)<br />

N jets<br />

2400<br />

2200<br />

2000<br />

1800<br />

1600<br />

1400<br />

1200<br />

1000<br />

800<br />

600<br />

400<br />

200<br />

0<br />

0 5 10 15 20<br />

N jets<br />

3<br />

10<br />

b-jets<br />

2<br />

10<br />

c-jets<br />

u,d,s-jets<br />

10<br />

1<br />

-10 -9 -8 -7 -6 -5 -4 -3 -2 -1 0<br />

number <strong>of</strong> non-prim. vtx tracks<br />

log secondary vertex probability<br />

(e)<br />

(f)<br />

Figure 5.9: Flavour discriminating variables used when two or more vertices have been<br />

reconstructed: a) <strong>the</strong> decay length <strong>of</strong> <strong>the</strong> seed vertex, b) <strong>the</strong> decay length significance, c) <strong>the</strong><br />

momentum <strong>of</strong> <strong>the</strong> seed vertex, d) <strong>the</strong> transverse momentum corrected vertex mass, e) <strong>the</strong><br />

number <strong>of</strong> tracks found in all secondary vertices, f) <strong>the</strong> probability that all tracks present in<br />

secondary vertices originate from <strong>the</strong> same secondary vertex.


5.4 Neural Network Combination 89<br />

5.4 Neural Network Combination<br />

For heavy flavour tagging, that is <strong>the</strong> identification <strong>of</strong> bottom <strong>and</strong> charm jets, neural networks<br />

are trained such that <strong>the</strong> target output provided in <strong>the</strong> training phase is 1 for signal jets <strong>and</strong><br />

0 for background. The output value <strong>of</strong> a trained network will be <strong>the</strong> closer to 1 <strong>the</strong> more<br />

signal-like <strong>the</strong> values <strong>of</strong> <strong>the</strong> input observables. By default, each <strong>of</strong> <strong>the</strong> networks has 8 input<br />

nodes, one hidden layer <strong>of</strong> 14 tan-sigmoid nodes <strong>and</strong> one output node <strong>and</strong> is trained using<br />

<strong>the</strong> conjugate gradient backpropagation algorithm. As explained in <strong>the</strong> previous section, <strong>the</strong><br />

flavour tag is based on different observables for jets with one <strong>and</strong> for jets with two or more<br />

found vertices. Fur<strong>the</strong>rmore, for a given jet flavour, <strong>the</strong> distributions <strong>of</strong> sensitive variables<br />

are significantly different for jets with two <strong>and</strong> jets with three or more vertices, so <strong>the</strong> ability<br />

to distinguish between b <strong>and</strong> c jets is enhanced by treating <strong>the</strong>se two cases separately. For<br />

each <strong>of</strong> <strong>the</strong>se categories <strong>of</strong> one, two or at least three vertices, three networks are trained, so<br />

a complete set consists <strong>of</strong> nine networks altoge<strong>the</strong>r. For b nets, <strong>the</strong> signal provided in <strong>the</strong><br />

training phase consists <strong>of</strong> b jets while c <strong>and</strong> light flavour jets form <strong>the</strong> background. The c<br />

nets are trained with c jets as signal <strong>and</strong> b <strong>and</strong> light flavour jets as background. As for some<br />

physics processes <strong>the</strong> background for <strong>the</strong> identification <strong>of</strong> c jets is known to consist almost<br />

exclusively <strong>of</strong> b jets, <strong>and</strong> charm jets are easier to distinguish from <strong>the</strong>se than from light<br />

flavour jets, dedicated networks are provided for <strong>the</strong>se cases. The networks are <strong>the</strong>refore<br />

only presented with b jets as background in <strong>the</strong> training run.<br />

It has to be noted that <strong>the</strong> LCFI Vertex s<strong>of</strong>tware is very flexible, permitting <strong>the</strong> use <strong>of</strong><br />

different input variables, network architectures, node types, transfer functions <strong>and</strong> training<br />

algorithms. In fact we will see in <strong>the</strong> next sections how this flexibility has been used in order<br />

to adapt <strong>the</strong> LCFI Vertex s<strong>of</strong>tware to <strong>the</strong> needs <strong>of</strong> <strong>the</strong> SiD detector.


5.4 Neural Network Combination 90<br />

N jets , 1 vertex<br />

3<br />

10<br />

2<br />

10<br />

N jets , 2 vertices<br />

3<br />

10<br />

b-jets<br />

2<br />

10<br />

c-jets<br />

u,d,s-jets<br />

10<br />

10<br />

1<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

c-tag neural net output<br />

c-tag neural net output<br />

(a)<br />

(b)<br />

N jets , >= 3 vertices<br />

3<br />

10<br />

2<br />

10<br />

N jets<br />

3<br />

10<br />

10<br />

2<br />

10<br />

1<br />

10<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

c-tag neural net output<br />

c-tag neural net output<br />

(c)<br />

(d)<br />

Figure 5.10: Output <strong>of</strong> <strong>the</strong> neural networks used for charm tagging. The plots show <strong>the</strong><br />

outputs for <strong>the</strong> three separate networks used in case a) one, b) two <strong>and</strong> c) three or more<br />

vertices are found in <strong>the</strong> input jet. In d), <strong>the</strong> resulting combined distribution for arbitrary<br />

number <strong>of</strong> vertices is shown. Please note that <strong>the</strong> distributions plotted are cumulative; hence<br />

<strong>the</strong> b distribution includes <strong>the</strong> plotted c <strong>and</strong> uds events <strong>and</strong> <strong>the</strong> c distribution includes <strong>the</strong><br />

plotted uds events.


5.4 Neural Network Combination 91<br />

Fig. 5.10 shows <strong>the</strong> distribution <strong>of</strong> <strong>the</strong> output variables <strong>of</strong> neural networks used for<br />

tagging charm jets (c nets) separately for <strong>the</strong> cases <strong>of</strong> one, two <strong>and</strong> at least three vertices, <strong>and</strong><br />

<strong>the</strong> combined distribution for an arbitrary number <strong>of</strong> vertices. The calculation is performed<br />

on a dijet sample at √ s = 91.2 GeV. The most straightforward way <strong>of</strong> using <strong>the</strong> charm<br />

tag (i.e. c net output) in an analysis is to require one or more jets in an event to have<br />

a charm tag exceeding a certain cut value, chosen as appropriate for <strong>the</strong> specific analysis.<br />

The resulting performance on a jet by jet basis is discussed in <strong>the</strong> next section. In some<br />

occasions event selection can also be improved by using information from both <strong>the</strong> charm<br />

<strong>and</strong> <strong>the</strong> bottom tags. This can, for example, be achieved by plotting charm versus bottom<br />

tag, as shown in fig. 5.11 for bottom (fig. 5.11(a)), charm (fig. 5.11(b)) <strong>and</strong> light flavour<br />

(fig. 5.11(c)) jets from <strong>the</strong> two jet Z peak sample, <strong>and</strong> placing a cut on <strong>the</strong> resulting two<br />

dimensional distribution. Note that in <strong>the</strong> two-dimensional distribution for c jets, <strong>the</strong> peak<br />

near (b-tag = 0, c-tag = 0) stems mainly from jets in which only <strong>the</strong> primary vertex was<br />

reconstructed, while <strong>the</strong> peaks near (0, 1) is usually due to jets in which secondary vertices<br />

were also found.


5.4 Neural Network Combination 92<br />

b-jets<br />

c-jets<br />

N jets<br />

1600<br />

1400<br />

1200<br />

1000<br />

800<br />

600<br />

400<br />

200<br />

0<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

c-tag neural net output<br />

0 0<br />

0.2<br />

0.4<br />

0.6<br />

0.8<br />

1<br />

b-tag neural net output<br />

N jets<br />

120<br />

100<br />

80<br />

60<br />

40<br />

20<br />

0<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

c-tag neural net output<br />

0 0<br />

0.2<br />

0.4<br />

0.6<br />

0.8<br />

1<br />

b-tag neural net output<br />

(a)<br />

(b)<br />

uds-jets<br />

N jets<br />

1200<br />

1000<br />

800<br />

600<br />

400<br />

200<br />

0<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

c-tag neural net output<br />

0 0<br />

0.2<br />

0.4<br />

0.6<br />

0.8<br />

1<br />

b-tag neural net output<br />

(c)<br />

Figure 5.11: Charm tag vs. bottom tag for input samples consisting purely <strong>of</strong> a) bottom<br />

jets, b) charm jets <strong>and</strong> c) light quark jets.


5.5 Performance <strong>of</strong> Flavour Tagging 93<br />

5.5 Performance <strong>of</strong> Flavour Tagging<br />

As a measure <strong>of</strong> <strong>the</strong> flavour-tagging performance, <strong>the</strong> purity <strong>of</strong> selecting bottom <strong>and</strong> charm<br />

jets is studied as function <strong>of</strong> <strong>the</strong> efficiency. Fig. 5.12 shows purity vs. efficiency for <strong>the</strong><br />

three tags provided by <strong>the</strong> LCFI Vertex package, for <strong>the</strong> two jet sample at √ s = 91.2 GeV<br />

<strong>and</strong> at √ s = 500 GeV. A flavour composition <strong>of</strong> approximately 22% (15%) <strong>of</strong> bottom, 17%<br />

(25%) <strong>of</strong> charm <strong>and</strong> about 61% (60%) <strong>of</strong> light flavour jets at √ s = 91.2 GeV (500 GeV) is<br />

assumed. The plot is obtained by varying a simple cut on <strong>the</strong> neural net output variable <strong>and</strong><br />

calculating purity <strong>and</strong> efficiency at each cut value. At <strong>the</strong> Z resonance, for <strong>the</strong> b-tag a very<br />

pure sample, containing 92% b jets, can be selected at an efficiency <strong>of</strong> 70%. In comparison,<br />

high c-tag purities can only be achieved at lower efficiencies, mainly because light flavour jets<br />

leak into <strong>the</strong> sample. Because <strong>of</strong> <strong>the</strong>se reasons <strong>the</strong> c-tag with all backgrounds included has<br />

been found to be <strong>the</strong> most sensitive <strong>of</strong> <strong>the</strong> tags whenever changing <strong>the</strong> conditions used in <strong>the</strong><br />

study, such as using different tracking algorithms, code parameters <strong>and</strong> detector geometries.<br />

At 500 GeV, <strong>the</strong> b-tag performance is degraded with respect to that at <strong>the</strong> Z resonance,<br />

while c-tag purity is very similar at both energies <strong>and</strong> <strong>the</strong> purity <strong>of</strong> <strong>the</strong> c-tag with only b<br />

background is slightly improved at <strong>the</strong> higher energy. However it is clear that part <strong>of</strong> <strong>the</strong><br />

difference in <strong>the</strong> performance is explained from <strong>the</strong> different quark compositon <strong>of</strong> <strong>the</strong> sample.<br />

The decrease <strong>of</strong> <strong>the</strong> performance in tagging b jets is <strong>the</strong>refore at least partially driven from<br />

<strong>the</strong> lower number <strong>of</strong> b quarks (signal) <strong>and</strong> <strong>the</strong> higher number <strong>of</strong> light quarks (background)<br />

in <strong>the</strong> sample composition.<br />

A way <strong>of</strong> studying tagging performance that is independent <strong>of</strong> <strong>the</strong> sample composition<br />

is to look at <strong>the</strong> efficiency <strong>of</strong> selecting each <strong>of</strong> <strong>the</strong> wrong flavours when cutting on one <strong>of</strong> <strong>the</strong><br />

tagging variables, i.e. <strong>the</strong> wrong flavour efficiencies or mistags. Figures 5.13(a) <strong>and</strong> 5.13(b)<br />

show <strong>the</strong> c <strong>and</strong> light quark jet efficiencies as a function <strong>of</strong> <strong>the</strong> b-tag efficiency <strong>and</strong> <strong>the</strong> b<br />

<strong>and</strong> light flavour jet efficiencies vs. <strong>the</strong> c-tag efficiency. It is clearly seen that <strong>the</strong> main<br />

background to <strong>the</strong> b-tag is due to c jets. For <strong>the</strong> c-tag <strong>the</strong> main background, at efficiencies<br />

above about 75%, is due to light flavour jets, while at lower efficiencies it is dominated by<br />

misidentified b jets, as it would be expected from <strong>the</strong> comparison <strong>of</strong> <strong>the</strong> purities for c-tag


5.5 Performance <strong>of</strong> Flavour Tagging 94<br />

purity<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

b c bc<br />

MARLIN, LDCPrime_02Sc, 90 GeV<br />

MARLIN, LDCPrime_02Sc, 500 GeV<br />

0<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

efficiency<br />

Figure 5.12: Comparison <strong>of</strong> tagging performance at <strong>the</strong> Z resonance <strong>and</strong> at √ s = 500 GeV.<br />

Tagging purity is plotted as function <strong>of</strong> efficiency for b jets <strong>and</strong> c jets. Performance for c<br />

jets assuming only b background (labelled bc) is also shown.<br />

<strong>and</strong> c-tag with b background only.<br />

By using <strong>the</strong> same methods it is also possible to analyse <strong>the</strong> effect <strong>of</strong> <strong>the</strong> K s , Λ <strong>and</strong><br />

conversion tagger on <strong>the</strong> flavour tagging performance. The analysis displayed in Fig. 5.14<br />

does just that. The plot shows how <strong>the</strong> performance <strong>of</strong> <strong>the</strong> b quark tagging can be improved<br />

only minimally by using <strong>the</strong> implemented track removal algorithm or as a matter <strong>of</strong> fact any<br />

o<strong>the</strong>r similar algorithm. In fact given a certain efficiency <strong>the</strong> algorithm performs at worst<br />

a few percentage points worse in terms <strong>of</strong> purity than if all <strong>the</strong> reconstructable Ks, Λ <strong>and</strong><br />

conversion tracks are removed by using MC information. The same consideration applies<br />

also for <strong>the</strong> c quark tagging when only <strong>the</strong> b quark background is present. Differently a<br />

substantial improvement can be seen by using <strong>the</strong> conversion tagger on <strong>the</strong> c quark.<br />

It is at this point also interesting to study <strong>the</strong> relative importance <strong>of</strong> <strong>the</strong> input variables<br />

on <strong>the</strong> flavour tag results. The different contribution to every neural network has been<br />

determined using an estimator from <strong>the</strong> Toolkit for Multivariate Data Analysis with ROOT


5.5 Performance <strong>of</strong> Flavour Tagging 95<br />

wrong flavour efficiency<br />

1<br />

-1<br />

10<br />

-2<br />

10<br />

wrong flavour efficiency<br />

1<br />

-1<br />

10<br />

-2<br />

10<br />

c jet leakage<br />

u,d,s jet leakage<br />

-3<br />

10<br />

b jet leakage<br />

u,d,s jet leakage<br />

-3<br />

10<br />

-4<br />

10<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

c-tag efficiency<br />

(a)<br />

-4<br />

10<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

b-tag efficiency<br />

(b)<br />

Figure 5.13: Efficiencies for selecting jets with <strong>the</strong> wrong flavour when tagging a) bottom<br />

jets <strong>and</strong> b) charm jets.<br />

(TMVA) [59]. Following this approach, <strong>the</strong> input importance <strong>of</strong> a variable i is defined as:<br />

I i = ¯x 2 i<br />

∑n h<br />

j=1<br />

(w ij ) j = 1,...,n var (5.13)<br />

where ¯x i is <strong>the</strong> average <strong>of</strong> <strong>the</strong> values <strong>of</strong> variable i in <strong>the</strong> input sample <strong>and</strong> <strong>the</strong> sum extends<br />

over <strong>the</strong> weights w ij corresponding to <strong>the</strong> connections <strong>of</strong> <strong>the</strong> neural network node <strong>of</strong> variable<br />

i with <strong>the</strong> n h nodes in <strong>the</strong> adjacent network layer. The calculation was implemented as part<br />

<strong>of</strong> <strong>the</strong> neural network code provided with LCFI Vertex. Tables 5.4, 5.5, 5.6 <strong>and</strong> 5.7, 5.8, 5.9<br />

summaries <strong>the</strong> results obtained at <strong>the</strong> Z resonance <strong>and</strong> at 500 GeV, respectively, where for<br />

each neural network, <strong>the</strong> values I i are normalized to <strong>the</strong> maximum value I max .<br />

Some quite significant differences can be seen in <strong>the</strong> importance <strong>of</strong> <strong>the</strong> inputs for <strong>the</strong><br />

91.2 GeV <strong>and</strong> <strong>the</strong> 500 GeV case. In particular, as expected due to <strong>the</strong> increased boost <strong>of</strong><br />

<strong>the</strong> particles, <strong>the</strong> decay length <strong>of</strong> <strong>the</strong> vertices gains in importance. In <strong>the</strong> case that only<br />

<strong>the</strong> primary vertex is found <strong>the</strong> most striking difference is <strong>the</strong> increased importance <strong>of</strong> <strong>the</strong><br />

impact parameter significances, at <strong>the</strong> expense <strong>of</strong> <strong>the</strong> joint probability. This can derive from<br />

an increased discriminatory power <strong>of</strong> <strong>the</strong>se variable due to an increase boost <strong>of</strong> <strong>the</strong> jets. This<br />

seems however unlikely because <strong>the</strong> track significances are also inputs to <strong>the</strong> joint probability.


5.5 Performance <strong>of</strong> Flavour Tagging 96<br />

purity<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

b c bc<br />

no V0 finding in vertex detector<br />

ConversionTagger V0 finding<br />

Cheater: all reconstructable V0s masked<br />

0<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

efficiency<br />

Figure 5.14: Comparison <strong>of</strong> tagging with <strong>and</strong> without K s , Λ <strong>and</strong> conversion tagging <strong>and</strong><br />

performance obtained when using MC information at √ s=91.2 GeV<br />

Alternatively it can derive from decreased performance <strong>of</strong> <strong>the</strong> joint probability. This seems<br />

much more likely as <strong>the</strong> negative impact parameter distribution has been calculated only for<br />

91.2 GeV jets. If this is <strong>the</strong> case an energy renormalization <strong>of</strong> <strong>the</strong>se parameters should be<br />

designed <strong>and</strong> <strong>the</strong> performance <strong>of</strong> <strong>the</strong> tagging at 500 GeV can <strong>the</strong>refore be improved.


5.5 Performance <strong>of</strong> Flavour Tagging 97<br />

Z mass resonance - 1 vertex found by ZVTOP<br />

variable b-tag c-tag c-tag(b bgd.)<br />

most signif. track d 0 /σ(d 0 ) 0.002 ± 0.004 0.005 ± 0.008 0.000 ± 0.000<br />

2 nd -most signif. track d 0 /σ(d 0 ) 0.058 ± 0.016 0.009 ± 0.007 0.026 ± 0.010<br />

most signif. track z 0 /σ(z 0 ) 0.001 ± 0.001 0.003 ± 0.003 0.002 ± 0.001<br />

2 nd -most signif. track z 0 /σ(z 0 ) 0.115 ± 0.030 0.005 ± 0.001 0.066 ± 0.017<br />

most signif. track |p trk | 0.113 ± 0.275 0.003 ± 0.015 0.043 ± 0.002<br />

2 nd -most signif. track |p trk | 0.062 ± 0.003 0.068 ± 0.006 0.048 ± 0.003<br />

joint probability R-φ 1 0.676 ± 0.031 0.887 ± 0.041<br />

joint probability z 0.922 ± 0.037 1 1<br />

Table 5.4: Relative importance I i /I max <strong>of</strong> variables used as inputs for flavour tag neural<br />

nets at <strong>the</strong> Z resonance in <strong>the</strong> case only <strong>the</strong> IP vertex has been found.<br />

Z mass resonance - 2 vertices found by ZVTOP<br />

variable b-tag c-tag c-tag(b bgd.)<br />

M Pt 1 1 1<br />

|p| 0.098 ± 0.007 0.404 ± 0.029 0.114 ± 0.008<br />

decay length 0.182 ± 0.013 0.990 ± 0.072 0.139 ± 0.010<br />

decay length significance 0.070 ± 0.008 0.187 ± 0.022 0.115 ± 0.013<br />

N trk,vtx 0.063 ± 0.004 0.162 ± 0.009 0.081 ± 0.004<br />

secondary vertex probability 0.230 ± 0.017 0.212 ± 0.016 0.124 ± 0.010<br />

joint probability R-φ 0.040 ± 0.004 0.071 ± 0.007 0.052 ± 0.005<br />

joint probability z 0.061 ± 0.008 0.159 ± 0.021 0.052 ± 0.007<br />

Table 5.5: Relative importance I i /I max <strong>of</strong> variables used as inputs for flavour tag neural<br />

nets at <strong>the</strong> Z resonance in <strong>the</strong> case only one secondary vertex has been found.<br />

Z mass resonance - 3 or more vertices found by ZVTOP<br />

variable b-tag c-tag c-tag(b bgd.)<br />

M Pt 1 1 1<br />

|p| 0.490 ± 0.076 0.960 ± 0.148 0.580 ± 0.089<br />

decay length 0.550 ± 0.119 0.338 ± 0.072 0.370 ± 0.080<br />

decay length significance 0.810 ± 0.018 0.106 ± 0.023 0.078 ± 0.017<br />

N trk,vtx 0.665 ± 0.089 0.811 ± 0.108 0.829 ± 0.111<br />

secondary vertex probability 0.087 ± 0.045 0.112 ± 0.057 0.066 ± 0.034<br />

joint probability R-φ 0.006 ± 0.002 0.012 ± 0.005 0.007 ± 0.003<br />

joint probability z 0.007 ± 0.003 0.017 ± 0.007 0.009 ± 0.003<br />

Table 5.6: Relative importance I i /I max <strong>of</strong> variables used as inputs for flavour tag neural<br />

nets at <strong>the</strong> Z resonance in <strong>the</strong> case two or more secondary vertices have been found.


5.5 Performance <strong>of</strong> Flavour Tagging 98<br />

500GeV - 1 vertex found by ZVTOP<br />

variable b-tag c-tag c-tag(b bgd.)<br />

most signif. track d 0 /σ(d 0 ) 0.363 ± 0.084 1 0.063 ± 0.014<br />

2 nd -most signif. track d 0 /σ(d 0 ) 0.499 ± 0.079 0.072 ± 0.020 0.429 ± 0.067<br />

most signif. track z 0 /σ(z 0 ) 0.036 ± 0.008 0.166 ± 0.053 0.148 ± 0.033<br />

2 nd -most signif. track z 0 /σ(z 0 ) 1 0.057 ± 0.016 1<br />

most signif. track |p trk | 0.019 ± 0.004 0.057 ± 0.019 0.013 ± 0.003<br />

2 nd -most signif. track |p trk | 0.009 ± 0.002 0.013 ± 0.005 0.012 ± 0.003<br />

joint probability R-φ 0.364 ± 0.066 0.304 ± 0.091 0.564 ± 0.102<br />

joint probability z 0.363 ± 0.061 0.487 ± 0.142 0.688 ± 0.116<br />

Table 5.7: Relative importance I i /I max <strong>of</strong> variables used as inputs for flavour tag neural<br />

nets at 500 GeV in <strong>the</strong> case only <strong>the</strong> IP vertex has been found.<br />

500GeV - 2 vertices found by ZVTOP<br />

variable b-tag c-tag c-tag(b bgd.)<br />

M Pt 1 0.438 ± 0.116 1<br />

|p| 0.071 ± 0.021 0.128 ± 0.034 0.082 ± 0.024<br />

decay length 0.420 ± 0.119 1 0.320 ± 0.091<br />

decay length significance 0.011 ± 0.006 0.013 ± 0.007 0.019 ± 0.011<br />

N trk,vtx 0.054 ± 0.014 0.061 ± 0.013 0.070 ± 0.018<br />

secondary vertex probability 0.262 ± 0.078 0.106 ± 0.028 0.141 ± 0.042<br />

joint probability R-φ 0.083 ± 0.026 0.064 ± 0.018 0.108 ± 0.034<br />

joint probability z 0.147 ± 0.057 0.168 ± 0.061 0.125 ± 0.049<br />

Table 5.8: Relative importance I i /I max <strong>of</strong> variables used as inputs for flavour tag neural<br />

nets at 500GeV in <strong>the</strong> case only one secondary vertex has been found.<br />

500GeV - 3 or more vertices found by ZVTOP<br />

variable b-tag c-tag c-tag(b bgd.)<br />

M Pt 0.500 ± 0.268 0.814 ± 0.435 0.745 ± 0.398<br />

|p| 0.173 ± 0.096 0.553 ± 0.307 0.306 ± 0.169<br />

decay length 1 1 1<br />

decay length significance 0.013 ± 0.010 0.028 ± 0.021 0.019 ± 0.014<br />

N trk,vtx 0.418 ± 0.172 0.829 ± 0.341 0.775 ± 0.318<br />

secondary vertex probability 0.349 ± 0.504 0.731 ± 1.050 0.394 ± 0.569<br />

joint probability R-φ 0.015 ± 0.011 0.048 ± 0.036 0.028 ± 0.021<br />

joint probability z 0.035 ± 0.033 0.127 ± 0.123 0.062 ± 0.061<br />

Table 5.9: Relative importance I i /I max <strong>of</strong> variables used as inputs for flavour tag neural<br />

nets at 500GeV in <strong>the</strong> case two or more secondary vertices have been found.


5.6 Flavour Tagging in <strong>the</strong> SiD Detector Concept 99<br />

At <strong>the</strong> Z resonance, in case only <strong>the</strong> primary vertex is found by ZVRES, <strong>the</strong> joint probability<br />

variables in R-φ <strong>and</strong> R-z provide <strong>the</strong> best h<strong>and</strong>le for distinguishing between different<br />

jet flavours. This is to be expected given that <strong>the</strong>se variables combine information from all<br />

<strong>the</strong> tracks in <strong>the</strong> jet, ra<strong>the</strong>r than resulting from only one <strong>of</strong> <strong>the</strong>m (as is <strong>the</strong> case for <strong>the</strong><br />

o<strong>the</strong>r six inputs). For <strong>the</strong> c tag provided for <strong>the</strong> case that all backgrounds are present, <strong>the</strong><br />

momentum <strong>of</strong> <strong>the</strong> most significant track in <strong>the</strong> jet also contributes significantly to <strong>the</strong> flavour<br />

tag result. The o<strong>the</strong>r variables contribute to a much lesser extent. At 500 GeV, <strong>the</strong> joint<br />

probability variables are still important, but <strong>the</strong> impact parameter significances <strong>of</strong> <strong>the</strong> most<br />

<strong>and</strong> second most significant track in <strong>the</strong> jet play a similarly important role in jet flavour<br />

identification.<br />

If at least two vertices are found in a jet, <strong>the</strong> p t corrected vertex mass provides <strong>the</strong> clearest<br />

indication <strong>of</strong> <strong>the</strong> jet flavour. O<strong>the</strong>r important variables are <strong>the</strong> seed vertex decay length <strong>and</strong><br />

<strong>the</strong> number <strong>of</strong> tracks in <strong>the</strong> seed vertex, especially if three or more vertices are found, as well<br />

as <strong>the</strong> vertex momentum |p| <strong>and</strong> <strong>the</strong> secondary vertex probability. As can be expected, <strong>the</strong><br />

relative importance <strong>of</strong> <strong>the</strong> decay length increases with increasing jet energy, <strong>and</strong> surpasses<br />

that <strong>of</strong> <strong>the</strong> M Pt variable for <strong>the</strong> three vertex case at 500 GeV.<br />

5.6 Flavour Tagging in <strong>the</strong> SiD Detector Concept<br />

Although <strong>the</strong> presented algorithm has been mainly studied in <strong>the</strong> ILD/LDC framework it<br />

has been extensively used also with <strong>the</strong> SiD detector geometry. In order to validate <strong>the</strong><br />

s<strong>of</strong>tware also in this framework <strong>the</strong> performance <strong>of</strong> <strong>the</strong> flavour tagging has been analysed.<br />

The analysis has been attempted in two separate stages. In <strong>the</strong> first stage <strong>the</strong> networks<br />

have not been retrained <strong>and</strong> <strong>the</strong> same networks have been used as <strong>the</strong> ones presented in <strong>the</strong><br />

ILD/LDC analysis. As it can be seen in fig. 5.15 this resulted in a substantial deterioration<br />

<strong>of</strong> <strong>the</strong> flavour tagging performance. In order to improve <strong>the</strong> performance <strong>the</strong> networks<br />

have been retrained. However in order to simplify <strong>the</strong> process instead <strong>of</strong> using 9 different<br />

neural networks only 3 have been used. Each neural network, b tag, c tag <strong>and</strong> c tag with b


5.7 Summary 100<br />

background <strong>the</strong>refore used all sixteen described discriminating variables plus an additional<br />

two: <strong>the</strong> jet energy <strong>and</strong> <strong>the</strong> number <strong>of</strong> vertices reconstructed in <strong>the</strong> jet. As it can be seen<br />

in fig. 5.15 <strong>the</strong> retraining <strong>of</strong> <strong>the</strong> neural network substantially improved <strong>the</strong> performance to<br />

a level comparable with <strong>the</strong> one seen in <strong>the</strong> LDCPrime 02Sc detector model.<br />

Figure 5.15: Comparison <strong>of</strong> tagging performance at √ s = 500 GeV. Tagging purity is plotted<br />

as function <strong>of</strong> efficiency for b jets (red) <strong>and</strong> c jets (green). Performance for c jets assuming<br />

only b background is also shown (blue). The dashed lines represent <strong>the</strong> performance using<br />

<strong>the</strong> LDC trained neural networks, while <strong>the</strong> solid lines represent <strong>the</strong> performance using <strong>the</strong><br />

SiD trained neural networks.<br />

5.7 Summary<br />

In this chapter <strong>the</strong> vertexing <strong>and</strong> flavour tagging algorithms for <strong>the</strong> ILD detector concept<br />

are described in detail. The performance <strong>of</strong> both algorithms is investigated in detail using<br />

a dijet sample with 91.2 GeV centre <strong>of</strong> mass energy. It has been found that b jets can be<br />

selected with a ∼ 90% purity for an efficiency <strong>of</strong> ∼ 70%. When using 500 GeV centre <strong>of</strong><br />

mass energy dijet events, assuming <strong>the</strong> same efficiency, <strong>the</strong> purity decreases to ∼ 80%.<br />

The algorithms have also been tested within <strong>the</strong> SiD detector concept.


Chapter 6<br />

The <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> <strong>and</strong> <strong>the</strong> t¯t <strong>Cross</strong><br />

<strong>Section</strong><br />

Having set <strong>the</strong> stage <strong>and</strong> described all <strong>the</strong> hardware <strong>and</strong> s<strong>of</strong>tware tools that are needed<br />

it is now possible to perform <strong>the</strong> chosen analysis. This chapter focuses on two specific<br />

observables: <strong>the</strong> mass <strong>of</strong> <strong>the</strong> top quark <strong>and</strong> <strong>the</strong> t¯t cross section. Given <strong>the</strong> early stages <strong>of</strong> <strong>the</strong><br />

ILC project <strong>the</strong> main interest is directed at underst<strong>and</strong>ing <strong>the</strong> limits <strong>and</strong> uncertainties that<br />

one would be able to extract from <strong>the</strong> ILC data. More specifically, as all early feasibility<br />

studies, this analysis focuses on <strong>the</strong> underst<strong>and</strong>ing <strong>of</strong> <strong>the</strong> statistical uncertainties <strong>of</strong> a wide<br />

range <strong>of</strong> observables. At this early stage it is difficult to account for all systematic errors,<br />

particularly considering <strong>the</strong> limited resources.<br />

Additionally it is important to underst<strong>and</strong> <strong>the</strong> performance <strong>of</strong> <strong>the</strong> detector in <strong>the</strong> most<br />

challenging environments. Because <strong>of</strong> this reason all <strong>the</strong> presented studies are being performed<br />

in <strong>the</strong> hadronic channel e + e − → t¯t → b¯bq¯qq¯q, which can be described as an excellent<br />

testing ground for <strong>the</strong> performance <strong>of</strong> <strong>the</strong> reconstruction in very busy events [60].<br />

In order to perform <strong>the</strong> analysis five separate samples have been used:<br />

1. A St<strong>and</strong>ard Model background sample. This is a weighted sample composed <strong>of</strong> ∼ 7<br />

× 10 6 events. As <strong>the</strong> name suggests it includes all <strong>the</strong> background St<strong>and</strong>ard Model<br />

101


Chapter 6. The <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> <strong>and</strong> <strong>the</strong> t¯t <strong>Cross</strong> <strong>Section</strong> 102<br />

processes. Each event is weighted in such a way that <strong>the</strong> integrated luminosity <strong>of</strong><br />

<strong>the</strong> sample is 500 fb −1 , although <strong>the</strong> total number <strong>of</strong> generated events is substantially<br />

lower. In such manner <strong>the</strong> event generation becomes substantially less CPU intensive.<br />

This sample includes all SM final states with <strong>the</strong> exclusion <strong>of</strong> <strong>the</strong> b¯bf ¯ff ¯f events which<br />

have been stripped out.<br />

2. A b¯bf ¯ff ¯f sample generated with a top quark mass <strong>of</strong> 174.0 GeV. The sample is<br />

composed <strong>of</strong> ∼ 0.25 × 10 6 unweighted events. These events can be fur<strong>the</strong>r separated<br />

into <strong>the</strong> signal events b¯bq¯qq¯q <strong>and</strong> <strong>the</strong> remaining background events. It has to be noted<br />

that e + e − → t¯t → b¯bq¯qq¯q is not <strong>the</strong> only process contributing to <strong>the</strong> production <strong>of</strong><br />

<strong>the</strong> b¯bq¯qq¯q signature, but is by far <strong>the</strong> dominant contribution. More specifically it has<br />

been calculated by using <strong>the</strong> Whizard Monte Carlo generator [30, 31] that <strong>the</strong> top<br />

mediated events are responsible for 90% ± 3% <strong>of</strong> <strong>the</strong> produced events. In this sample<br />

it is impossible to separate <strong>the</strong> top mediated <strong>and</strong> <strong>the</strong> non top mediated events. The<br />

full b¯bq¯qq¯q sample has <strong>the</strong>refore been used as signal <strong>and</strong> from here onwards is referred<br />

to as <strong>the</strong> hadronic t¯t.<br />

3. Three b¯bf ¯ff ¯f template samples with top quark masses <strong>of</strong> 173.5 GeV, 174.0 GeV <strong>and</strong><br />

174.5 GeV. All three samples are composed <strong>of</strong> ∼ 1.1 × 10 6 events with a weight <strong>of</strong> 0.25.<br />

These events can be fur<strong>the</strong>r separated into <strong>the</strong> signal events b¯bq¯qq¯q <strong>and</strong> <strong>the</strong> remaining<br />

background events.<br />

In all <strong>the</strong> following sections, unless differently stated, all number <strong>of</strong> events are considered<br />

as multiplied by <strong>the</strong>ir weights so that <strong>the</strong> samples are normalized to <strong>the</strong> same integrated<br />

luminosity <strong>of</strong> 500 fb −1 . All <strong>the</strong> samples used in <strong>the</strong> process have undergone full reconstruction<br />

in <strong>the</strong> SiD detector as described in <strong>the</strong> Simulation <strong>and</strong> Reconstruction chapter. The samples<br />

are representative <strong>of</strong> an ILC total integrated luminosity <strong>of</strong> 500fb −1 <strong>of</strong> which 250fb −1 taken<br />

with an electron polarization <strong>of</strong> -80% <strong>and</strong> a positron polarization <strong>of</strong> +30% <strong>and</strong> <strong>the</strong> remaining<br />

250fb −1 with an electron polarization <strong>of</strong> +80% <strong>and</strong> a positron polarization <strong>of</strong> -30%.


6.1 Preliminary Event Selection 103<br />

6.1 Preliminary Event Selection<br />

As it is very <strong>of</strong>ten <strong>the</strong> case <strong>the</strong> first step <strong>of</strong> <strong>the</strong> presented analysis is to perform a process<br />

<strong>of</strong> event selection. The main aim <strong>of</strong> this process is to separate <strong>the</strong> signal events from <strong>the</strong><br />

background events in <strong>the</strong> best way feasible. It is clear that ultimately one aims at rejecting<br />

as many background events as possible while retaining <strong>the</strong> majority <strong>of</strong> <strong>the</strong> signal events.<br />

However <strong>the</strong> optimal balancing point between <strong>the</strong> loss <strong>of</strong> efficiency <strong>and</strong> <strong>the</strong> increased purity<br />

<strong>of</strong> <strong>the</strong> signal sample after <strong>the</strong> selection process is <strong>of</strong>ten dependent on <strong>the</strong> specific details<br />

<strong>of</strong> <strong>the</strong> measurement that one ultimately aims to perform. In <strong>the</strong> case <strong>of</strong> a cross section<br />

measurement <strong>the</strong> optimal selection process can be argued, from simple error considerations,<br />

to be <strong>the</strong> one that maximizes <strong>the</strong> ratio S/ √ S + B, where S <strong>and</strong> B are respectively <strong>the</strong><br />

number <strong>of</strong> signal <strong>and</strong> background events. More difficult is <strong>the</strong> case <strong>of</strong> an invariant mass or<br />

asymmetry measurement in which <strong>the</strong> uncertainty will depend not only on <strong>the</strong> number <strong>of</strong><br />

background <strong>and</strong> signal events left in <strong>the</strong> final sample, but also on <strong>the</strong>ir particular distributions<br />

<strong>and</strong> <strong>the</strong> fitting procedures. Since <strong>the</strong> presented analysis has a very wide scope, as it ultimately<br />

aims to perform four separate measurements, it is difficult to define a single optimized event<br />

selection process. The difficulty is fur<strong>the</strong>r amplified by <strong>the</strong> cross correlation <strong>of</strong> <strong>the</strong> variables.<br />

The alternative <strong>of</strong> defining separate event selections for each process is instead not feasible<br />

in a preliminary study.<br />

The approach <strong>of</strong> <strong>the</strong> author has <strong>the</strong>refore been to attempt to define a set <strong>of</strong> common<br />

selection criteria that generically define <strong>the</strong> e + e − → t¯t → b¯bq¯qq¯q process <strong>and</strong> aim to reject<br />

most <strong>of</strong> <strong>the</strong> non-top SM background, leptonic top decays <strong>and</strong> poorly reconstructed hadronic<br />

top decays. The algorithm has been separated into three stages: lepton event selection,<br />

event selection using kinematic <strong>and</strong> topological variables <strong>and</strong> particle ID event selection.<br />

Initially <strong>the</strong> algorithm looks at isolated leptons, defined as a jet containing only one<br />

reconstructed particle which is ei<strong>the</strong>r an electron or a muon, <strong>and</strong> rejects all events where<br />

one or more isolated leptons have been found. No such jet should in fact be present in<br />

a e + e − → t¯t → b¯bq¯qq¯q event, if <strong>the</strong> jets have been well reconstructed. The all inclusive<br />

St<strong>and</strong>ard Model background instead has an abundance <strong>of</strong> events where an isolated lepton is


6.1 Preliminary Event Selection 104<br />

expected. This process eliminates ∼ 75× 10 6 background events, or 0.6% <strong>of</strong> all background,<br />

without <strong>the</strong> loss <strong>of</strong> a single signal event.<br />

Subsequently a set <strong>of</strong> kinematic <strong>and</strong> topological discriminating variables has been identified.<br />

These are <strong>the</strong> total energy <strong>of</strong> <strong>the</strong> event, <strong>the</strong> jet finder y 56 parameter (<strong>the</strong> y cut separation<br />

between <strong>the</strong> 5 <strong>and</strong> 6 jet hypo<strong>the</strong>sis, as explained in chapter 3), <strong>the</strong> number <strong>of</strong> particles in<br />

<strong>the</strong> event <strong>and</strong> <strong>the</strong> number <strong>of</strong> tracks in <strong>the</strong> event. The number <strong>of</strong> particles in <strong>the</strong> event is<br />

defined as <strong>the</strong> number <strong>of</strong> reconstructed particles that <strong>the</strong> Particle Flow Algorithm (PFA)<br />

identifies. In particular, <strong>the</strong> y 56 variable is a good indicator that an event is topologically<br />

consistent with a six jet hypo<strong>the</strong>sis. For example, if physically <strong>the</strong>re are only five jets in<br />

<strong>the</strong> event one jet <strong>of</strong> <strong>the</strong> event will have to be separated into two reconstructed jets, due to<br />

<strong>the</strong> six jet requirement. This will result in <strong>the</strong> substantially lower y 56 value <strong>and</strong> <strong>the</strong>refore<br />

<strong>the</strong> rejection <strong>of</strong> <strong>the</strong> event. It also ensures that <strong>the</strong> jets are well reconstructed <strong>and</strong> discards<br />

events where two jets from a six jets events are overlapping. Events with jets outside <strong>of</strong> <strong>the</strong><br />

acceptance will have a total energy that is substantially lower than 500 GeV, <strong>and</strong> will be<br />

rejected by <strong>the</strong> total energy requirement. Similar considerations apply to events in which<br />

neutrinos represent a substantial fraction <strong>of</strong> <strong>the</strong> total energy, such events will also be rejected<br />

by <strong>the</strong> same requirement. Fig. 6.1 illustrates <strong>the</strong> discriminatory power <strong>of</strong> each variable when<br />

<strong>the</strong>se are considered independently. Table 6.1 instead presents <strong>the</strong> full list <strong>of</strong> kinematic event<br />

selections.<br />

Kinematic <strong>and</strong> <strong>Top</strong>ological Event Selection<br />

Variable Barrel Value<br />

E tot > 400 GeV<br />

log(y 56 ) > -8.5<br />

number <strong>of</strong> particles in event > 80<br />

number <strong>of</strong> tracks in event > 30<br />

Table 6.1: List <strong>of</strong> <strong>the</strong> kinematic <strong>and</strong> topological event selection procedures.<br />

After this process has been performed all but 492 × 10 3 background events have been<br />

rejected. This compares to <strong>the</strong> initial number <strong>of</strong> 12.5 × 10 9 events <strong>and</strong> corresponds to a<br />

99.996% rejection factor. The selection process comes to <strong>the</strong> expense <strong>of</strong> an efficiency loss <strong>of</strong><br />

9.7% <strong>of</strong> <strong>the</strong> initial 143 × 10 3 signal events. Although <strong>the</strong> performance <strong>of</strong> <strong>the</strong>se preliminary


NumberOFparticlesnocutsTOPHARD<br />

Entries<br />

132410<br />

Mean 119.1<br />

RMS 18.66<br />

YCutMinnocutsTOPHARD<br />

Entries<br />

132410<br />

Mean −6.007<br />

RMS 0.9707<br />

TotalEnergynocutsWRONG<br />

Entries 7020944<br />

Mean 429.1<br />

RMS 64.75<br />

NumberOFtracksnocutsWRONG<br />

Entries 7020944<br />

Mean 2.601<br />

RMS 0.976<br />

6.2 Kinematic Fitter <strong>and</strong> W Identification 105<br />

log(y_56) − Hadronic ttbar<br />

Total Energy − Hadronic ttbar<br />

Events<br />

7<br />

10<br />

Events<br />

6<br />

10<br />

6<br />

10<br />

log(y_56) − Hadronic ttbar<br />

5<br />

10<br />

5<br />

10<br />

log(y_56) − SM Background<br />

4<br />

10<br />

4<br />

10<br />

3<br />

10<br />

3<br />

10<br />

2<br />

10<br />

2<br />

10<br />

Total Energy − SM Background<br />

10<br />

10<br />

Total Energy − Hadronic ttbar<br />

Events<br />

8<br />

10<br />

7<br />

10<br />

6<br />

10<br />

5<br />

10<br />

4<br />

10<br />

3<br />

10<br />

2<br />

10<br />

1<br />

−11 −10 −9 −8 −7 −6 −5 −4 −3<br />

log(y_56)<br />

10<br />

(a)<br />

Number <strong>of</strong> Particles − Hadronic ttbar<br />

Number <strong>of</strong> Particles − Hadronic ttbar<br />

Number <strong>of</strong> Particles − SM Background<br />

1<br />

40 60 80 100 120 140 160 180 200 220<br />

# <strong>of</strong> Particles<br />

(c)<br />

Events<br />

10<br />

10<br />

9<br />

10<br />

8<br />

10<br />

7<br />

10<br />

6<br />

10<br />

5<br />

10<br />

4<br />

10<br />

3<br />

10<br />

2<br />

10<br />

1<br />

300 320 340 360 380 400 420 440 460 480 500<br />

Total Energy (GeV)<br />

10<br />

(b)<br />

Number <strong>of</strong> Tracks − Hadronic ttbar<br />

Number <strong>of</strong> Tracks − Hadronic ttbar<br />

Number <strong>of</strong> Tracks − SM Background<br />

1<br />

0 20 40 60 80 100 120<br />

# <strong>of</strong> Tracks<br />

(d)<br />

Figure 6.1: Kinematic <strong>and</strong> <strong>Top</strong>ological Event Selections a) y 56 , b) total energy, c) number<br />

<strong>of</strong> particles in <strong>the</strong> event, d) number <strong>of</strong> tracks in <strong>the</strong> event.<br />

cuts is already significant it is clear that one should be able to improve it by means <strong>of</strong> particle<br />

identification processes. One would <strong>the</strong>refore aim at identifying <strong>the</strong> b quarks by using <strong>the</strong><br />

described LCFI Vertex algorithm <strong>and</strong> at identifying <strong>the</strong> W bosons by its significant invariant<br />

mass.<br />

6.2 Kinematic Fitter <strong>and</strong> W Identification<br />

In order to identify <strong>the</strong> invariant mass <strong>of</strong> <strong>the</strong> reconstructed c<strong>and</strong>idate W bosons <strong>the</strong> KinFit<br />

[61, 62] kinematic fitting algorithm has been used. This section <strong>the</strong>refore briefly describes <strong>the</strong><br />

algorithm used for identifying <strong>the</strong> W bosons <strong>and</strong>, subsequently, for calculating <strong>the</strong> top quark<br />

mass. The idea behind kinematic fitting is to use kinematic constraints <strong>of</strong> a given physical<br />

process to improve <strong>the</strong> precision <strong>of</strong> <strong>the</strong> parameters <strong>of</strong> interest. The desired parameters are


6.2 Kinematic Fitter <strong>and</strong> W Identification 106<br />

fitted using a least squares technique <strong>and</strong> <strong>the</strong> physical constraints are incorporated into <strong>the</strong><br />

fit using Lagrange multipliers [61, 62], forcing <strong>the</strong> fit to fulfil <strong>the</strong>se constraints. In more<br />

ma<strong>the</strong>matical terms this can be expressed as <strong>the</strong> following χ 2 minimization:<br />

χ 2 Tot(⃗η,⃗ε, ⃗ λ) = (⃗y − ⃗η) T · V −1 · (⃗y − ⃗η) + 2 ⃗ λ T · ⃗f(⃗η,⃗ε) (6.1)<br />

where ⃗y are <strong>the</strong> measured quantities, ⃗η are <strong>the</strong> measured parameters to be optimized, ⃗ε are<br />

<strong>the</strong> unmeasured parameters to be optimized, V −1 is <strong>the</strong> inverse <strong>of</strong> <strong>the</strong> uncertainty matrix<br />

on <strong>the</strong> measurements, here assumed to be a diagonal matrix, ⃗ f are <strong>the</strong> constraint function<br />

<strong>and</strong> ⃗ λ are <strong>the</strong> Lagrange multipliers. ⃗ε,⃗η, ⃗ λ are <strong>the</strong> unknown values. The χ 2 Tot<br />

is <strong>the</strong>refore<br />

composed <strong>of</strong> a conventional χ 2 term <strong>and</strong> <strong>of</strong> a constraint term. It can now be shown that <strong>the</strong><br />

following equations must be solved:<br />

⃗0 =<br />

⃗0 =<br />

V −1 · (⃗y − ⃗η) + F T η · ⃗λ<br />

F T ε · ⃗λ<br />

⃗0 = ⃗ f(⃗η,⃗ε) (6.2)<br />

where <strong>the</strong> matrices F T η <strong>and</strong> F T ε are defined as:<br />

F T η = (F T η ) kn = ∂f k<br />

∂η n<br />

F T ε = (F T ε ) kj = ∂f k<br />

∂ε j<br />

(6.3)<br />

<strong>and</strong> k, n <strong>and</strong> j are respectively <strong>the</strong> number <strong>of</strong> constraints, <strong>the</strong> number <strong>of</strong> measured <strong>and</strong><br />

unmeasured parameters. The third equation <strong>of</strong> 6.2 ensures that <strong>the</strong> hard constraints are met.<br />

Additionally it is clear from equation 6.1 that in order to perform <strong>the</strong> fit <strong>the</strong> uncertainties<br />

on <strong>the</strong> measured quantities must be determined. More specifically in <strong>the</strong> presented analysis<br />

<strong>the</strong> jet energy <strong>and</strong> angle uncertainties need to be known. The parameters used have been<br />

50%/ √ E for <strong>the</strong> jet energy expressed in GeV <strong>and</strong> 30 mrad for <strong>the</strong> angle. Such values have<br />

been found to model well <strong>the</strong> performance <strong>of</strong> <strong>the</strong> SiD detector. [63]


Entries 128574<br />

Mean 77.37<br />

RMS 16.62<br />

FourOnlyWDiff<strong>Mass</strong>FitnocutsTOPHARD<br />

Entries 129080<br />

Mean 76.46<br />

RMS 14.67<br />

Entries 121414<br />

Mean 72.96<br />

RMS 21.24<br />

Entries 129080<br />

Mean 77.29<br />

RMS 15.12<br />

6.2 Kinematic Fitter <strong>and</strong> W Identification 107<br />

In <strong>the</strong> case <strong>of</strong> W boson identification <strong>the</strong> only set constraint is that <strong>the</strong> masses <strong>of</strong> <strong>the</strong> two<br />

c<strong>and</strong>idate W bosons present in <strong>the</strong> event should be equal. This is <strong>of</strong> course an approximation<br />

since <strong>the</strong> W bosons are not physically forced to decay on <strong>the</strong> mass shell <strong>and</strong> <strong>the</strong>refore <strong>the</strong>y<br />

are not forced to have <strong>the</strong> same mass. However this method has been shown to produce<br />

slightly better results when compared to <strong>the</strong> possibility <strong>of</strong> not applying any constraints.<br />

Ano<strong>the</strong>r problem presented while trying to identify <strong>the</strong> W bosons is <strong>the</strong> choice <strong>of</strong> <strong>the</strong> right<br />

combination <strong>of</strong> jets. One can ei<strong>the</strong>r consider all six jets present in <strong>the</strong> event or only <strong>the</strong><br />

four least b-like jets. As it is expected, given <strong>the</strong> good performance <strong>of</strong> <strong>the</strong> LCFI Vertex b<br />

tagging <strong>and</strong> <strong>the</strong> resulting reduced number <strong>of</strong> combinatorial possibilities, <strong>the</strong> second method<br />

is substantially superior (shown in fig. 6.2). Such method has <strong>the</strong>refore been used.<br />

W bosons <strong>Mass</strong> − Hadronic ttbar<br />

Events<br />

4000<br />

3500<br />

3000<br />

Four Jet <strong>Mass</strong><br />

Four <strong>of</strong> Six Jet <strong>Mass</strong><br />

Events<br />

W bosons <strong>Mass</strong> − Hadronic ttbar<br />

4500<br />

4000<br />

3500<br />

Kinematic Fit<br />

No Kinematic Fit<br />

2500<br />

2000<br />

1500<br />

1000<br />

500<br />

3000<br />

2500<br />

2000<br />

1500<br />

1000<br />

500<br />

40 50 60 70 80 90 100 110 120<br />

<strong>Mass</strong> (GeV)<br />

(a)<br />

40 50 60 70 80 90 100 110 120<br />

<strong>Mass</strong> (GeV)<br />

(b)<br />

Events<br />

5000<br />

W bosons <strong>Mass</strong> − Hadronic ttbar<br />

Known W <strong>Mass</strong><br />

Events<br />

W bosons <strong>Mass</strong>− SM Background<br />

6000<br />

4000<br />

Unknown W <strong>Mass</strong><br />

5000<br />

3000<br />

4000<br />

2000<br />

1000<br />

3000<br />

2000<br />

1000<br />

W mass known with fit<br />

W mass known no fit<br />

W mass unknown<br />

40 50 60 70 80 90 100 110 120<br />

<strong>Mass</strong> (GeV)<br />

(c)<br />

0<br />

40 50 60 70 80 90 100 110 120<br />

<strong>Mass</strong> (GeV)<br />

(d)<br />

Figure 6.2: Different methods <strong>of</strong> reconstructing <strong>the</strong> W bosons: a) <strong>the</strong> methods using only<br />

<strong>the</strong> four least b-like jets in <strong>the</strong> kinematic fit (black line) is compared to <strong>the</strong> method where all<br />

jets are used (light line), b) <strong>the</strong> results are shown before (black line) <strong>and</strong> after <strong>the</strong> kinematic<br />

fit (light line), c) <strong>the</strong> methods where <strong>the</strong> mass <strong>of</strong> <strong>the</strong> W boson is known (black line) is<br />

compared to <strong>the</strong> method where <strong>the</strong> mass is unknown (light line), in d) <strong>the</strong> effect <strong>of</strong> <strong>the</strong><br />

various procedures on <strong>the</strong> background sample is displayed. All variables are plotted after<br />

<strong>the</strong> kinematic event selection.


6.3 b Tagging Event Selection 108<br />

Even after considering only four jets three possible jet combinations are still left. A<br />

way <strong>of</strong> determining <strong>the</strong> best combination must <strong>the</strong>refore be defined. Two possibilities have<br />

been considered: <strong>the</strong> chosen combination is ei<strong>the</strong>r <strong>the</strong> one that best fulfils <strong>the</strong> equal mass<br />

constraint or <strong>the</strong> one that minimizes <strong>the</strong> quantity ∆M = √ (M w1 − M w ) 2 + (M w2 − M w ) 2 ,<br />

where M w1 <strong>and</strong> M w2 are <strong>the</strong> masses <strong>of</strong> <strong>the</strong> W boson c<strong>and</strong>idates before kinematic fitting; M w<br />

is <strong>the</strong> world average for <strong>the</strong> W boson mass.<br />

The second method has been shown to perform better <strong>and</strong> has hence been utilized for <strong>the</strong><br />

event selection. As it can be noticed (fig. 6.2) <strong>the</strong> introduction <strong>of</strong> a constrain improves only<br />

minimally <strong>the</strong> result, but it substantially simplifies <strong>the</strong> logic behind <strong>the</strong> process since both<br />

bosons are forced to have <strong>the</strong> same mass. A fur<strong>the</strong>r event selection was <strong>the</strong>refore performed<br />

on <strong>the</strong> calculated mass after <strong>the</strong> kinematic fitting process. More specifically <strong>the</strong> events that<br />

have calculated masses for <strong>the</strong> W boson c<strong>and</strong>idates <strong>of</strong> less than 50GeV or more than 110GeV<br />

are rejected as background events.<br />

6.3 b Tagging Event Selection<br />

The final step <strong>of</strong> <strong>the</strong> event selection process is <strong>the</strong> rejection <strong>of</strong> events in which two b quarks<br />

have not been clearly identified, done by means <strong>of</strong> <strong>the</strong> already presented LCFI algorithm.<br />

However <strong>the</strong> method has been originally tested only on a dijet sample. It has been <strong>the</strong>refore<br />

deemed necessary to check <strong>the</strong> performance in <strong>the</strong> case <strong>of</strong> more complex topologies, as <strong>the</strong><br />

one presented by <strong>the</strong> hadronic top. The satisfactory results <strong>of</strong> such test can be seen in fig.<br />

6.3. In numerical terms, if a b quark tagging efficiency <strong>of</strong> 45.0% is required, <strong>the</strong>n one will<br />

incur a 2.6% c quark mistagging efficiency <strong>and</strong> 0.8% uds quark mistagging efficiency. As we<br />

can infer from fig. 5.13(b), comparable mistagging efficiencies for <strong>the</strong> same tagging efficiency<br />

in <strong>the</strong> LDC dijet study at 91.2 GeV are 0.8% <strong>and</strong> 0.07%. Only part <strong>of</strong> this degradation can<br />

be however attributed to <strong>the</strong> increased complexity <strong>of</strong> <strong>the</strong> event. The unoptimized SiD neural<br />

networks, as seen in fig. 5.15 have in fact been used in this study. The optimized ones were<br />

not available in time to be included into this analysis.


neural net <strong>of</strong> uds<br />

Entries<br />

1.956656e+07<br />

Mean 0.04469<br />

RMS 0.1023<br />

6.3 b Tagging Event Selection 109<br />

NN output for b,c,uds jets at MC level<br />

Jets<br />

7<br />

10<br />

NN output for ’MC−uds jets<br />

NN output for ’MC−c jets’<br />

6<br />

10<br />

NN output for ’MC−b jets’<br />

5<br />

10<br />

4<br />

10<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

b−tag NN output<br />

Figure 6.3: Performance <strong>of</strong> <strong>the</strong> b tagging algorithm when used on a b¯bq¯qq¯q sample.<br />

In a first instance <strong>the</strong> variable b − tag sum , <strong>the</strong> sum <strong>of</strong> <strong>the</strong> b-tag neural net outputs for<br />

all six jets is calculated (as seen in fig. 6.4(a)) <strong>and</strong> used for <strong>the</strong> selection procedure. Given<br />

that this is at <strong>the</strong> same time <strong>the</strong> most complex <strong>and</strong> <strong>the</strong> most powerful <strong>of</strong> <strong>the</strong> used variables<br />

an optimization has been attempted. Fig. 6.4(b) shows <strong>the</strong> S/ √ S + B quantity assuming<br />

<strong>the</strong> selection is performed at different values <strong>of</strong> <strong>the</strong> summed neural net output. The value <strong>of</strong><br />

S/ √ S + B that fig. 6.4(b) suggests to use for <strong>the</strong> event selection is one. S/ √ S + B however<br />

decreases only minimally if a slightly higher requirement is imposed. The importance that<br />

<strong>the</strong> b tagging has in <strong>the</strong> kinematic fitting <strong>of</strong> <strong>the</strong> W boson, as already seen, <strong>and</strong> <strong>of</strong> <strong>the</strong> top<br />

quark is however not expressed in <strong>the</strong> simplicity <strong>of</strong> this graph. It has <strong>the</strong>refore been decided<br />

that a slightly higher requirement would be beneficial to <strong>the</strong> remainder <strong>of</strong> <strong>the</strong> analysis;<br />

particularly because <strong>the</strong> kinematic fitting methods described require <strong>the</strong> presence <strong>of</strong> not<br />

one, but two well identified b quarks. All events below <strong>the</strong> b − tag sum value <strong>of</strong> 1.5 have<br />

<strong>the</strong>refore been rejected as background while all events with values above 1.5 are considered<br />

signal.<br />

As mentioned one would also like to ensure that at least two recognizable b jets are<br />

present in <strong>the</strong> event. Events whose most b-like jet has a neural net b-tag output <strong>of</strong> less than<br />

0.9 (fig. 6.5(a)) or whose second most b-like jet has a neural net b-tag output <strong>of</strong> less than<br />

0.4 are also rejected (fig. 6.5(b)).


BsumTOPHARDEYPTcut<br />

Entries<br />

129080<br />

Mean 1.787<br />

RMS 0.6173<br />

HighestBtagTOPHARDEYPTcut<br />

Entries<br />

129080<br />

Mean 0.9229<br />

RMS 0.1601<br />

SecondBtagWRONGEYPTcut<br />

Entries<br />

121414<br />

Mean 0.1871<br />

RMS 0.2788<br />

6.3 b Tagging Event Selection 110<br />

Events<br />

5<br />

10<br />

4<br />

10<br />

3<br />

10<br />

2<br />

10<br />

10<br />

Sum <strong>of</strong> b−tag NN output<br />

Hadronic ttbar<br />

SM Background<br />

1<br />

0 1 2 3 4 5 6<br />

Sum b−tag NN output<br />

(a)<br />

Signal/sqrt(Signal + Background)<br />

b−tag Event Selection Optimization<br />

240<br />

220<br />

200<br />

180<br />

160<br />

140<br />

120<br />

100<br />

80<br />

60<br />

40<br />

20<br />

0<br />

0 0.5 1 1.5 2 2.5 3 3.5 4<br />

Sum b−tag NN output<br />

(b)<br />

Figure 6.4: The variable b − tag sum is plotted for <strong>the</strong> signal <strong>and</strong> background events in a). In<br />

b) one can see <strong>the</strong> resulting S/ √ S + B quantity used for <strong>the</strong> event selection optimization.<br />

All variables are plotted after <strong>the</strong> kinematic event selection.<br />

Jets<br />

3<br />

×10<br />

120<br />

100<br />

Most b−quark like jet in <strong>the</strong> event<br />

Hadronic ttbar<br />

Jets<br />

250<br />

3<br />

×10<br />

Second most b−quark like jet in <strong>the</strong> event<br />

Hadronic ttbar<br />

80<br />

SM Background<br />

200<br />

SM Background<br />

60<br />

150<br />

40<br />

100<br />

20<br />

50<br />

0<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

NN output b−tag<br />

(a)<br />

0<br />

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

NN output b−tag<br />

(b)<br />

Figure 6.5: The neural net output for <strong>the</strong> most b-like jet , in a), <strong>and</strong> second most b-like jet,<br />

in b), are displayed for both <strong>the</strong> signal <strong>and</strong> <strong>the</strong> background sample. All variables are plotted<br />

after <strong>the</strong> kinematic event selection.<br />

After both particle identification procedures have been applied <strong>the</strong> signal selection efficiency<br />

is 51.5%, this implies that approximately 74 thous<strong>and</strong> b¯bq¯qq¯q events have passed all<br />

event selection procedures. Of <strong>the</strong> background samples 33.5 thous<strong>and</strong> events have passed<br />

<strong>the</strong> selection. The purity <strong>of</strong> <strong>the</strong> resulting sample is <strong>the</strong>refore 68.8%. Of <strong>the</strong> remaining background<br />

events 8.5 thous<strong>and</strong>s, or 25%, are b¯bqlνq¯q events. It is also expected that a significant<br />

proportion <strong>of</strong> <strong>the</strong> remaining background derives from <strong>the</strong> e + e − → W + W − → q¯qq¯q process,<br />

with a smaller contribution from e + e − → ZZ → q¯qq¯q.


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 111<br />

6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong><br />

The next step <strong>of</strong> <strong>the</strong> analysis is <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> top mass. Once again a kinematic<br />

fitter was employed to reduce <strong>the</strong> combinatorial background <strong>of</strong> <strong>the</strong> six-jet environment <strong>and</strong><br />

to take advantage <strong>of</strong> <strong>the</strong> existing kinematic constraints <strong>of</strong> top decays. Table 6.2 illustrates<br />

<strong>the</strong> full list <strong>of</strong> constraints used for this analysis. Similarly to what was previously described<br />

<strong>the</strong> two most b-like jets were considered to be originating from b-quarks <strong>and</strong> <strong>the</strong>refore were<br />

not used in <strong>the</strong> reconstruction <strong>of</strong> <strong>the</strong> two W bosons. Additionally <strong>the</strong> masses <strong>of</strong> such b-jets<br />

were forced to be 5.8 GeV, which is roughly <strong>the</strong> mass <strong>of</strong> B mesons.<br />

<strong>Top</strong> Kinematic Fitting Constraints<br />

<strong>Mass</strong>(top1) = <strong>Mass</strong>(top2)<br />

<strong>Mass</strong>(W1) = 80.4 GeV<br />

<strong>Mass</strong>(W2) = 80.4 GeV<br />

<strong>Mass</strong>(b Jet1 ) = 5.8 GeV<br />

<strong>Mass</strong>(b Jet2 ) = 5.8 GeV<br />

E Total = 500 GeV<br />

p x ;p y ;p z = 0<br />

Table 6.2: List <strong>of</strong> kinematic fitting constraints used for <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> top mass.<br />

It is clear that, in setting <strong>the</strong> constraints <strong>of</strong> table 6.2, a whole set <strong>of</strong> approximations is<br />

made. Similarly to <strong>the</strong> already discussed W boson example <strong>the</strong> masses <strong>of</strong> <strong>the</strong> two top quarks<br />

are set to be equal; <strong>the</strong> W boson mass is now constrained even fur<strong>the</strong>r to its on-shell values.<br />

The beam bremsstrahlung process is also ignored <strong>and</strong> <strong>the</strong> ideal momentum <strong>and</strong> energy values<br />

<strong>of</strong> <strong>the</strong> beam are <strong>the</strong>refore used as constraints.<br />

The s<strong>of</strong>tware calculates <strong>the</strong> best kinematic configuration for each <strong>of</strong> <strong>the</strong> six possible<br />

combinations. Of <strong>the</strong>se six configurations <strong>the</strong> one with <strong>the</strong> lowest χ 2 , as presented in equation<br />

6.1, is chosen. The impact <strong>of</strong> this procedure on <strong>the</strong> reconstructed top mass distribution can<br />

be seen in 6.6(a). In order to interpret <strong>the</strong> result it must be noted that while before <strong>the</strong><br />

kinematic fitting <strong>the</strong> top <strong>and</strong> <strong>the</strong> antitop quarks produced in each event can be used as<br />

two statistically independent measures <strong>of</strong> <strong>the</strong> top mass, after <strong>the</strong> fitting <strong>the</strong> mass <strong>of</strong> <strong>the</strong><br />

top <strong>and</strong> antitop quarks are equal <strong>and</strong> <strong>the</strong>refore represent only one statistically independent


Six2bNoFit<strong>Mass</strong>withbmassEYPTBcutTOPHARD<br />

Entries<br />

145058<br />

Mean 179.3<br />

RMS 33.61<br />

Six2bFit<strong>Mass</strong>withbmassEYPTBcutTOPHARD<br />

Entries<br />

72529<br />

Mean 189.8<br />

RMS 24.31<br />

Six2bNoFit<strong>Mass</strong>withbmassEYPTBcutTOPHARD<br />

Entries<br />

23690<br />

Mean 188.1<br />

RMS 43.26<br />

Six2bFit<strong>Mass</strong>withbmassEYPTBcutTOPHARD<br />

Entries 11845<br />

Mean 192.5<br />

RMS 33.28<br />

6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 112<br />

measurement. The kinematic fitting procedure can <strong>the</strong>refore be seen as an alternative, <strong>and</strong><br />

arguably better, use <strong>of</strong> <strong>the</strong> same information. In fact, as it can be seen in 6.6(a) it improves<br />

<strong>the</strong> resolution <strong>of</strong> <strong>the</strong> mass distribution <strong>and</strong> also betters <strong>the</strong> jet energy scale on a per event<br />

basis.<br />

<strong>Top</strong> quarks/Events<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> − Hadronic ttbar<br />

1400<br />

1200<br />

1000<br />

800<br />

600<br />

400<br />

Kinematic Fit<br />

No Kinematic Fit<br />

<strong>Top</strong> quarks/Events<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> − SM Background<br />

300<br />

Kinematic Fit<br />

250<br />

No Kinematic Fit<br />

200<br />

150<br />

100<br />

200<br />

0<br />

120 140 160 180 200 220 240 260 280 300<br />

<strong>Mass</strong> (GeV)<br />

(a)<br />

50<br />

0<br />

120 140 160 180 200 220 240 260 280 300<br />

<strong>Mass</strong> (GeV)<br />

(b)<br />

Figure 6.6: <strong>Top</strong> quark kinematic fit: <strong>the</strong> effect <strong>of</strong> <strong>the</strong> kinematic fit on <strong>the</strong> invariant mass<br />

distribution <strong>of</strong> <strong>the</strong> signal sample, in a), <strong>and</strong> on <strong>the</strong> background sample, in b), can be seen.<br />

It is also possible to convert <strong>the</strong> χ 2 <strong>of</strong> <strong>the</strong> fit into a probability <strong>and</strong> to select events whose<br />

probability <strong>of</strong> satisfying <strong>the</strong> required constraints is higher than a certain minimum. The<br />

effect <strong>of</strong> setting such event selection has been studied <strong>and</strong> can be seen in fig. 6.7.<br />

Events<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> − Hadronic ttbar<br />

1200<br />

1000<br />

800<br />

600<br />

400<br />

200<br />

0<br />

All Fitted Events<br />

Probability > 1%<br />

Probability > 30%<br />

140 160 180 200 220 240 260<br />

<strong>Mass</strong> (GeV)<br />

(a)<br />

Events<br />

240<br />

220<br />

200<br />

180<br />

160<br />

140<br />

120<br />

100<br />

80<br />

60<br />

40<br />

20<br />

0<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> − SM Background<br />

All Fitted Events<br />

Probability > 1%<br />

Probability > 30%<br />

140 160 180 200 220 240 260<br />

<strong>Mass</strong> (GeV)<br />

(b)<br />

Figure 6.7: <strong>Top</strong> quark kinematic fit event selection; <strong>the</strong> results <strong>of</strong> performing an event<br />

selection on <strong>the</strong> basis <strong>of</strong> <strong>the</strong> probability that <strong>the</strong> event satisfies <strong>the</strong> kinematic fit constraints<br />

are displayed: in a) for <strong>the</strong> signal, in b) for <strong>the</strong> background events.<br />

It is clear that rejecting <strong>the</strong> events with probability lower than 1% has a significant<br />

influence on <strong>the</strong> number <strong>of</strong> signal <strong>and</strong> background events as well as on <strong>the</strong> shape <strong>of</strong> <strong>the</strong>


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 113<br />

mass distribution. Differently any fur<strong>the</strong>r rejection seems to have only a very small effect.<br />

This result suggests that <strong>the</strong> best mass resolution will be achieved with <strong>the</strong> 1% probability<br />

selection procedure. As any tighter cut performs similarly with respect to <strong>the</strong> mass resolution<br />

<strong>and</strong> background rejection, but has slightly lower signal efficiencies.<br />

Having reached <strong>the</strong> goal <strong>of</strong> plotting <strong>the</strong> top quark mass distribution a method needs to<br />

be implemented in order to estimate <strong>the</strong> top mass <strong>and</strong> its relative error. Two techniques<br />

were used for this: <strong>the</strong> curve fitting <strong>and</strong> <strong>the</strong> template fitting. Both techniques are described<br />

below.<br />

6.4.1 Curve Fitting<br />

The first method used in this analysis is a simple curve fitting technique. In this process <strong>the</strong><br />

calculated top quark distribution is modelled with a Breit-Wigner distribution, representing<br />

<strong>the</strong> natural width <strong>of</strong> <strong>the</strong> top quark convoluted with an ad-hoc determined resolution function:<br />

∫<br />

S(t) = N<br />

BW(x) × RES(t − x)dx (6.4)<br />

where N is a normalization factor <strong>and</strong> <strong>the</strong> Breit-Wigner is defined as:<br />

BW(x) =<br />

Γ<br />

2π[(x − M) 2 + Γ 2 /4]<br />

(6.5)<br />

<strong>the</strong> used resolution function is an asymmetric double gaussian:<br />

⎧<br />

⎪⎨ l × exp<br />

RES(x) =<br />

⎪⎩ l × exp<br />

(− (x−M)2<br />

2σ1<br />

(− 2 (x−M)2<br />

2σ 2 1<br />

)<br />

+ (1 − l) × exp<br />

)<br />

+ (1 − l) × exp<br />

(− (x−M)2<br />

2σ2<br />

(− 2 (x−M)2<br />

2σ 2 3<br />

)<br />

, x > M<br />

)<br />

, x < M<br />

(6.6)<br />

In equation 6.5 Γ is <strong>the</strong> Breit-Wigner width <strong>and</strong> M is <strong>the</strong> mass <strong>of</strong> <strong>the</strong> top quark. In equation<br />

6.6 l is a weight parameter <strong>and</strong> σ 1,2,3 are <strong>the</strong> st<strong>and</strong>ard deviation <strong>of</strong> <strong>the</strong> fitted Gaussians; as<br />

before M is <strong>the</strong> mass <strong>of</strong> <strong>the</strong> top quark. Additionally <strong>the</strong> SM background has been modelled<br />

with a second order polynomial.


BWconvGaussplustanh<br />

Entries 57396<br />

Mean 177.3<br />

RMS 7.92<br />

χ 2<br />

/ ndf<br />

157.8 / −10<br />

p1 174<br />

±<br />

0.1<br />

6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 114<br />

The first step <strong>of</strong> <strong>the</strong> process is to fit <strong>the</strong> statistically independent MC generated signal<br />

events with <strong>the</strong> function displayed in equation 6.4. Please note that <strong>the</strong>se are not <strong>the</strong> same<br />

events as <strong>the</strong> ones on which <strong>the</strong> final, mass determining, fit is performed. In this process<br />

<strong>the</strong> parameters Γ <strong>and</strong> M are fixed to <strong>the</strong> ones specified at <strong>the</strong> generator level. The aim <strong>of</strong><br />

<strong>the</strong> fit is <strong>the</strong>refore to extract <strong>the</strong> resolution function parameters: w <strong>and</strong> σ 1,2,3 . In a separate<br />

process <strong>the</strong> background events are fitted with <strong>the</strong> second order polynomial. Ideally one would<br />

fit a statistically independent background sample; but, given <strong>the</strong> lack <strong>of</strong> such sample, <strong>the</strong><br />

background data sample has been fit as an approximation. Given <strong>the</strong> relatively low statistics<br />

<strong>of</strong> <strong>the</strong> background with respect to <strong>the</strong> signal (


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 115<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong><br />

Event Selection Fit Range (GeV) χ 2 /NDF <strong>Mass</strong> (GeV) σ(GeV)<br />

Kinematic fit 165-200 308/68 173.937 0.054<br />

Kinematic fit 165-185 130/38 173.565 0.051<br />

Probability > 1% 165-200 858/68 173.902 0.056<br />

Probability > 1% 165-185 157/38 173.965 0.051<br />

Probability > 30% 165-200 945/68 174.153 0.045<br />

Probability > 30% 165-185 160/38 173.884 0.045<br />

Table 6.3: <strong>Top</strong> quark mass - curve fitting technique: <strong>the</strong> full list <strong>of</strong> results obtained from<br />

<strong>the</strong> curve fitting techniques using different event selection <strong>and</strong> fit ranges is displayed<br />

which explains <strong>the</strong> fat tails at high masses, not present before <strong>the</strong> kinematic fit (fig. 6.6(a)).<br />

This derives from <strong>the</strong> forcing <strong>of</strong> <strong>the</strong> events reconstructed with masses higher than 250GeV<br />

to a lower value, compatible with <strong>the</strong> set constraints. O<strong>the</strong>r major unmodelled sources <strong>of</strong><br />

discrepancy are <strong>the</strong> combinatorial background <strong>and</strong> <strong>the</strong> approximation <strong>of</strong> t¯t → b¯bq¯qq¯q as <strong>the</strong><br />

signal sample. It is clear that with a choice <strong>of</strong> different <strong>and</strong> substantially more complex<br />

functions all <strong>the</strong> above described events could be accounted for, this is however not <strong>the</strong> path<br />

chosen by this analysis. In fact a different <strong>and</strong> more reliable method for determining <strong>the</strong> top<br />

quark mass, a template fitting method, has subsequently been implemented <strong>and</strong> is described<br />

in <strong>the</strong> next section.<br />

Finally also <strong>the</strong> background events are weighted <strong>and</strong> this poses a choice on what shall<br />

be considered as <strong>the</strong> background uncertainty. For example, if <strong>the</strong> weights <strong>of</strong> events w i are<br />

larger than one, <strong>the</strong>n by considering <strong>the</strong> uncertainty <strong>of</strong> a bin as √ ∑i w2 i <strong>the</strong> modelled error<br />

is clearly overstated with respect to <strong>the</strong> physically expected one. The effect <strong>of</strong> considering<br />

<strong>the</strong> error as √ ∑i w i, as it has been done in this analysis, is instead more subtle. In this case<br />

<strong>the</strong> uncertainty seems to imply a much smoo<strong>the</strong>r distribution than <strong>the</strong> presented one, as <strong>the</strong><br />

error <strong>of</strong> each bin is statistically understated. A curve fitted on such events will <strong>the</strong>refore<br />

tend to result in a larger χ 2 /NDF value. This might also explain partially <strong>the</strong> seemingly<br />

large values obtained in <strong>the</strong> presented fits. Although this can be a major effect, it is present<br />

only in <strong>the</strong> background events <strong>and</strong> hence probably not very dramatic in this analysis. It<br />

has <strong>the</strong>refore been preferred to <strong>the</strong> possible alternative. Incidentally while dealing with<br />

a template sample, because one is not trying to model <strong>the</strong> actually collected data but to


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 116<br />

reproduce our <strong>the</strong>oretical model <strong>the</strong> uncertainty should always be calculated as √ ∑i w2 i , as<br />

it has also been.<br />

Because <strong>of</strong> <strong>the</strong> combination <strong>of</strong> all <strong>the</strong>se reasons <strong>the</strong> fit had to be restricted only to a<br />

narrow window encompassing <strong>the</strong> peak <strong>of</strong> <strong>the</strong> distribution. This is in fact <strong>the</strong> region where<br />

<strong>the</strong> distribution is most sensitive to <strong>the</strong> signal, least sensitive to <strong>the</strong> background <strong>and</strong> to<br />

most <strong>of</strong> <strong>the</strong> described limitations. It is clear from table 6.3 that widening <strong>the</strong> fit region<br />

fur<strong>the</strong>r decreases <strong>the</strong> reliability <strong>of</strong> <strong>the</strong> fit. Interestingly enough no major effect is seen by<br />

selecting events only above a certain kinematic fit probability; only a small improvement in<br />

<strong>the</strong> uncertainty can be seen.<br />

As a result <strong>of</strong> this procedure one can conclude that <strong>the</strong> obtainable statistical uncertainty<br />

on <strong>the</strong> top mass at <strong>the</strong> ILC will be around 50 MeV. Given <strong>the</strong> limitations <strong>of</strong> <strong>the</strong> method it<br />

seems sensible to quote this only as an approximate value. As already stated a method, which<br />

we consider more reliable, has also been implemented. The next section in fact describes <strong>the</strong><br />

template fitting method. The result <strong>of</strong> this section should <strong>the</strong>refore be considered mainly as<br />

a check <strong>of</strong> such method, ra<strong>the</strong>r than <strong>the</strong> final result <strong>of</strong> <strong>the</strong> top mass analysis.<br />

6.4.2 Template Fitting<br />

In <strong>the</strong> template method <strong>the</strong> data signal sample is compared to several MC signal templates<br />

which have been generated using different values <strong>of</strong> <strong>the</strong> top mass (173.5GeV, 174.0GeV <strong>and</strong><br />

174.5GeV). The χ 2 between <strong>the</strong> distributions can <strong>the</strong>n be calculated as:<br />

χ 2 =<br />

N∑<br />

bins<br />

i=1<br />

(t i − y i + δ i ) 2<br />

(σ 2 t,i + σ2 y,i + σ2 SM,i ) (6.7)<br />

where N bins are <strong>the</strong> number <strong>of</strong> bins in <strong>the</strong> top mass histogram considered in <strong>the</strong> calculation, t i ,<br />

y i are <strong>the</strong> number <strong>of</strong> events in <strong>the</strong> i th bin for <strong>the</strong> <strong>the</strong> template sample <strong>and</strong> <strong>the</strong> signal sample,<br />

σ t,i , σ y,i , σ SM,i are <strong>the</strong> uncertainties on <strong>the</strong> i th bin for <strong>the</strong> template sample, <strong>the</strong> signal sample<br />

<strong>and</strong> <strong>the</strong> SM background. δ i is instead a Gaussian smearing term. This term accounts for<br />

<strong>the</strong> SM model background in a statistically independent way. It is a value r<strong>and</strong>omly picked


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 117<br />

from a Gaussian distribution <strong>of</strong> mean zero <strong>and</strong> st<strong>and</strong>ard deviation equal to σ SM,i . In fig. 6.9<br />

one can see a graphical representation <strong>of</strong> <strong>the</strong> algorithm. Please note that <strong>the</strong> uncertainties<br />

presented in <strong>the</strong> plot are <strong>the</strong> sum under quadrature <strong>of</strong> <strong>the</strong> data sample uncertainties <strong>and</strong> <strong>the</strong><br />

template sample uncertainties. The plotted values <strong>and</strong> <strong>the</strong>ir uncertainties do not however<br />

include any SM background component.<br />

<strong>Top</strong> <strong>Mass</strong>: template − data<br />

Events template − Events data<br />

600<br />

400<br />

200<br />

0<br />

−200<br />

Template 173.5GeV<br />

Template 174.5GeV<br />

−400<br />

−600<br />

150 160 170 180 190 200 210<br />

<strong>Top</strong> <strong>Mass</strong> (GeV)<br />

Figure 6.9: Graphical representation <strong>of</strong> <strong>the</strong> template fitting technique. The curves show <strong>the</strong><br />

difference in <strong>the</strong> number <strong>of</strong> events in each bin between <strong>the</strong> template sample <strong>and</strong> <strong>the</strong> data<br />

sample<br />

Once <strong>the</strong> χ 2 <strong>of</strong> <strong>the</strong> data sample with all <strong>the</strong> three template samples is calculated a<br />

parabola is fit on <strong>the</strong> three mass points <strong>and</strong> <strong>the</strong> minimum <strong>of</strong> <strong>the</strong> parabola, χ 2 min, is determined.<br />

The mass value corresponding to χ 2 min represents <strong>the</strong> reconstructed mass <strong>of</strong> <strong>the</strong> top<br />

quark. The uncertainty <strong>of</strong> <strong>the</strong> data sample is <strong>the</strong>n defined as <strong>the</strong> mass point for which<br />

<strong>the</strong> following relationship is satisfied: χ 2 = χ 2 min + 1 [61]. Fig. 6.10 displays a graphical<br />

illustration <strong>of</strong> <strong>the</strong> above described χ 2 minimization <strong>and</strong> uncertainty estimation. It has to<br />

be noted that ideally more than three template samples should be used for <strong>the</strong> presented<br />

minimization. However <strong>the</strong> generation <strong>of</strong> millions <strong>of</strong> MC events is very CPU intensive. Given<br />

<strong>the</strong> small amount <strong>of</strong> available resources it has been decided to limit <strong>the</strong> number <strong>of</strong> template


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 118<br />

samples to <strong>the</strong> bare minimum <strong>of</strong> three.<br />

Figure 6.10: Graphical representation <strong>of</strong> <strong>the</strong> template fitting minimization. The red crosses<br />

represent <strong>the</strong>m measured χ 2 values (please note that in <strong>the</strong> case <strong>of</strong> this analysis only three<br />

such points are calculated).<br />

This method clearly does not suffer from most <strong>of</strong> <strong>the</strong> issues described in <strong>the</strong> previous<br />

section. In fact <strong>the</strong> problems <strong>of</strong> ’kinematic reflection’, combinatorial background <strong>and</strong> <strong>of</strong> <strong>the</strong><br />

t¯t = b¯bq¯qq¯q approximation are automatically taken care <strong>of</strong>. The precise shape <strong>of</strong> <strong>the</strong> curve<br />

does not in fact need to be known, but only <strong>the</strong> difference between <strong>the</strong> two curves; as both<br />

curves incorporate <strong>the</strong>se effects <strong>the</strong>y will not influence <strong>the</strong> final results <strong>of</strong> <strong>the</strong> algorithm.<br />

Additionally given that <strong>the</strong> algorithm works on a bin by bin basis <strong>the</strong> inconsistency derived<br />

from <strong>the</strong> weighting <strong>of</strong> events is substantially reduced. The only assumption that <strong>the</strong> method<br />

uses is that <strong>the</strong> χ 2 distribution can be approximated, near its minimum, as a parabolic<br />

function.<br />

Before analysing <strong>the</strong> results from <strong>the</strong> template fitting one must be aware that two different<br />

ways <strong>of</strong> renormalizing <strong>the</strong> template samples can be applied. One can renormalize <strong>the</strong> number<br />

<strong>of</strong> events to <strong>the</strong> observed number (<strong>the</strong> number <strong>of</strong> signal events present in <strong>the</strong> data sample) in<br />

which case only <strong>the</strong> shape <strong>of</strong> <strong>the</strong> distribution is used in determining <strong>the</strong> mass <strong>and</strong> its error.<br />

Alternatively one can consider also <strong>the</strong> fact that, at different top masses, <strong>the</strong> cross sections<br />

will be different <strong>and</strong> <strong>the</strong>refore use this information. Of course <strong>the</strong> later result will fur<strong>the</strong>r


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 119<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong><br />

Event Selection Fit Range (GeV) χ 2 min /NDF <strong>Mass</strong> (GeV) σ (GeV)<br />

No Kinematic fit 120-200 148/159 174.135 0.090<br />

No Kinematic fit 140-180 83/79 174.173 0.097<br />

Kinematic fit 150-200 94/99 174.033 0.053<br />

Kinematic fit 165-200 63/69 173.991 0.056<br />

Kinematic fit 165-185 42/39 173.990 0.058<br />

Probability > 1% 150-200 101/99 174.018 0.049<br />

Probability > 1% 165-200 61/69 174.013 0.049<br />

Probability > 1% 165-185 41/39 174.010 0.053<br />

Probability > 5% 150-200 97/99 174.024 0.050<br />

Probability > 5% 165-200 61/69 174.017 0.050<br />

Probability > 5% 165-185 38/39 174.17 0.053<br />

Probability > 10% 150-200 100/99 174.012 0.050<br />

Probability > 10% 165-200 68/69 174.012 0.051<br />

Probability > 10% 165-185 40/39 174.14 0.052<br />

Probability > 20% 150-200 91/99 174.013 0.049<br />

Probability > 20% 165-200 68/69 174.010 0.050<br />

Probability > 20% 165-185 39/39 174.022 0.052<br />

Probability > 30% 150-200 98/99 174.021 0.049<br />

Probability > 30% 165-200 68/69 174.020 0.050<br />

Probability > 30% 165-185 47/39 174.027 0.052<br />

Table 6.4: The template fitting results using <strong>the</strong> renormalized cross sections are displayed<br />

for different event selections <strong>and</strong> different fit ranges.<br />

depend on <strong>the</strong> accuracy <strong>of</strong> <strong>the</strong> cross section calculation.<br />

Table 6.4 shows <strong>the</strong> result obtained by using <strong>the</strong> first method. It can be immediately<br />

seen that <strong>the</strong> estimated statistical uncertainty is <strong>of</strong> <strong>the</strong> order <strong>of</strong> 50 MeV it can also be noted<br />

that while applying <strong>the</strong> kinematic fitting <strong>and</strong> <strong>the</strong> rejecting <strong>the</strong> events that have probabilities<br />

<strong>of</strong> matching <strong>the</strong> constraints smaller than 1% is beneficial to <strong>the</strong> final result, but any tighter<br />

selection does not improve <strong>the</strong> performance. Given fig. 6.7 this is an expected effect; also<br />

increasing <strong>the</strong> fit width beyond <strong>the</strong> 165 GeV-185 GeV region does not substantially improve<br />

<strong>the</strong> result, as is to be expected from fig. 6.9.<br />

Table 6.5 instead presents <strong>the</strong> kinematic fitting results when <strong>the</strong> cross sections from<br />

<strong>the</strong> MC generator are used. Note that in <strong>the</strong> presented calculation <strong>the</strong> MC cross sections<br />

are assumed to have no uncertainty. As more information is used for <strong>the</strong> template fit a<br />

slightly better mass resolution is expected. In fact this can be clearly seen <strong>and</strong> <strong>the</strong> statistical


6.4 <strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong> 120<br />

<strong>Top</strong> <strong>Quark</strong> <strong>Mass</strong><br />

Event Selection Fit Range (GeV) χ 2 min /NDF <strong>Mass</strong> (GeV) σ (GeV)<br />

No Kinematic fit 120-200 151/159 174.198 0.062<br />

No Kinematic fit 140-180 83/79 174.192 0.079<br />

Kinematic fit 150-200 94/99 174.033 0.049<br />

Kinematic fit 165-200 66/69 174.035 0.050<br />

Kinematic fit 165-185 42/39 174.010 0.056<br />

Probability > 1% 150-200 101/99 174.046 0.046<br />

Probability > 1% 165-200 62/69 174.048 0.045<br />

Probability > 1% 165-185 41/39 174.025 0.051<br />

Probability > 5% 150-200 96/99 174.049 0.047<br />

Probability > 5% 165-200 62/69 174.049 0.047<br />

Probability > 5% 165-185 38/39 174.030 0.051<br />

Probability > 10% 150-200 99/99 174.033 0.047<br />

Probability > 10% 165-200 68/69 174.039 0.047<br />

Probability > 10% 165-185 41/39 174.23 0.051<br />

Probability > 20% 150-200 89/99 174.013 0.047<br />

Probability > 20% 165-200 63/69 174.031 0.050<br />

Probability > 20% 165-185 39/39 174.031 0.051<br />

Probability > 30% 150-200 97/99 174.035 0.047<br />

Probability > 30% 165-200 73/69 174.040 0.047<br />

Probability > 30% 165-185 46/39 174.035 0.051<br />

Table 6.5: The template fitting results using <strong>the</strong> MC cross sections are displayed for different<br />

event selections <strong>and</strong> different fit ranges.<br />

resolution achievable is now lower at 45 MeV.<br />

It is now important to determine <strong>the</strong> stability <strong>and</strong> reliability <strong>of</strong> <strong>the</strong> template fitting<br />

method. A few simple tests can be devised. Theoretically <strong>the</strong> value <strong>of</strong> χ 2 min/NDF should<br />

be ≈ 1 in all presented cases. Looking at table 6.4 <strong>and</strong> table 6.5 it is clear that this is<br />

<strong>the</strong> case. Additionally, if <strong>the</strong> measurements are statistically independent, <strong>the</strong>y should form<br />

a Gaussian distribution around <strong>the</strong> central value. Given that <strong>the</strong> samples used are always<br />

<strong>the</strong> same <strong>the</strong> measurements are clearly not independent, never <strong>the</strong> less <strong>the</strong> fact that in<br />

most <strong>of</strong> <strong>the</strong> cases <strong>the</strong> measurements fall inside one σ is reassuring. The accuracy <strong>of</strong> <strong>the</strong><br />

measurement also performs as expected when <strong>the</strong> fit ranges or event selections are varied.<br />

Finally <strong>the</strong> measurement should also not be dependent on <strong>the</strong> way it is displayed, for example<br />

<strong>the</strong> variation <strong>of</strong> bin sizes should not change <strong>the</strong> final result. This has been also tested <strong>and</strong><br />

indeed <strong>the</strong> results have been found stable. In order to fur<strong>the</strong>r test <strong>the</strong> stability <strong>and</strong> reliability<br />

<strong>of</strong> <strong>the</strong> used template fitting method a toy MC study has also been performed. In this study


6.5 <strong>Cross</strong> <strong>Section</strong> <strong>of</strong> <strong>the</strong> b¯bq¯qq¯q Process 121<br />

<strong>the</strong> template method has been compared to <strong>the</strong> curve fitting method for a series <strong>of</strong> known<br />

distributions. The results between <strong>the</strong> two methods have been found to be always consistent.<br />

A final word has to be said on <strong>the</strong> interpretation <strong>of</strong> <strong>the</strong> template fitting result. By<br />

considering equation 6.7 one can immediately see that <strong>the</strong> result is dependent on <strong>the</strong> error<br />

<strong>of</strong> <strong>the</strong> template. Given that <strong>the</strong> template sample is four times larger than <strong>the</strong> signal data<br />

sample <strong>the</strong> relative uncertainty is: σ t,i ≈ 1/2σ y,i . Similarly since <strong>the</strong> purity <strong>of</strong> <strong>the</strong> data<br />

sample is 75%-80% also σ SM,i ≈ 1/2σ y,i . It is <strong>the</strong>n possible to make a back <strong>of</strong> <strong>the</strong> envelope<br />

estimation <strong>of</strong> <strong>the</strong> uncertainty achievable with this method in <strong>the</strong> limit where σ t,i


6.6 Summary 122<br />

b¯bq¯qq¯q <strong>Cross</strong> <strong>Section</strong><br />

Event Selection Efficiency Purity <strong>Cross</strong> <strong>Section</strong> (fb) σ (fb) σ (%)<br />

No kinematic fit 51.5% 68.7% 287.4 1.3 0.45%<br />

Kinematic fit 48.6% 74.8% 287.2 1.3 0.45%<br />

Probability > 1% 35.4% 83.6% 287.2 1.7 0.59%<br />

Probability > 5% 31.9% 85.0% 288.5 1.5 0.52%<br />

Probability > 10% 30.0% 85.8% 287.3 1.6 0.56%<br />

Probability > 20% 27.3% 86.8% 287.5 1.6 0.56%<br />

Probability > 30% 25.3% 87.4% 287.2 1.6 0.55%<br />

Table 6.6: The b¯bq¯qq¯q cross section as calculated using different event selections. The event<br />

selection is based on <strong>the</strong> probability that <strong>the</strong> events match <strong>the</strong> kinematic fitting constraints.<br />

deteriorate if any fur<strong>the</strong>r event selection is applied due to <strong>the</strong> reduced statistics. One can<br />

now compare <strong>the</strong> obtained result with <strong>the</strong> actual cross section at MC level (285.8fb) <strong>and</strong><br />

notice <strong>the</strong>ir statistical agreement.<br />

It is important to remember that this is not actually <strong>the</strong> cross section <strong>of</strong> <strong>the</strong> e + e − →<br />

t¯t → b¯bq¯qq¯q process, but <strong>of</strong> <strong>the</strong> b¯bq¯qq¯q final state. However <strong>the</strong> absolute uncertainty <strong>of</strong> this<br />

measurement is equivalent in <strong>the</strong> approximation where <strong>the</strong>re is no error on <strong>the</strong> efficiency<br />

calculation <strong>and</strong> <strong>the</strong> efficiencies remain equal. This can be seen by noting that no term would<br />

change in <strong>the</strong> <strong>the</strong> uncertainty, expressed as: √ N ALL /(ε ∫ Ldt). Incidentally if <strong>the</strong> efficiency<br />

increases, which is what one would expect, given that some event selections are targeted to<br />

<strong>the</strong> reconstruction <strong>of</strong> <strong>the</strong> top quark <strong>and</strong> W boson masses, <strong>the</strong> absolute error will decrease as<br />

a result.<br />

6.6 Summary<br />

This chapter showed that <strong>the</strong> precision achievable when measuring <strong>the</strong> top mass in <strong>the</strong><br />

hadronic t¯t is between 45 MeV <strong>and</strong> 49 MeV, depending on <strong>the</strong> used assumptions. It also<br />

established that <strong>the</strong> cross section <strong>of</strong> <strong>the</strong> b¯bq¯qq¯q process can be measured with a precision <strong>of</strong><br />

1.3 fb. All <strong>the</strong> precisions are quoted without systematic uncertainties.<br />

The chapter also described <strong>the</strong> event selection process used in <strong>the</strong> hadronic t¯t study.<br />

The importance <strong>of</strong> <strong>the</strong> b tagging <strong>and</strong> kinematic fitting algorithms for <strong>the</strong> event selection <strong>and</strong>


6.6 Summary 123<br />

reconstruction has been outlined. As this is a feasibility study, not all <strong>the</strong> used analysis tools<br />

were yet optimized. It is <strong>the</strong>refore foreseen that <strong>the</strong> achievable resolutions will improve.


Chapter 7<br />

<strong>Quark</strong> Charge <strong>and</strong> Forward Backward<br />

Asymmetries<br />

From <strong>the</strong> list <strong>of</strong> observables that have been described in <strong>the</strong> introductory <strong>the</strong>ory chapter only<br />

<strong>the</strong> forward backward asymmetries <strong>of</strong> <strong>the</strong> top quark <strong>and</strong> <strong>of</strong> its decay product, <strong>the</strong> bottom<br />

quark, still need to be addressed. As has been already described in <strong>the</strong> relevant section <strong>the</strong>se<br />

will provide an insight into <strong>the</strong> top Electro-Weak couplings. More specifically <strong>the</strong> top quark<br />

forward backward asymmetry tests <strong>the</strong> Z tt coupling, while <strong>the</strong> bottom quark observable<br />

measures <strong>the</strong> W tb coupling.<br />

Performing <strong>the</strong> desired measurements in a leptonic or semi-leptonic t¯t channel should be<br />

a relatively easy task. The charges <strong>of</strong> <strong>the</strong> leptons (µ <strong>and</strong> e) are in fact well known as <strong>the</strong>y are<br />

ra<strong>the</strong>r simple to reconstruct. From <strong>the</strong>se it is <strong>the</strong>n possible to unambiguously reconstruct<br />

<strong>the</strong> charges <strong>of</strong> <strong>the</strong> top quark from which <strong>the</strong> lepton originated <strong>and</strong> <strong>of</strong> <strong>the</strong> matching bottom<br />

quark. Differently in <strong>the</strong> hadronic b¯bq¯qq¯q, which has <strong>the</strong> largest statistics, no such solution is<br />

possible <strong>and</strong> a method must be devised in order to calculate <strong>the</strong> charge <strong>of</strong> <strong>the</strong> bottom quark<br />

by using <strong>the</strong> jet originating from it. As <strong>the</strong> charge <strong>of</strong> <strong>the</strong> b quark is -1/3 <strong>and</strong> <strong>the</strong> charge <strong>of</strong><br />

<strong>the</strong> ¯b quark is +1/3 this is equivalent to distinguishing <strong>the</strong> b quark from <strong>the</strong> ¯b quark. Several<br />

techniques used to reconstruct <strong>the</strong> quark charge are described below.<br />

124


7.1 Vertex Charge 125<br />

7.1 Vertex Charge<br />

The first method that has been developed is <strong>the</strong> vertex charge. This method has been<br />

implemented by <strong>the</strong> author into <strong>the</strong> LCFI Vertex s<strong>of</strong>tware <strong>and</strong> as such it aims not only to<br />

reconstruct <strong>the</strong> charge <strong>of</strong> <strong>the</strong> bottom quarks but also <strong>of</strong> <strong>the</strong> charm quarks. Slightly different<br />

assumptions are made for different flavours <strong>and</strong> it is <strong>the</strong>refore important to first determine<br />

<strong>the</strong> quark flavour. To calculate <strong>the</strong> vertex charge <strong>of</strong> a bottom jet all tracks that are thought<br />

to belong to <strong>the</strong> decay chain are identified <strong>and</strong> <strong>the</strong> vertex charge is given by <strong>the</strong> sign <strong>of</strong> <strong>the</strong><br />

sum <strong>of</strong> <strong>the</strong>ir charges. The rare cases in which <strong>the</strong> vertex charge has an absolute value above<br />

2, or below -2, indicating significant mistakes in <strong>the</strong> assignment <strong>of</strong> tracks to <strong>the</strong> decay chain,<br />

are discarded in <strong>the</strong> determination <strong>of</strong> <strong>the</strong> quark charge sign. To assign tracks to <strong>the</strong> b quark<br />

decay chain <strong>the</strong> same approach is used as <strong>the</strong> one described in <strong>the</strong> flavour tagging chapter<br />

<strong>of</strong> this <strong>the</strong>sis for <strong>the</strong> momentum corrected vertex mass. The parameters used for a charm<br />

jet can instead be found in Appendix B.<br />

Fig. 7.1 demonstrates <strong>the</strong> ability to distinguish <strong>the</strong> parton charge <strong>of</strong> heavy flavour jets.<br />

As this is part <strong>of</strong> <strong>the</strong> LCFI vertex s<strong>of</strong>tware it is presented for <strong>the</strong> LDC detector using <strong>the</strong> same<br />

91.2 GeV sample as <strong>the</strong> one described in Chapter 5. No information about <strong>the</strong> parton charge<br />

is obtained for <strong>the</strong> case <strong>of</strong> a reconstructed vertex charge <strong>of</strong> zero, <strong>the</strong> dominant contribution<br />

to which is from decays <strong>of</strong> neutral B-mesons with vertices with even number <strong>of</strong> tracks. Good<br />

charge separation can instead be achieved by selecting jets with non-zero reconstructed<br />

vertex charge, mostly coming from charged B-mesons. Table 7.1 quantifies <strong>the</strong> performance<br />

<strong>of</strong> parton charge identification with st<strong>and</strong>ard LCFI Vertex parameters. The ratio <strong>of</strong> correct<br />

over wrong charge identification is better for c jets than for b jets, <strong>and</strong> <strong>the</strong> reconstruction<br />

quality is found to degrade slightly with increasing collision energy, as expected.<br />

This method however suffers from a series <strong>of</strong> inefficiencies: only a fixed point on <strong>the</strong><br />

purity efficiency trade-<strong>of</strong>f is available to <strong>the</strong> final user, <strong>the</strong> method is not applicable in case<br />

no vertex has been found in <strong>the</strong> jet <strong>and</strong> it is not designed to distinguish between b <strong>and</strong> ¯b<br />

quarks in <strong>the</strong> case <strong>of</strong> neutral mesons. Incidentally b (¯b) quarks fragment into neutral mesons<br />

in more than 50% <strong>of</strong> <strong>the</strong> events. While performing this final analysis it has been decided


htemp<br />

Entries 1660<br />

Mean 0.3177<br />

RMS 0.957<br />

htemp<br />

Entries 791<br />

Mean 0.2946<br />

RMS 0.7427<br />

htemp<br />

Entries 733<br />

Mean 0.7913<br />

RMS 0.8185<br />

htemp<br />

Entries 733<br />

Mean 0.7913<br />

RMS 0.8185<br />

htemp<br />

Entries 733<br />

Mean 0.7913<br />

RMS 0.8185<br />

htemp<br />

Entries 440<br />

Mean 0.5455<br />

RMS 0.7964<br />

htemp<br />

Entries 733<br />

Mean 0.7913<br />

RMS 0.8185<br />

htemp<br />

Entries 733<br />

Mean 0.7913<br />

RMS 0.8185<br />

7.2 Momentum Weighted Vertex <strong>and</strong> Jet Charge 126<br />

Vertex Charge b <strong>Quark</strong><br />

Vertex Charge b <strong>Quark</strong> − Charged Hadrons Only<br />

Jets<br />

700<br />

600<br />

Q_MC>0<br />

Jets<br />

400<br />

350<br />

Q_MC>0<br />

500<br />

Q_MC


VertexCONTChargeBquark<br />

Entries<br />

371207<br />

Mean 0.1121<br />

RMS 0.2511<br />

7.2 Momentum Weighted Vertex <strong>and</strong> Jet Charge 127<br />

Vertex Charge<br />

MC Type<br />

Reconstructed Vertex Charge<br />

Correct Ambiguous Wrong No Vertex/Tracks<br />

91.2 GeV b jets 32.8% 33.1% 12.2% 21.9%<br />

c jets 14.6% 27.4% 3.2% 54.8%<br />

500 GeV b jets 27.0% 23.9% 13.5% 35.6%<br />

c jets 13.0% 24.5% 4.2% 58.4%<br />

Table 7.1: Performance <strong>of</strong> parton charge identification using LCFI Vertex default settings.<br />

Any non-zero vertex charge measurement with <strong>the</strong> same (opposite) sign as <strong>the</strong> parton charge<br />

is labelled correct (wrong). Jets with a reconstructed vertex charge <strong>of</strong> zero are called ambiguous.<br />

Jets where no vertices are present, or where no tracks pass <strong>the</strong> strict track quality<br />

criteria for <strong>the</strong> vertex charge measurement (applicable only for c quark jets), are listed in<br />

<strong>the</strong> rightmost column.<br />

where Q T is <strong>the</strong> charge <strong>of</strong> <strong>the</strong> track, p T is <strong>the</strong> momentum <strong>of</strong> <strong>the</strong> track <strong>and</strong> k is a user defined<br />

parameter; as expected <strong>the</strong> sums are performed only on <strong>the</strong> tracks associated with <strong>the</strong> vertex.<br />

The performance <strong>of</strong> such method for discriminating <strong>the</strong> parton charge can be seen in fig.<br />

7.2(a). Differently from <strong>the</strong> previous section <strong>the</strong> evens used in <strong>the</strong> plot are from <strong>the</strong> 174.0<br />

GeV b¯bq¯qq¯q template sample. Only b quark jets at MC level with a neural net b tag higher<br />

than 0.4 are being included.<br />

Events<br />

8000<br />

7000<br />

6000<br />

5000<br />

Momentum Weighted Vertex Charge<br />

bbar quark<br />

b quark<br />

purity<br />

Momentum Weighted Vertex Charge − Performance<br />

0.85<br />

0.8<br />

0.75<br />

4000<br />

0.7<br />

3000<br />

2000<br />

1000<br />

0.65<br />

0.6<br />

k = 0.3<br />

k = 0.1<br />

k = 1.0<br />

0<br />

−1 −0.5 0 0.5 1<br />

Momentum Weighted Vertex Charge<br />

0.55<br />

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9<br />

efficiency<br />

(a)<br />

(b)<br />

Figure 7.2: Momentum weighted vertex charge a) distributions for b <strong>and</strong> ¯b quark jets, k<br />

parameter used = 0.3 b) purity vs efficiency curves for different weighting parameters k.<br />

Analysis performed on a b¯bq¯qq¯q sample with 500 GeV centre <strong>of</strong> mass energy.<br />

Fig 7.2(b) instead shows <strong>the</strong> purity/efficiency performance for different values <strong>of</strong> <strong>the</strong><br />

k parameter. As a comparison it can be noted that <strong>the</strong> st<strong>and</strong>ard vertex charge, applied<br />

to <strong>the</strong> same sample, obtains a purity <strong>of</strong> 76.6% with an efficiency <strong>of</strong> 44.5%. For <strong>the</strong> same


VertexCONTChargeBquark<br />

Entries<br />

371207<br />

Mean 0.06994<br />

RMS 0.1782<br />

7.2 Momentum Weighted Vertex <strong>and</strong> Jet Charge 128<br />

efficiency value <strong>the</strong> results with <strong>the</strong> new method are 76.8% if <strong>the</strong> k parameter is set to 0.3<br />

<strong>and</strong> 77.2% if <strong>the</strong> k parameter is set to 0.1. The k value used in <strong>the</strong> rest <strong>of</strong> <strong>the</strong> analysis is<br />

0.3. In fact, although <strong>the</strong> algorithm with this setting performs marginally worse than with<br />

a k value <strong>of</strong> 0.1 or 0.05 at high efficiencies, it performs substantially better when a high<br />

purity is required. The uncertainty on <strong>the</strong> presented purity measurements, calculated with<br />

<strong>the</strong> formula σ p = √ p(1 − p)/n, is 0.1%, where p is <strong>the</strong> purity.<br />

However all <strong>the</strong> jets without a secondary vertex have been completely excluded from<br />

this calculation. These represent 3.2% <strong>of</strong> <strong>the</strong> total number <strong>of</strong> jets originating from a b<br />

quark, assuming <strong>the</strong> event selection <strong>of</strong> a neural net b-tag higher than 0.4. Since ideally<br />

one would like to be able to determine also <strong>the</strong> charge <strong>of</strong> <strong>the</strong>se jets an additional method,<br />

<strong>the</strong> momentum weighted jet charge [64], has been implemented. As <strong>the</strong> name suggest <strong>the</strong><br />

algorithm is identical to <strong>the</strong> one already described in equation 7.1. The track selection<br />

process is however slightly different; <strong>the</strong> sum is now performed over all <strong>the</strong> tracks present in<br />

<strong>the</strong> jet ra<strong>the</strong>r than in <strong>the</strong> vertex.<br />

Events<br />

12000<br />

10000<br />

Momentum Weighted Jet Charge<br />

bbar quark<br />

b quark<br />

purity<br />

Momentum Weighted Jet Charge − Performance<br />

0.85<br />

0.8<br />

8000<br />

0.75<br />

6000<br />

0.7<br />

4000<br />

2000<br />

0.65<br />

0.6<br />

k = 0.3<br />

k = 0.1<br />

k = 1.0<br />

0<br />

−1 −0.5 0 0.5 1<br />

Momentum Weighted Jet Charge<br />

0.55<br />

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9<br />

efficiency<br />

(a)<br />

(b)<br />

Figure 7.3: Momentum weighted jet charge a) distributions for b quark <strong>and</strong> ¯b quark jets,<br />

k parameter used = 0.3 b) purity vs efficiency curves for different weighting parameters k.<br />

Analysis performed on a b¯bq¯qq¯q sample with 500 GeV centre <strong>of</strong> mass.<br />

The performance <strong>of</strong> <strong>the</strong> algorithm can be seen in fig. 7.3(a). Fig. 7.3(b) instead shows<br />

<strong>the</strong> dependence <strong>of</strong> <strong>the</strong> performance on <strong>the</strong> value used for <strong>the</strong> k parameter. Also in this<br />

case <strong>the</strong> value <strong>of</strong> 0.3 has been chosen as optimal. At first inspection <strong>the</strong> performance <strong>of</strong> <strong>the</strong><br />

two algorithms appears very similar; <strong>the</strong> vertex charge performing just slightly better. It is


7.2 Momentum Weighted Vertex <strong>and</strong> Jet Charge 129<br />

<strong>the</strong>refore worth analysing whe<strong>the</strong>r <strong>the</strong> two methods are conveying <strong>the</strong> same information, in<br />

which case using just one algorithm could suffice, or whe<strong>the</strong>r by combining <strong>the</strong> two algorithms<br />

one can extract additional information <strong>and</strong> hence improve <strong>the</strong> final result.<br />

purity<br />

1<br />

0.95<br />

0.9<br />

0.85<br />

Momentum Weighted Vertex Charge<br />

purity<br />

1<br />

0.95<br />

0.9<br />

0.85<br />

Momentum Weighted Jet Charge<br />

b/bbar<br />

B−/B+<br />

B0bar/B0<br />

0.8<br />

0.8<br />

0.75<br />

0.75<br />

0.7<br />

0.65<br />

0.6<br />

b/bbar<br />

B−/B+<br />

B0bar/B0<br />

0.7<br />

0.65<br />

0.6<br />

0.55<br />

0.55<br />

0.5<br />

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

efficiency<br />

0.5<br />

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1<br />

efficiency<br />

(a)<br />

(b)<br />

Figure 7.4: Performance <strong>of</strong> a) momentum weighted vertex charge <strong>and</strong> b) momentum weighted<br />

jet charge in distinguishing B + from B − <strong>and</strong> ¯B 0 from B 0 .<br />

The performance <strong>of</strong> <strong>the</strong> algorithms has <strong>the</strong>refore been tested under <strong>the</strong> presence <strong>of</strong> only<br />

charged mesons B + <strong>and</strong> B − <strong>and</strong> only neutral mesons B 0 <strong>and</strong> ¯B 0 . Fig. 7.4(a) show that<br />

<strong>the</strong> momentum weighted vertex charge is able to distinguish very well between B + <strong>and</strong> B − ,<br />

while having almost no discriminatory power when it comes to B 0 <strong>and</strong> ¯B 0 . Differently <strong>the</strong><br />

performance <strong>of</strong> <strong>the</strong> momentum weighted jet charge, fig. 7.4(b), is more similar between <strong>the</strong><br />

two cases <strong>and</strong> <strong>the</strong> algorithm can separate reasonably well also B 0 <strong>and</strong> ¯B 0 . It also should<br />

be noted that neutral mesons (both B 0 <strong>and</strong> B S ) can oscillate. While in this process <strong>the</strong><br />

charge <strong>of</strong> <strong>the</strong> meson does not change, <strong>the</strong> charge <strong>of</strong> <strong>the</strong> b quark does. This introduces a<br />

fur<strong>the</strong>r smearing to <strong>the</strong> distributions. In this analysis, while <strong>the</strong> B 0 / ¯B 0 (<strong>and</strong> <strong>the</strong> B S / ¯ B S )<br />

oscillations have been included at MC level <strong>the</strong>y have not been studied in detail. Indeed<br />

<strong>the</strong> oscillation <strong>of</strong> <strong>the</strong> neutral mesons is expected to have a ra<strong>the</strong>r small effect on <strong>the</strong> B 0<br />

mesons, which have a frequency <strong>of</strong> oscillation <strong>of</strong> ≈ 0.5 × 10 12 s − 1, which implies a period<br />

<strong>of</strong> oscillation larger than <strong>the</strong>ir mean lifetime (≈ 1.5 × 10 −12 s). Hence a small effect is<br />

expected. Differently in <strong>the</strong> case <strong>of</strong> B S mesons <strong>the</strong> effect should be large as <strong>the</strong> frequency (≈<br />

18 × 10 12 s − 1) implies a period <strong>of</strong> oscillation that is much smaller than <strong>the</strong>ir mean lifetime<br />

(≈ 1.5 × 10 −12 s). Hence <strong>the</strong> smearing in <strong>the</strong> case <strong>of</strong> B s mesons is much more significant


7.2 Momentum Weighted Vertex <strong>and</strong> Jet Charge 130<br />

<strong>and</strong> <strong>the</strong> techniques presented for parton charge discrimination are hence less effective. For<br />

<strong>the</strong> purpose <strong>of</strong> evaluating <strong>the</strong> performance <strong>of</strong> <strong>the</strong> algorithms <strong>the</strong> original parton produced<br />

at <strong>the</strong> Feynman diagram level has been used. Once again it must be remembered that all<br />

<strong>the</strong> results regarding <strong>the</strong> momentum weighted vertex charge do not include jets where no<br />

secondary vertex has been found ( about 3.2% <strong>of</strong> <strong>the</strong> b jets, after <strong>the</strong> event selection described<br />

in <strong>the</strong> previous chapter has been performed).<br />

The difference between <strong>the</strong> two algorithms derives from <strong>the</strong> different physical principle<br />

that <strong>the</strong>y are based on. The jet charge algorithm is based on exploiting <strong>the</strong> fact that <strong>the</strong><br />

most energetic hadrons have a higher probability <strong>of</strong> containing <strong>the</strong> charge <strong>of</strong> <strong>the</strong> quark that<br />

initiated <strong>the</strong> jet. In order to explain <strong>the</strong> reasoning one needs to start by making <strong>the</strong> fair<br />

assumption that <strong>the</strong> original quark is also <strong>the</strong> one with <strong>the</strong> <strong>the</strong> highest energy in <strong>the</strong> jet.<br />

In order for <strong>the</strong> original quark to undergo hadronization a new quark pair must <strong>the</strong>n be<br />

produced. The hadronization process <strong>of</strong> <strong>the</strong> original quark absorbs only one <strong>of</strong> <strong>the</strong> two<br />

quarks. If after <strong>the</strong> fragmentation <strong>the</strong> resulting meson is charged <strong>the</strong> reasoning becomes<br />

trivial, given that all <strong>the</strong> o<strong>the</strong>r quarks have a lower momentum. If instead <strong>the</strong> resulting<br />

meson is neutral, <strong>the</strong>n <strong>the</strong> second most energetic meson has to be considered. Because <strong>of</strong><br />

momentum conservation considerations this meson most probably contains <strong>the</strong> second quark<br />

produced in <strong>the</strong> mentioned pair production. Its charge must <strong>the</strong>n be <strong>the</strong> same as <strong>the</strong> charge<br />

<strong>of</strong> <strong>the</strong> original quark. The charge <strong>of</strong> <strong>the</strong> second meson can hence be ei<strong>the</strong>r zero or <strong>of</strong> <strong>the</strong> same<br />

absolute value as <strong>the</strong> original quark. If <strong>the</strong> charge is zero <strong>the</strong> same reasoning can be repeated<br />

for <strong>the</strong> next most energetic meson. A much more detailed explanation <strong>of</strong> this principle can<br />

be found in [65].<br />

The principle behind <strong>the</strong> vertex charged algorithms is instead based on precisely determining<br />

all <strong>the</strong> tracks that derive from <strong>the</strong> vertex. In this case <strong>the</strong> aim is to directly determine<br />

<strong>the</strong> charge <strong>of</strong> <strong>the</strong> meson. The momentum weighting conveys some information on <strong>the</strong> reliability<br />

<strong>of</strong> <strong>the</strong> track. Additionally, because <strong>of</strong> <strong>the</strong> same reasons as <strong>the</strong> one discussed in <strong>the</strong><br />

case <strong>of</strong> <strong>the</strong> jet charge, it also contains some indication on <strong>the</strong> probability that <strong>the</strong> track<br />

derives from <strong>the</strong> original B meson. Given <strong>the</strong> ra<strong>the</strong>r poor performance <strong>of</strong> <strong>the</strong> vertex charge


RatioVertexCONTChargeBquark<br />

Entries 100<br />

Mean 0.6233<br />

RMS 0.4541<br />

R Distribution − Momentum Weighted Jet Charge<br />

Entries 100<br />

Mean 0.7089<br />

RMS 0.391<br />

7.3 Combined Charge 131<br />

in distinguishing between B 0 <strong>and</strong> ¯B 0 such information is only minimal.<br />

7.3 Combined Charge<br />

As <strong>the</strong> two different methods contain ra<strong>the</strong>r different information it has been decided to<br />

combine <strong>the</strong>m into a single discriminatory variable. Ideally one would like to include also<br />

o<strong>the</strong>r discriminatory parameters (examples are <strong>the</strong> lepton charge <strong>and</strong> <strong>the</strong> dipole charge<br />

algorithms), however, do to a lack <strong>of</strong> manpower, <strong>the</strong>se have not yet been implemented.<br />

The method chosen for <strong>the</strong> combination <strong>of</strong> <strong>the</strong> parameters is <strong>the</strong> one presented in [66].<br />

If fi b (x i ) is <strong>the</strong> probability density functions for <strong>the</strong> b quark for variable x i <strong>and</strong> f¯b<br />

i (x i ) is <strong>the</strong><br />

equivalent distribution for <strong>the</strong> ¯b quark <strong>the</strong>n for each discriminating variable x i a parameter<br />

r i is defined:<br />

r i = f¯b i (x i )<br />

f b i (x i)<br />

(7.2)<br />

which effectively is <strong>the</strong> ratio <strong>of</strong> <strong>the</strong> probability distributions; <strong>the</strong> index i denotes <strong>the</strong> discriminating<br />

variable. While ideally one would perform such calculation in functional form in<br />

this analysis this has been approximated to <strong>the</strong> ratio <strong>of</strong> entries in each bin <strong>of</strong> <strong>the</strong> histograms<br />

presented in fig. 7.2(a) <strong>and</strong> fig. 7.3(a).<br />

R Distribution − Momentum Weighted Vertex Charge<br />

R Distribution − Momentum Weighted Jet Charge<br />

r<br />

2<br />

10<br />

r<br />

2<br />

10<br />

10<br />

10<br />

1<br />

1<br />

−1<br />

10<br />

−1<br />

10<br />

−2<br />

10<br />

−1 −0.5 0 0.5 1<br />

Momentum Weighted Vertex Charge<br />

−2<br />

10<br />

−1 −0.5 0 0.5 1<br />

Momentum Weighted Jet Charge<br />

(a)<br />

(b)<br />

Figure 7.5: f¯b<br />

i (x i )/f b i (x i ) for a) momentum weighted vertex charge <strong>and</strong> b) momentum<br />

weighted jet charge.<br />

Fig. 7.5(a) <strong>and</strong> fig. 7.5(b) present <strong>the</strong> r i values obtained for each bin <strong>of</strong> <strong>the</strong> momentum


DtotBquark<br />

Entries<br />

47966<br />

Mean 0.2709<br />

RMS 0.5316Combined<br />

7.3 Combined Charge 132<br />

weighted vertex charge <strong>and</strong> <strong>the</strong> momentum weighted jet charge. As this is a MC calculation<br />

it is, <strong>of</strong> course, performed on <strong>the</strong> 174.0 GeV template sample. For each data event a combined<br />

tagging variable can <strong>the</strong>n be defined:<br />

r = ∏ i<br />

r i (7.3)<br />

please note that <strong>the</strong> value <strong>of</strong> r i , previously calculated on <strong>the</strong> MC sample, is <strong>the</strong> one corresponding<br />

to <strong>the</strong> value <strong>of</strong> x i calculated in <strong>the</strong> data event. Clearly this method, as most<br />

multi-variable combination methods, works best in <strong>the</strong> case that <strong>the</strong> discriminating variables<br />

are uncorrelated.<br />

Given <strong>the</strong> definition <strong>of</strong> r, if r < 1 <strong>the</strong>n <strong>the</strong> reconstructed jet is naturally more likely to be<br />

a b quark <strong>and</strong> if r > 1 <strong>the</strong> jet is more likely to originate from ¯b quark. The range <strong>of</strong> possible<br />

Events<br />

11000<br />

10000<br />

9000<br />

8000<br />

7000<br />

6000<br />

5000<br />

4000<br />

3000<br />

2000<br />

Charge<br />

b quark<br />

bbar quark<br />

−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1<br />

Combined Charge<br />

(a)<br />

purity<br />

1<br />

0.95<br />

0.9<br />

0.85<br />

0.8<br />

0.75<br />

0.7<br />

0.65<br />

0.6<br />

0.55<br />

0.5<br />

Combined Charge − Performance<br />

Combined Charge<br />

Momentum Weighted Jet Charge<br />

Momentum Weighted Vertex Charge<br />

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9<br />

efficiency<br />

(b)<br />

Figure 7.6: Combined charge a) distributions for b quark <strong>and</strong> ¯b quark jets b) purity vs<br />

efficiency curves for combined charge, momentum weighted vertex charge <strong>and</strong> momentum<br />

weighted jet charge. Please note that <strong>the</strong> calculation is performed after all <strong>the</strong> event selections,<br />

as explained in <strong>the</strong> previous chapter, have been performed.<br />

values for r is <strong>the</strong>refore between 0 <strong>and</strong> ∞. For <strong>the</strong> purpose <strong>of</strong> graphical representation <strong>and</strong><br />

convenience <strong>of</strong> <strong>the</strong> end user a variable whose limits are -1 <strong>and</strong> +1 has been devised. The<br />

combined charge variable is <strong>the</strong>refore defined as:<br />

C = 1 − r<br />

1 + r . (7.4)


7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry 133<br />

A jet with C > 0 is more likely to derive from a b quark <strong>and</strong> a jet with C < 0 is more likely<br />

to derive from a ¯b quark. Fig. 7.6(a) shows <strong>the</strong> result <strong>of</strong> <strong>the</strong> charge recombination technique<br />

when performed on <strong>the</strong> 174.0 GeV template sample after all <strong>the</strong> events selections described<br />

in <strong>the</strong> previous chapter have been performed. Fig. 7.6(b) instead shows <strong>the</strong> improved performance<br />

<strong>of</strong> <strong>the</strong> combined charge when compared with <strong>the</strong> independent momentum weighted<br />

vertex charge <strong>and</strong> momentum weighted jet charge.<br />

7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry<br />

The combined charge can now be applied to calculate <strong>the</strong> forward backward asymmetries<br />

<strong>of</strong> <strong>the</strong> b <strong>and</strong> t quarks. Before this task has been attempted some final event selections have<br />

been performed. Firstly all <strong>the</strong> events reconstructed with a top mass lower than 155 GeV or<br />

higher than 195 GeV have been rejected. The mass value used is <strong>the</strong> one after <strong>the</strong> kinematic<br />

fit, as described in <strong>the</strong> previous chapter dealing with <strong>the</strong> top mass measurement.<br />

It has also been decided to investigate <strong>the</strong> possibility <strong>of</strong> performing an event selection<br />

based on <strong>the</strong> reconstructed charge <strong>of</strong> <strong>the</strong> quarks. For this purpose one would like to use <strong>the</strong><br />

information derived from both jets. Assuming that <strong>the</strong> event is actually a b¯bq¯qq¯q, ra<strong>the</strong>r<br />

than an event from <strong>the</strong> SM background, <strong>and</strong> that <strong>the</strong> quark identification has been correctly<br />

performed, <strong>the</strong> charge calculations performed on <strong>the</strong> two jets are really two uncorrelated<br />

measurements <strong>of</strong> <strong>the</strong> same quantity. The two b jets must in fact have opposite absolute<br />

values for <strong>the</strong>ir charge. The combined charges <strong>of</strong> <strong>the</strong> two jets with <strong>the</strong> highest neural net b<br />

tags are <strong>the</strong>refore multiplied <strong>and</strong> used as an event selection parameter. Fig. 7.7(a) shows<br />

such distribution for <strong>the</strong> signal <strong>and</strong> <strong>the</strong> background events. The main aim <strong>of</strong> this process<br />

is not to reject <strong>the</strong> SM background, not plotted, but to reject <strong>the</strong> background deriving<br />

from <strong>the</strong> events in which <strong>the</strong> b quark has been mistagged or <strong>the</strong> charge <strong>of</strong> such quark<br />

has been misreconstructed. An event charge is labelled as misreconstructed only when <strong>the</strong><br />

reconstructed combined charge <strong>of</strong> <strong>the</strong> ¯b jet is higher than <strong>the</strong> combined charge <strong>of</strong> <strong>the</strong> b jet.<br />

Similarly to <strong>the</strong> case <strong>of</strong> <strong>the</strong> b tagging event selection, presented in <strong>the</strong> previous chapter,


ChargeMultiplyTOPHARDEYPTBcut<br />

Entries 150303<br />

Mean −0.06762<br />

RMS 0.3435<br />

7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry 134<br />

Events<br />

6000<br />

5000<br />

4000<br />

3000<br />

2000<br />

Combined charge jet1 * combined charge jet2<br />

All bbqqqq events<br />

Wrong charge + mistagged<br />

Mistagged<br />

Signal/sqrt(Signal + Background)<br />

140<br />

120<br />

100<br />

80<br />

60<br />

40<br />

Combined Charge Event Selection Optimization<br />

1000<br />

20<br />

0<br />

−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1<br />

Combined Charge b−jet1 * Combined Charge b−jet2<br />

(a)<br />

0<br />

−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1<br />

Combined Charge b−jet1 * Combined Charge b−jet2<br />

(b)<br />

Figure 7.7: Combined charge b jet-1 × combined charge b jet-2 a) distribution for reconstructed<br />

events, mistagged evens <strong>and</strong> events with misidentified charge b) event selection<br />

optimization using a S/ √ S + B maximization technique.<br />

also in this case an optimization has been attempted <strong>and</strong> <strong>the</strong> value <strong>of</strong> S/ √ S + B has been<br />

maximized. Fig. 7.7(b) shows <strong>the</strong> results <strong>of</strong> such optimization. Note that for <strong>the</strong> purpose<br />

<strong>of</strong> this optimization also <strong>the</strong> SM background has been included. Interestingly enough <strong>the</strong><br />

S/ √ S + B result suggests that all events should be included. Under <strong>the</strong>se conditions <strong>the</strong><br />

total efficiency <strong>of</strong> <strong>the</strong> analysis is 22.7%, while <strong>the</strong> purity is 58.1%. The impurities derive<br />

45.9% from <strong>the</strong> SM background, 45.0% from <strong>the</strong> charge misreconstruction <strong>and</strong> 9.1% from<br />

<strong>the</strong> quark misidentification.<br />

The calculation <strong>of</strong> <strong>the</strong> forward backward asymmetry as defined in equation 1.13 can<br />

now be performed. Fig. 7.8 shows <strong>the</strong> distribution with respect to cos(θ) <strong>of</strong> <strong>the</strong> signal<br />

<strong>and</strong> background events used in this calculation. For <strong>the</strong> purpose <strong>of</strong> this calculation, <strong>the</strong><br />

two jets with <strong>the</strong> highest neural net b tags have been used. Of <strong>the</strong>se <strong>the</strong> jet with a higher<br />

combined charge has been declared as originating from a b quark, while <strong>the</strong> o<strong>the</strong>r b jet has<br />

been declared as originating from a ¯b quark. The angle θ <strong>of</strong> <strong>the</strong> reconstructed b jet has been<br />

used as an approximation <strong>of</strong> <strong>the</strong> angle <strong>of</strong> <strong>the</strong> original b quark.<br />

Table. 7.2 instead shows <strong>the</strong> results <strong>of</strong> <strong>the</strong> A fb calculation, as defined in equation 1.13, for<br />

different event selections, to check <strong>the</strong> stability <strong>of</strong> <strong>the</strong> result. The number <strong>of</strong> correctly reconstructed<br />

b¯bq¯qq¯q events is calculated for <strong>the</strong> forward <strong>and</strong> backward hemispheres independently.<br />

For this purpose <strong>the</strong> SM background is subtracted from <strong>the</strong> total number <strong>of</strong> reconstructed


quarkscos<strong>the</strong>taALLEYPTBcut<br />

Entries<br />

29122<br />

Mean 0.1191<br />

RMS 0.6081<br />

7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry 135<br />

b quarks<br />

6000<br />

5000<br />

Events Used for b quark A_fb<br />

Events used for b quark A_fb<br />

Mistagging + SM Background<br />

4000<br />

SM Background<br />

3000<br />

2000<br />

1000<br />

0<br />

−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1<br />

cos(<strong>the</strong>ta)<br />

Figure 7.8: Events used for <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> b quark A fb . No event selection is performed<br />

using <strong>the</strong> combined charge. However, in order to qualify as a b ra<strong>the</strong>r than ¯b quark, <strong>the</strong><br />

combined charge <strong>of</strong> <strong>the</strong> jet must be higher than <strong>the</strong> one <strong>of</strong> <strong>the</strong> o<strong>the</strong>r b jet present in <strong>the</strong><br />

event. Notice that, as explained in <strong>the</strong> previous chapter, only two b jets are assumed to be<br />

present in each event. These are <strong>the</strong> jets with <strong>the</strong> highest neural net b tags.<br />

events. The number <strong>of</strong> events left is <strong>the</strong>n multiplied by <strong>the</strong> purity <strong>of</strong> <strong>the</strong> reconstruction,<br />

accounting for all <strong>the</strong> events where <strong>the</strong> charge has been misidentified or where <strong>the</strong> b jet has<br />

been mistagged. When calculating <strong>the</strong> purity, only <strong>the</strong> quark <strong>and</strong> charge misidentification<br />

are used as a background <strong>and</strong> <strong>the</strong> SM background is not included, since it is substracted separately.<br />

Note that separate purities have been calculated for <strong>the</strong> forward <strong>and</strong> <strong>the</strong> backward<br />

hemispheres. The equation applied to each hemisphere <strong>the</strong>refore is: N b = (N tot − N SM ) ∗ p,<br />

where N b is <strong>the</strong> number <strong>of</strong> b jets correctly identified <strong>and</strong> <strong>the</strong>refore used in <strong>the</strong> calculation,<br />

N tot is <strong>the</strong> total number <strong>of</strong> events reconstructed, N SM is <strong>the</strong> st<strong>and</strong>ard model background<br />

<strong>and</strong> p is <strong>the</strong> purity <strong>of</strong> <strong>the</strong> reconstruction.<br />

For each event selection <strong>the</strong> uncertainty has been calculated with three different assumptions.<br />

The lowest uncertainty, σ 1 , assumes that <strong>the</strong> efficiency <strong>of</strong> tagging <strong>and</strong> <strong>the</strong> st<strong>and</strong>ard<br />

model background have been perfectly simulated at MC level <strong>and</strong> <strong>the</strong>refore do not contribute<br />

to <strong>the</strong> achievable precision <strong>of</strong> <strong>the</strong> forward backward asymmetry. The only uncertainty term<br />

<strong>the</strong>refore is √ N tot,θ)90 ◦ where N tot,θ)90 ◦ is <strong>the</strong> number <strong>of</strong> events whose b quarks have<br />

been reconstructed in <strong>the</strong> forward (backward) region <strong>of</strong> <strong>the</strong> detector. Clearly this includes<br />

both signal <strong>and</strong> background events. For <strong>the</strong> second evaluation, σ 2 , <strong>the</strong> uncertainty derived


7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry 136<br />

from <strong>the</strong> purity <strong>of</strong> <strong>the</strong> tagging defined as σ p = √ p(1 − p)/n, where n is <strong>the</strong> total number<br />

<strong>of</strong> events passing <strong>the</strong> event selection process has been also considered. Finally <strong>the</strong> third<br />

evaluation, σ 3 , considers also an additional Poissonian uncertainty equal to <strong>the</strong> square root<br />

<strong>of</strong> <strong>the</strong> reconstructed St<strong>and</strong>ard Model background events. It has to be noted that, if <strong>the</strong><br />

uncertainty on <strong>the</strong> SM background is treated as a binomial error <strong>and</strong> <strong>the</strong>refore included into<br />

<strong>the</strong> purity calculation, <strong>the</strong> quoted results do not vary. In each <strong>of</strong> <strong>the</strong> three cases <strong>the</strong> uncertainty<br />

has been calculated separately for <strong>the</strong> forward <strong>and</strong> backward regions <strong>and</strong> subsequently<br />

<strong>the</strong> st<strong>and</strong>ard error propagation method has been used to evaluate <strong>the</strong> uncertainty on each<br />

asymmetry calculation. Possible systematic uncertainties have been neglected.<br />

b quark A fb<br />

Event Selection A fb σ 1 σ 2 σ 3<br />

Charge b jet-1 × Charge b jet-2 < 1.0 0.293 0.006 0.007 0.008<br />

Charge b jet-1 × Charge b jet-2 < 0.5 0.293 0.006 0.007 0.008<br />

Charge b jet-1 × Charge b jet-2 < 0.0 0.289 0.007 0.008 0.009<br />

Table 7.2: Reconstructed A fb for <strong>the</strong> b quark <strong>and</strong> <strong>the</strong> respective uncertainties. The different<br />

uncertainties (σ 1 ,σ 2 ,σ 3 ) have been calculated with different assumptions as explained in <strong>the</strong><br />

text <strong>of</strong> this section.<br />

From <strong>the</strong>se results (table 7.2) it is clear that an uncertainty <strong>of</strong> less than 0.01 is achievable<br />

when <strong>the</strong> b quark A fb is calculated. It is also clear that <strong>the</strong> devised method performs in a<br />

stable manner. Incidentally <strong>the</strong> calculated uncertainty on <strong>the</strong> A fb can be improved if <strong>the</strong><br />

results <strong>of</strong> <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> b quark are combined with <strong>the</strong> ones from <strong>the</strong> ¯b quark. The<br />

Lagrangian used in equation 1.11 is in fact CP conserving. However since on an event basis<br />

<strong>the</strong> directions <strong>of</strong> <strong>the</strong> b <strong>and</strong> ¯b are highly correlated because <strong>of</strong> <strong>the</strong> large boosts to <strong>the</strong> t <strong>and</strong> ¯t<br />

quarks <strong>the</strong> combination <strong>of</strong> <strong>the</strong> results in <strong>the</strong> centre <strong>of</strong> mass frame is not at all a trivial task<br />

<strong>and</strong> it will probably lead only to a small improvement.<br />

It is now possible to compare <strong>the</strong> results with <strong>the</strong> <strong>the</strong>oretical predictions. This will also<br />

allow us to convert <strong>the</strong> calculated asymmetry resolutions into sensitivity to anomalous form<br />

factors, as defined in equation 1.11. Table 7.3 presents a series <strong>of</strong> predictions <strong>of</strong> <strong>the</strong> b quark<br />

asymmetry for different values <strong>of</strong> <strong>the</strong> W tb anomalous couplings [16].<br />

Given that <strong>the</strong> reconstructed results have been calculated with both anomalous couplings


7.4 Bottom <strong>Quark</strong> Forward Backward Asymmetry 137<br />

b quark A fb - <strong>the</strong>oretical prediciton<br />

B WR B WL A fb<br />

0.0 0.0 0.279<br />

0.0 -0.2 0.243<br />

0.0 -0.4 0.218<br />

0.0 -0.6 0.197<br />

0.0 -1.0 0.169<br />

-0.6 0.0 0.301<br />

-1.0 0.0 0.315<br />

Table 7.3: A fb asymmetries for <strong>the</strong> st<strong>and</strong>ard model <strong>and</strong> anomalous W tb vertices. Calculated<br />

at a centre <strong>of</strong> mass value <strong>of</strong> 500 GeV <strong>and</strong> in <strong>the</strong> centre <strong>of</strong> mass rest frame.<br />

equal to zero, <strong>the</strong> discrepancy between <strong>the</strong> results <strong>and</strong> <strong>the</strong> <strong>the</strong>oretical prediction is approximately<br />

1.5 σ 3 . Although this can easily be a simple statistical fluctuation o<strong>the</strong>r possible<br />

reasons behind this slightly large discrepancy should also be investigated. The difference can<br />

derive from <strong>the</strong> use <strong>of</strong> a different MC generator (CompHEP [67, 68]) when calculating <strong>the</strong><br />

<strong>the</strong>oretical predictions or from <strong>the</strong> fact that <strong>the</strong> generated signal sample is an all inclusive<br />

b¯bq¯qq¯q ra<strong>the</strong>r than e + e −<br />

→ t¯t → b¯bq¯qq¯q. In fact if <strong>the</strong> A fb asymmetry is calculated at<br />

MC level for <strong>the</strong> used sample <strong>the</strong> value <strong>of</strong> 0.291 is obtained. A result in perfect agreement<br />

with <strong>the</strong> reconstructed values <strong>and</strong> that suggests no major systematic uncertainty has been<br />

introduced by <strong>the</strong> performed analysis.<br />

It can be also inferred from table 7.3 that <strong>the</strong> performed analysis is sensitive to <strong>the</strong> presence<br />

<strong>of</strong> an B WL anomalous form factor whose absolute value is greater that approximately<br />

0.05 or to <strong>the</strong> presence <strong>of</strong> a B WR form factor whose absolute value is greater than 0.3.<br />

Finally, in order to check for any significant detector smearing leading to systematic<br />

effects in A fb <strong>the</strong> angular resolution <strong>of</strong> <strong>the</strong> b jet θ angle with respect to <strong>the</strong> original b quark<br />

has been plotted. Fig. 7.9(a) shows <strong>the</strong> angular resolution for <strong>the</strong> angle θ. It can be clearly<br />

seen that <strong>the</strong> uncertainty is small, <strong>the</strong> root mean square <strong>of</strong> <strong>the</strong> distribution is 0.08 radians,<br />

<strong>and</strong> that it will <strong>the</strong>refore not have a major systematic effect on <strong>the</strong> reconstructed asymmetry.<br />

Incidentally if it did have an effect it would tend to smear <strong>the</strong> distribution plotted in fig. 7.8<br />

<strong>and</strong> <strong>the</strong>refore lower <strong>the</strong> calculated asymmetry near <strong>the</strong> values <strong>of</strong> cos(θ) = 0. This is clearly<br />

not observed. It has also been checked that <strong>the</strong> resolution does not decrease substantially


Bquarkscos<strong>the</strong>tadiffangleTOPHARDEYPTBcut<br />

Entries 103381<br />

Mean −0.001433<br />

RMS 0.08108<br />

bquarkscos<strong>the</strong>tadiff2DTOPEYPTBcut<br />

Entries 113341<br />

Mean x 1.804<br />

Mean y 4.158e−05<br />

RMS x 0.659<br />

RMS y 0.1178<br />

7.5 <strong>Top</strong> <strong>Quark</strong> Forward Backward Asymmetry 138<br />

in <strong>the</strong> very forward regions. As it can be seen in fig. 7.9(b) this is also not <strong>the</strong> case.<br />

b quarks<br />

14000<br />

12000<br />

Theta angle resolution − b quark<br />

Theta angle resolution − b quark<br />

10000<br />

8000<br />

6000<br />

4000<br />

2000<br />

0<br />

−0.4 −0.2 0 0.2 0.4<br />

<strong>the</strong>ta reconstructed − <strong>the</strong>ta MC (radiants)<br />

b quarks<br />

3<br />

10<br />

2<br />

10<br />

10<br />

1<br />

1.5<br />

1<br />

0.5<br />

<strong>the</strong>ta reconstructed − <strong>the</strong>ta MC (radiants)<br />

0<br />

−0.5<br />

−1<br />

−1.5<br />

3<br />

2.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

<strong>the</strong>ta reconstructed<br />

(a)<br />

(b)<br />

Figure 7.9: Resolution <strong>of</strong> <strong>the</strong> reconstructed b quark angle θ a) with respect to <strong>the</strong> θ <strong>of</strong> <strong>the</strong><br />

original MC level b quark b) also plotted vs. <strong>the</strong> reconstructed angle θ. The figures are<br />

plotted after all <strong>the</strong> event selections used for <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> b quark A fb <strong>and</strong> only <strong>the</strong><br />

quarks whose flavour <strong>and</strong> charge have been correctly identified are included.<br />

7.5 <strong>Top</strong> <strong>Quark</strong> Forward Backward Asymmetry<br />

The analysis <strong>of</strong> <strong>the</strong> top quark asymmetry is in principle identical to <strong>the</strong> one already presented<br />

for <strong>the</strong> b quark. The only added complication is that, differently from <strong>the</strong> b quark, where <strong>the</strong><br />

angle θ <strong>of</strong> <strong>the</strong> b jet can be used as a very good approximation to <strong>the</strong> angle θ <strong>of</strong> <strong>the</strong> original<br />

b quark, <strong>the</strong> direction <strong>of</strong> <strong>the</strong> top quark must be reconstructed from its decay products.<br />

This results in a combinatorial problem that is identical to <strong>the</strong> one faced when trying to<br />

reconstruct <strong>the</strong> top quark mass, discussed in <strong>the</strong> previous chapter. The same tool, <strong>the</strong><br />

kinematic fitter, has <strong>the</strong>refore been used. More specifically <strong>the</strong> direction <strong>of</strong> <strong>the</strong> top quark is<br />

calculated from <strong>the</strong> combination <strong>of</strong> jets that minimizes <strong>the</strong> χ 2 Tot<br />

value <strong>of</strong> equation 6.1. The<br />

jets’ energies <strong>and</strong> momenta used are <strong>the</strong> ones calculated from <strong>the</strong> kinematic fitter. A fur<strong>the</strong>r<br />

complication is that <strong>the</strong> charge <strong>of</strong> <strong>the</strong> top quark cannot be directly established. Therefore,<br />

<strong>the</strong> charge <strong>of</strong> <strong>the</strong> b quark is used. If a reconstructed b quark jet is part <strong>of</strong> <strong>the</strong> three jets<br />

used to reconstruct <strong>the</strong> top quark <strong>the</strong>n <strong>the</strong> top quark is declared as a t. If instead a ¯b jet<br />

is present <strong>the</strong> quark is declared as a ¯t. Given <strong>the</strong> constrains set to <strong>the</strong> kinematic fitter only<br />

one such quark will be present in each jet. Only <strong>the</strong> two jets with <strong>the</strong> highest neural net b


quarkscos<strong>the</strong>taALLEYPTBcut<br />

Entries<br />

29122<br />

Mean 0.1341<br />

RMS 0.6075<br />

7.5 <strong>Top</strong> <strong>Quark</strong> Forward Backward Asymmetry 139<br />

tags are in fact considered as originating from b quarks.<br />

t quarks<br />

6000<br />

5000<br />

Events Used for t quark A_fb<br />

Events used for t quark A_fb<br />

Mistagging + SM Background<br />

4000<br />

SM Background<br />

3000<br />

2000<br />

1000<br />

0<br />

−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1<br />

cos(<strong>the</strong>ta)<br />

Figure 7.10: Events used for <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> t quark A fb . No event selection is<br />

performed using <strong>the</strong> combined charge. However, in order to qualify as a t ra<strong>the</strong>r than ¯t<br />

quark, <strong>the</strong> combined charge <strong>of</strong> <strong>the</strong> b jet used to reconstruct <strong>the</strong> top quark must be higher<br />

than <strong>the</strong> one <strong>of</strong> <strong>the</strong> o<strong>the</strong>r b jet present in <strong>the</strong> event. Notice that, as explained in <strong>the</strong> previous<br />

chapter, only two b jets are assumed to be present in each event. These are <strong>the</strong> jets with<br />

<strong>the</strong> highest neural net b tags.<br />

Once <strong>the</strong> momentum <strong>of</strong> <strong>the</strong> top quark has been established <strong>the</strong> distribution <strong>of</strong> top quark<br />

events with respect to <strong>the</strong>ir cos(θ) can be plotted. Fig. 7.10 shows such distribution <strong>and</strong><br />

also includes <strong>the</strong> St<strong>and</strong>ard Model <strong>and</strong> <strong>the</strong> mistagging backgrounds.<br />

Subsequently <strong>the</strong> same A fb calculations have been performed as <strong>the</strong> ones described in <strong>the</strong><br />

previous section for <strong>the</strong> b quark case. The results <strong>of</strong> <strong>the</strong>se calculation can be seen in table<br />

7.4. It is once again clear that <strong>the</strong> analysis is stable. Not surprisingly <strong>the</strong> uncertainties are<br />

roughly equivalent to <strong>the</strong> ones obtained for <strong>the</strong> b quark. The calculated asymmetries can be<br />

compared to <strong>the</strong> one calculated at MC level: 0.351. Given that <strong>the</strong> MC result with no event<br />

selection applied <strong>and</strong> <strong>the</strong> reconstructed results are less than 0.5 σ away <strong>the</strong>re seems to be<br />

no major presence <strong>of</strong> systematic biases from <strong>the</strong> analysis.<br />

Finally, in <strong>the</strong> same fashion as for <strong>the</strong> b quark, <strong>the</strong> θ angle resolution has been calculated.<br />

Fig. 7.11(a) shows such distribution. In agreement with expectations <strong>the</strong> resolution is worse<br />

than <strong>the</strong> one presented for <strong>the</strong> b quark. The RMS is 0.19 radians. This however should not<br />

pose a significant systematic bias to <strong>the</strong> calculation. In fact only <strong>the</strong> very central events <strong>of</strong> <strong>the</strong>


Tquarkscos<strong>the</strong>tadiffangleTOPHARDEYPTBcut<br />

Entries 103381<br />

Mean 0.000761<br />

RMS 0.1974<br />

Tquarkscos<strong>the</strong>tadiff2DTOPEYPTBcut<br />

Entries 113341<br />

Mean x<br />

1.274<br />

Mean y 0.01863<br />

RMS x 0.6527<br />

RMS y 0.4129<br />

7.6 Summary 140<br />

t quark A fb<br />

Event Selection A fb σ 1 σ 2 σ 3<br />

Charge b jet-1 × Charge b jet-2 < 1.0 0.356 0.006 0.007 0.008<br />

Charge b jet-1 × Charge b jet-2 < 0.5 0.348 0.006 0.007 0.008<br />

Charge b jet-1 × Charge b jet-2 < 0.0 0.353 0.007 0.008 0.009<br />

Table 7.4: Reconstructed A fb for <strong>the</strong> t quark <strong>and</strong> <strong>the</strong> respective uncertainties. The different<br />

uncertainties (σ 1 ,σ 2 ,σ 3 ) have been calculated with different assumptions as explained in <strong>the</strong><br />

text <strong>of</strong> <strong>the</strong> previous section.<br />

7.10 distribution will ever be smeared enough to change hemisphere when reconstructed. The<br />

asymmetry in this region is however small <strong>and</strong> <strong>the</strong>refore has only a marginal contribution to<br />

<strong>the</strong> total calculated A fb . Fig. 7.11(b) instead shows that <strong>the</strong>re is no substantial degradation<br />

<strong>of</strong> <strong>the</strong> resolution in <strong>the</strong> forward regions.<br />

t quarks<br />

10000<br />

Theta angle resolution − t quark<br />

Theta angle resolution − t quark<br />

8000<br />

6000<br />

t quarks<br />

3<br />

10<br />

2<br />

10<br />

4000<br />

2000<br />

0<br />

−1 −0.5 0 0.5 1<br />

<strong>the</strong>ta reconstructed − <strong>the</strong>ta MC (radiants)<br />

10<br />

1<br />

3<br />

2<br />

1<br />

<strong>the</strong>ta reconstructed − <strong>the</strong>ta MC (radiants)<br />

0<br />

−1<br />

−2<br />

−3<br />

3<br />

2.5<br />

2<br />

1.5<br />

1<br />

0.5<br />

0<br />

<strong>the</strong>ta reconstructed<br />

(a)<br />

(b)<br />

Figure 7.11: Resolution <strong>of</strong> <strong>the</strong> reconstructed t quark angle θ a) with respect to <strong>the</strong> θ <strong>of</strong> <strong>the</strong><br />

original MC level t quark b) also plotted vs. <strong>the</strong> reconstructed angle θ. The figures are<br />

plotted after all <strong>the</strong> event selections used for <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> t quark A fb <strong>and</strong> only <strong>the</strong><br />

quarks whose flavour <strong>and</strong> charge have been correctly identified are included.<br />

7.6 Summary<br />

This chapter dealt with <strong>the</strong> determination <strong>of</strong> <strong>the</strong> forward backward asymmetries (A fb ) <strong>of</strong> <strong>the</strong><br />

t quark <strong>and</strong> its decay product, <strong>the</strong> b quark, in <strong>the</strong> hadronic t¯t channel. It has been determined<br />

that both <strong>of</strong> <strong>the</strong>se can be measured with a precision <strong>of</strong> 0.008, without <strong>the</strong> inclusion <strong>of</strong> any<br />

systematic biases. The achievable resolution <strong>of</strong> <strong>the</strong> b quark A fb has been compared to <strong>the</strong>


7.6 Summary 141<br />

<strong>the</strong>oretical predictions in <strong>the</strong> case <strong>of</strong> W tb anomalous couplings.<br />

Three different methods (<strong>the</strong> vertex charge, <strong>the</strong> momentum weighted vertex charge <strong>and</strong><br />

<strong>the</strong> momentum weighted jet charge), used for determining <strong>the</strong> charge <strong>of</strong> <strong>the</strong> b quark from<br />

which a heavy jet has originated, have also been described. An algorithm for recombining<br />

<strong>the</strong> last two <strong>of</strong> <strong>the</strong> mentioned methods has been implemented. The performance <strong>of</strong> <strong>the</strong><br />

implemented methods has been tested; a charge identification purity <strong>of</strong> up to 80% was<br />

achieved with efficiency <strong>of</strong> 60% assuming that <strong>the</strong> jet flavour has been correctly tagged. The<br />

collection <strong>of</strong> <strong>the</strong>se methods forms <strong>the</strong> backbone <strong>of</strong> <strong>the</strong> A fb analysis.


Chapter 8<br />

Conclusion<br />

The main aim <strong>of</strong> this <strong>the</strong>sis, <strong>the</strong> study <strong>of</strong> <strong>the</strong> proprieties <strong>of</strong> <strong>the</strong> top quark at <strong>the</strong> International<br />

Linear Collider by employing <strong>the</strong> SiD detector concept, has been achieved. The analysis has<br />

been performed in <strong>the</strong> fully hadronic channel e + e − → t¯t → b¯bq¯qq¯q with a 500 GeV centre <strong>of</strong><br />

mass energy <strong>and</strong> it resulted in <strong>the</strong> calculation <strong>of</strong> <strong>the</strong> achievable precisions for <strong>the</strong> following<br />

observables:<br />

• <strong>the</strong> top mass: 49 MeV.<br />

• <strong>the</strong> cross section <strong>of</strong> <strong>the</strong> e + e − → t¯t → b¯bq¯qq¯q process: 1.3 fb.<br />

• forward backward asymmetry <strong>of</strong> <strong>the</strong> top quark: 0.008.<br />

• forward backward asymmetry <strong>of</strong> <strong>the</strong> b quark from <strong>the</strong> top decay: 0.008<br />

All <strong>the</strong> quoted values are to be interpreted as in absence <strong>of</strong> any systematic biases.<br />

Throughout <strong>the</strong> work particular care has been devoted to <strong>the</strong> development <strong>and</strong> testing<br />

<strong>of</strong> algorithms <strong>and</strong> techniques that have a wider scope <strong>and</strong> applicability. The implemented<br />

heavy jet identification is <strong>the</strong>refore described in great detail <strong>and</strong> its performance has been<br />

extensively tested within <strong>the</strong> ILD/LDC framework on a 91.2 GeV dijet <strong>and</strong> a 500 GeV dijet<br />

sample. Using <strong>the</strong> 91.2 GeV centre <strong>of</strong> mass energy sample it has been found that b jets can<br />

be selected with a 90% purity for an efficiency <strong>of</strong> 70%. At 500 GeV centre <strong>of</strong> mass energy,<br />

142


Chapter 8. Conclusion 143<br />

assuming <strong>the</strong> same efficiency, <strong>the</strong> purity decreases to 80%. The algorithm has also been<br />

tested within <strong>the</strong> SiD framework <strong>and</strong> more specifically with b¯bq¯qq¯q events, which are <strong>the</strong><br />

signal <strong>of</strong> <strong>the</strong> performed top analysis.<br />

Three quark charge reconstruction algorithms have been implemented. These are <strong>the</strong> vertex<br />

charge, <strong>the</strong> momentum weighted vertex charge <strong>and</strong> <strong>the</strong> momentum weighted jet charge.<br />

Their performance has been tested by tagging <strong>the</strong> b quarks <strong>of</strong> <strong>the</strong> b¯bq¯qq¯q sample. The vertex<br />

charge has also been tested for b <strong>and</strong> c quarks with <strong>the</strong> ILD/LDC 91.2 GeV dijet sample.<br />

The momentum weighted vertex charge <strong>and</strong> <strong>the</strong> momentum weighted jet charge have been<br />

also combined in a single discriminating variable which has been shown to improve <strong>the</strong> reconstruction<br />

performance. By using this variable a purity <strong>of</strong> up to 80% is achievable with<br />

an efficiency lower than 60% when <strong>the</strong> charges <strong>of</strong> well identified b quarks are being reconstructed.<br />

This study has been performed on <strong>the</strong> b¯bq¯qq¯q sample.<br />

A separate study <strong>of</strong> <strong>the</strong> CPC2 sensor capacitance has also been performed, <strong>the</strong> final aim<br />

being <strong>the</strong> minimization <strong>of</strong> <strong>the</strong> clock current needed to operate <strong>the</strong> sensor; currently 10 A for<br />

a 10 cm 2 sensor. The pixel capacitance has been found to be 3.11 ± 0.05 nF/cm 2 . The<br />

main contribution has been found to come from <strong>the</strong> intergate capacitance which accounts<br />

for 87% <strong>of</strong> <strong>the</strong> total pixel capacitance. Subsequently a series <strong>of</strong> different gate designs have<br />

been investigated with <strong>the</strong> help <strong>of</strong> finite element modelling <strong>and</strong> <strong>of</strong> test structures.<br />

Finally <strong>the</strong> experimental set-up developed for <strong>the</strong> testing <strong>of</strong> <strong>the</strong> CPC2 sensor has been<br />

described toge<strong>the</strong>r with <strong>the</strong> s<strong>of</strong>tware data acquisition. A study <strong>of</strong> <strong>the</strong> CPC2 noise has been<br />

performed which allowed to achieve a low noise operation: 70e − RMS noise.


Appendix A<br />

CPC2 Bias Voltages<br />

Figure A.1: Schematic <strong>of</strong> <strong>the</strong> CPC2 Bias Voltages.<br />

In <strong>the</strong> noise analysis performed in Chapter 4 <strong>the</strong> substrate voltage is set to 0 V. The<br />

voltage across <strong>the</strong> buried channel, which is not shown in <strong>the</strong> picture, <strong>and</strong> which has been<br />

referred to also as V Ref has been set to 16 V. The voltage <strong>of</strong> <strong>the</strong> output gate (OG) is set to<br />

2 V. The RD voltage is set to 15 V, OD is instead set to 16 V, while CS is grounded. All<br />

<strong>the</strong>se voltages are to be interpreted as DC. Additionally an AC voltage <strong>of</strong> 2 V is used for<br />

<strong>the</strong> clock.<br />

144


Appendix B<br />

LCFI Vertex Flavour Tagging<br />

S<strong>of</strong>tware Parameters<br />

ZVRES flavour tag vertex charge<br />

parameter value parameter value parameter value<br />

w IP 1 N L 5 T qb,max (mm) 1<br />

k 0.125 p trk,NL,min (GeV) 1 (L/D) qb,max<br />

2.5<br />

R 0 0.6 p trk,NL−1,min (GeV) 2 (L/D) qb,min<br />

0.18<br />

χ 2 TRIM 10 N trks,min 1 V I,qb 1<br />

χ 2 0 10 χ 2 norm,max 20 T qc,max (mm) 1<br />

T max (mm) 1 (L/D) qc,max<br />

2.5<br />

(L/D) max<br />

2.5 (L/D) qc,min<br />

0.5<br />

(L/D) min<br />

0.18 V I,qc 0<br />

V I 2<br />

N σ,max 2<br />

w PT,max 3<br />

w corr,max 2<br />

(b/σ b ) cut 200<br />

Table B.1: LCFI Vertex Default Parameters<br />

Values <strong>of</strong> code parameters used for <strong>the</strong> results presented in <strong>the</strong> LCFI Vertex flavour<br />

tagging s<strong>of</strong>tware chapter.<br />

145


Bibliography<br />

[1] L. H. Ryder. “Quantum field <strong>the</strong>ory”. Cambridge Univ Press (1996).<br />

[2] M. E. Peskin; D. V. Schroeder. “An introduction to quantum field <strong>the</strong>ory”. Basic Books<br />

(1993).<br />

[3] S. L. Glashow. Nucl. Phys., 22 (1961) 579–588.<br />

[4] A. Salam. In N. Svartholm, editor, “Elementary Particle Theory”, page 367. Almquist<br />

<strong>and</strong> Wiksells, Stockholm (1969).<br />

[5] S. Weinberg et al. Physical Review Letters, 19 (1967) 1264–1266.<br />

[6] C.S; E. Ambler; R. W. Hayward; D. D. Hoppes; R. P. Hudson C. S. Wu. Physical<br />

Review, 105 (1957) 1413–1415.<br />

[7] P. W. Higgs. Physical Review, 145 (1966) 1156–1163.<br />

[8] C. Amsler et al. Phys. Lett., B667. http://pdg.lbl.gov.<br />

[9] J. Christenson; J. Cronin; V. Fitch; R. Turlay. Phys. Rev. Lett., 13, 138.<br />

[10] S. T. Hill; E. H. Simmons. Phys. Rept., 381 (2003) 235–402.<br />

[11] D. Chakraborty; J. Konigsberg. arXiv preprint hep-ph/0303092.<br />

[12] A. Djouadi; J. Lykken; K. Mönig; Y. Okada; M. Oreglia; S. Yamashita. arXiv preprint<br />

0709.1893.<br />

[13] R. Barate et al. Phys. Lett., B565 (2003) 61–75.<br />

[14] K. Hagiwara; K. Hikasa; K. Nakamura; M. Tanabashi; M. Aguilar-Benitez; C. Amsler<br />

et al. Phys. Rev, D66 (2002) 1–213.<br />

[15] J.A. Aguilar-Saavedra. Nuclear Physics, B812 (2009) 181 – 204.<br />

[16] E. Boos; M. Dubinin; M. Sachwitz; H. J. Schreiber. The European Physical Journal,<br />

C16 (2000) 269–278.<br />

[17] M. Martinez; R. Miquel. The European Physical Journal, C27 (2003) 49–55.<br />

146


Bibliography 147<br />

[18] J. Brau; Y. Okada; N. Walker et al. arXiv preprint 0712.1950.<br />

[19] N. Phinney; N. Toge; N. Walker et al. arXiv preprint 0712.2361.<br />

[20] J. Resta-Lopez, P. N. Burrows, A. Latina & D. Schulte. arXiv preprint accph/0902.2915.<br />

[21] L. Lilje; H. Hayano; N. Ohuchi; S. Fukuda; C. Adolphsen. TILC 2009.<br />

http://tilc09.kek.jp.<br />

[22] B. Barish. TILC 2009. http://tilc09.kek.jp.<br />

[23] SiD Collaboration. “SiD Letter <strong>of</strong> Intent”.http://silicondetector.org/display/SiD/LOI<br />

(2009).<br />

[24] R. Lipton. International Workshop on Vertex Detectors-2007.<br />

hhttp://vertex2007.syr.edu.<br />

[25] SiD Collaboration. “SiD02”.http://confluence.slac.stanford.edu/display/ilc/sid02.<br />

[26] M. J. Charles. arXiv preprint 0901.4670.<br />

[27] S. Agostinelli; J. Allison; K. Amako; J. Apostolakis; H. Araujo; P. Arce; M. Asai; D.<br />

Axen; S. Banerjee; G. Barr<strong>and</strong> et al. Nuclear Inst. <strong>and</strong> Meth., A506 (2003) 250–303.<br />

[28] ILD Collaboration. “ILD Letter <strong>of</strong> Intent”. http://www.ilcild.org (2009).<br />

[29] D. Schulte et al. Particle Accelerator Conference - 2007. http://pac07.org.<br />

[30] W. Kilian; T. Ohl; J. Reuter. arXiv preprint 0708.4233.<br />

[31] M. Moretti; T. Ohl;J. Reuter. arXiv preprint hep-ph/0102195.<br />

[32] T. Sjostr<strong>and</strong>; S. Mrenna; P. Sk<strong>and</strong>s. arXiv preprint hep-ph/0603175v2.<br />

[33] N. Graf; J. McCormick. Calorimetry in High Energy Physics, 867 (2006) 503–512.<br />

[34] P. Mora de Freitas. Linear Collider Workshop-LCWS 2004.<br />

http://polywww.in2p3.fr/actualites/congres/lcws2004.<br />

[35] G. Musat. Linear Collider Workshop-LCWS 2004.<br />

http://polywww.in2p3.fr/actualites/congres/lcws2004.<br />

[36] T. Johnson. Snowmass Workshop-2005. http://www.slac.stanford.edu/econf/C0508141.<br />

[37] BR S. Catani; Y. L. Dokshitzer; M. Olsson; G. Turnock; B. R.Webber. Physics Letters,<br />

B269, 3-4 (1991) 432–438.<br />

[38] F. Gaede. Nuclear Instruments <strong>and</strong> Meth., A559 (2006) 177 – 180.<br />

[39] O. Wendt; F. Gaede; T. Krämer. arXiv preprint physics/0702171.<br />

[40] F. Gaede; T. Behnke; N. Graf; T. Johnson. arXiv preprint 0306114.


Bibliography 148<br />

[41] J.R. Janesick. “Scientific charge-coupled devices”. SPIE press (2001).<br />

[42] K. Stefanov. Nuclear Inst. <strong>and</strong> Meth., A569 (2006) 48–52.<br />

[43] K. Stefanov. International Workshop on Vertex Detectors-2007.<br />

hhttp://vertex2007.syr.edu.<br />

[44] B. Hawes; D. Cussans; C. Damerell; E. Devetak; J. Fopma; R. Gao; J. Goldstein; T.<br />

Greenshaw; N. Kundu; A.Nomerotski; C. Perry; K. Stefanov; S. Thomas; S. Worm.<br />

accepted by Nucl. Instr. <strong>and</strong> Meth..<br />

[45] P. Allport; D. Bailey; C. Buttar; D. Cussans; C. Damerell; N. De Groot; J. Fopma;<br />

B. Foster; S. Galagedera; A. R. Gillman et al. “Linear Collider Flavour Identification:<br />

Case for Support”. http://hepwww.rl.ac.uk/lcfi/lcfi 3 proposal.pdf (2005).<br />

[46] D. Meeker. Users Manual, s<strong>of</strong>tware homepage: http://femm. berlios. de/index. html.<br />

[47] S. Worm et al. Nucl. Instrum. Meth., A582 (2007) 839–842.<br />

[48] G. Moldovan; B. Jeffery; A. Nomerotski; A. Kirkl<strong>and</strong>. Nucl. Instrum. Meth., A604<br />

(2009) 108 – 110.<br />

[49] D. Bailey et al. arXiv preprint 0908.3019.<br />

[50] D. J. Jackson. Nucl. Instrum. Meth., A388 (1997) 247–253.<br />

[51] J. Thom. Technical Report, DE2002-798982; SLAC, 585.<br />

[52] S. Gorbunov; I. Kisel. CBM-SOFT-note-2007-003.<br />

http://www.gsi.de/documents/DOC-2007-May-14-1.pdf.<br />

[53] T. Abe. Nuclear Inst. <strong>and</strong> Meth., A447 (2000) 90–99.<br />

[54] R. Hawkings. LC-PHSM-2000-021. http://www-flc.desy.de/lcnotes/notes.<br />

[55] D. Buskulic et al. Phys. Lett., B313 (1993) 535–548.<br />

[56] P. Abreu;. Eur. Phys. J., C10 (1999) 415.<br />

[57] K. Abe et al. Phys. Rev. Lett., 79 (1997) 590–596.<br />

[58] J. Abdallah et al. Eur. Phys. J., C32 (2004) 185–208.<br />

[59] A. Hocker; P. Speckmayer; J. Stelzer; F. Tegenfeldt; H. Voss; K. Voss; A. Christov;<br />

S. Henrot-Versille; M. Jachowski; A. Krasznahorkay Jr. et al. arXiv preprint<br />

physics/0703039.<br />

[60] T. Behnke; N. Graf; A. Miyamoto. ILC-MEMO-2008-001.<br />

http://ilcdoc.linearcollider.org/collection/Memos?ln=en.<br />

[61] L. Lyons. “Statistics for nuclear <strong>and</strong> particle physicists”. Cambridge University Press<br />

(1986).


Bibliography 149<br />

[62] B. List; J. List. LC-TOOL-2009-001. www-flc.desy.de/lcnotes/notes/.<br />

[63] R. Cassell. Arxiv preprint arXiv:0902.2694.<br />

[64] R. Akers et al. Z. Phys., C66 (1995) 19–30.<br />

[65] R. D. Field; R. P. Feynman. Nuclear Physics, B136 (1978) 1 – 76.<br />

[66] D0 Collaboration. Physical Review, D74.<br />

[67] E. Boos et al. Nucl. Instrum. Meth., A534 (2004) 250–259.<br />

[68] A. Pukhov et al. arXiv preprint hep-ph/9908288.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!