
38 Analysis of the performance of CASINO on parallel computers

38.1 VMC in parallel

The VMC algorithm is perfectly parallel: no interprocessor communication is required during simulations. Each processor carries out an independent random walk using a different random-number sequence, and the results are averaged at the end of each block, so that running for a length of time T on P processors generates the same amount of data as running for time PT on a single processor (assuming the equilibration time to be negligible). VMC should therefore scale to an arbitrarily large number of processors.

Note that, although the energy obtained by running for time T on P processors should be in statistical agreement with that obtained by running for time PT on a single processor, the results will not be exactly equal, because the random walks are different in the two cases.
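As a toy illustration of this pattern, the following Python/mpi4py sketch shows one block of an embarrassingly parallel average; it is not CASINO's Fortran/MPI code, and local_energy and all numerical details are placeholder assumptions.

```python
# Toy illustration of perfectly parallel VMC averaging (hypothetical,
# not CASINO's actual implementation).  Each rank runs an independent
# walk with its own random-number stream; communication happens only
# when the block averages are combined.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank, nproc = comm.Get_rank(), comm.Get_size()

rng = np.random.default_rng(seed=12345 + rank)  # distinct stream per processor

def local_energy(r):
    """Placeholder for the real local energy E_L(R) = (H Psi)/Psi."""
    return float(np.sum(r * r))

steps_per_block = 1000
r = rng.standard_normal(3)
block_sum = 0.0
for _ in range(steps_per_block):
    r = r + 0.5 * rng.standard_normal(3)  # toy move (no accept/reject step)
    block_sum += local_energy(r)

# The only communication: combine the block sums at the end of the block.
total = comm.reduce(block_sum, op=MPI.SUM, root=0)
if rank == 0:
    print("block-average energy:", total / (nproc * steps_per_block))
```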

38.2 Optimization in parallel

38.2.1 Standard variance minimization

The VMC stages of a variance-minimization calculation are perfectly parallel, as described above. In the optimization stages, the configuration set is distributed evenly between the processors. The master processor broadcasts the current set of optimizable parameters, then each processor calculates the local energy of each of its configurations and reports the energies (and weights, if required) to the master. The CPU time required to evaluate the local energies of the configuration set usually far exceeds the time spent communicating (reporting one or two numbers per configuration to the master and receiving a handful of parameter values at each iteration). In particular, the time spent evaluating the local energies increases with system size, whereas the time spent on interprocessor communication is independent of system size. The standard variance-minimization method is therefore essentially perfectly parallel.

Note that the number of interprocessor communications could easily be reduced further if each processor were simply to report the sum of its local energies and the sum of the squares of the local energies to the master.
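A minimal sketch of one iteration's communication pattern, again in hypothetical Python/mpi4py with local_energy and the configurations as stand-ins, follows; it also uses the reduced communication suggested in the note above.

```python
# Sketch of one optimization-stage iteration (hypothetical, not CASINO
# itself).  Each processor reports only the sum of its local energies
# and the sum of their squares, from which the master can reconstruct
# the mean and variance exactly.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

def local_energy(params, config):
    return float(params @ config)  # stand-in for an E_L evaluation

# Each processor holds its own share of the configuration set.
my_configs = np.random.default_rng(rank).standard_normal((100, 5))

# The master broadcasts the current optimizable parameters...
params = np.ones(5) if rank == 0 else None
params = comm.bcast(params, root=0)

# ...each processor evaluates the local energy of its configurations...
energies = np.array([local_energy(params, c) for c in my_configs])

# ...and only three numbers per processor go back to the master.
sums = np.array([energies.sum(), (energies ** 2).sum(), float(len(energies))])
totals = comm.reduce(sums, op=MPI.SUM, root=0)

if rank == 0:
    n = totals[2]
    mean = totals[0] / n
    print("unreweighted variance of E_L:", totals[1] / n - mean ** 2)
```

The sums suffice because both the mean and the variance of the local energies are functions of the total count, the sum of energies, and the sum of squared energies alone.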

38.2.2 Variance minimization for linear Jastrow parameters

The VMC stage of the optimization (including the construction and accumulation of the quartic coefficients) is perfectly parallel. The optimization itself is carried out in serial on the master node. However, this stage typically takes a fraction of a second, and the time it requires is independent of system size. The varmin-linjas scheme is therefore essentially perfectly parallel, as the sketch after this paragraph illustrates.
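The following hypothetical Python/mpi4py sketch shows the pattern for a single linear parameter a: the coefficients of a quartic V(a) are accumulated in parallel and the quartic is minimized in serial on the master. The per-configuration contributions here are random stand-ins, not CASINO's actual estimators.

```python
# Sketch of the varmin-linjas pattern for one linear parameter a
# (hypothetical).  Accumulation of the coefficients of
# V(a) = sum_k c_k a^k is parallel; minimization is serial on the master.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

rng = np.random.default_rng(rank)
my_coeffs = np.zeros(5)
for _ in range(100):                 # one contribution per configuration
    contrib = rng.standard_normal(5)
    contrib[4] = abs(contrib[4])     # keep the toy quartic bounded below
    my_coeffs += contrib

coeffs = comm.reduce(my_coeffs, op=MPI.SUM, root=0)

if rank == 0:
    c = coeffs
    # Stationary points of V are the real roots of the cubic V'(a).
    roots = np.roots([4 * c[4], 3 * c[3], 2 * c[2], c[1]])
    real = roots[np.abs(roots.imag) < 1e-10].real
    V = lambda a: sum(c[k] * a ** k for k in range(5))
    print("optimal parameter:", min(real, key=V))
```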

38.2.3 Energy minimization

The VMC stages of an energy-minimization calculation are perfectly parallel, as described above. For the matrix-algebra stages, the configurations are divided evenly between the processors, each of which separately generates one section of the full matrices. The full matrices are then gathered on the master processor, where the matrix algebra is done. The time taken by the matrix algebra is usually insignificant in comparison with the time taken in VMC and matrix generation. The time spent on interprocessor communication is recorded and written out during energy minimization; it typically amounts to at most a few percent of the total time spent in an iteration, and often to much less than one percent. Overall, energy minimization is very nearly perfectly parallel.
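A hypothetical Python/mpi4py sketch of the matrix-generation stage follows; the per-configuration quantities are random stand-ins for wavefunction derivatives and local energies, and the generalized eigenproblem stands in for whatever matrix algebra the method actually performs.

```python
# Sketch of parallel matrix generation for energy minimization
# (hypothetical).  Each processor builds its section of the H and S
# matrices from its own configurations; the sections are gathered on
# the master, which does the matrix algebra in serial.
import numpy as np
import scipy.linalg as sla
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
nparam = 4

rng = np.random.default_rng(rank)
myH = np.zeros((nparam, nparam))
myS = np.zeros((nparam, nparam))
for _ in range(50):                  # this processor's configurations
    v = rng.standard_normal(nparam)  # stand-in derivative vector
    e = rng.standard_normal()        # stand-in local energy
    myS += np.outer(v, v)
    myH += e * np.outer(v, v)

sections_H = comm.gather(myH, root=0)
sections_S = comm.gather(myS, root=0)

if rank == 0:
    H, S = sum(sections_H), sum(sections_S)
    # Serial matrix algebra, e.g. a generalized eigenproblem H c = E S c.
    evals, evecs = sla.eig(H, S)
    print("lowest eigenvalue:", evals.real.min())
```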

38.3 DMC in parallel

38.3.1 Parallelization strategy

When performing DMC on a parallel machine, the population of configurations is usually distributed evenly over the set of processors. The algorithm is not perfectly parallel, because the populations on the different processors fluctuate as configurations are created and destroyed by the branching process, so that configurations must occasionally be transferred between processors to keep the load balanced.

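The following hypothetical Python/mpi4py sketch illustrates the fluctuating per-processor populations and a deliberately naive rebalancing step (a gather and re-scatter); a production code would use more economical pairwise transfers, and nothing here should be read as CASINO's actual transfer algorithm.

```python
# Sketch of DMC population distribution and naive load balancing
# (hypothetical).  Branching leaves each processor with a fluctuating
# number of walkers, which are then dealt back out evenly.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank, nproc = comm.Get_rank(), comm.Get_size()

rng = np.random.default_rng(rank)
my_walkers = [rng.standard_normal(3) for _ in range(int(rng.integers(5, 15)))]

counts = comm.allgather(len(my_walkers))
if rank == 0:
    print("populations before balancing:", counts)

# Naive rebalancing: pool every walker on the master, deal them back out.
pool = comm.gather(my_walkers, root=0)
if rank == 0:
    flat = [w for part in pool for w in part]
    chunks = [flat[i::nproc] for i in range(nproc)]
else:
    chunks = None
my_walkers = comm.scatter(chunks, root=0)

counts = comm.allgather(len(my_walkers))
if rank == 0:
    print("populations after balancing:", counts)
```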
