Theory Notes
Contents
[hide]- 1 Notes on solving the Schrödinger equation in Hylleraas coordinates for heliumlike atoms
Gordon W.F. Drake
Department of Physics, University of Windsor
Windsor, Ontario, Canada N9B 3P4
(Transcribed from hand-written notes and edited by Lauren Moffatt. Last revised July 20, 2016.)
- 1.1 Introduction
- 1.2 The Hartree Fock Method
- 1.3 Configuration Interaction
- 1.4 Hylleraas Coordinates
- 1.5 Completeness
- 1.6 Solutions of the Eigenvalue Problem
- 1.7 Matrix Elements of H
- 1.8 Radial Integrals and Recursion Relations
- 1.9 Graphical Representation
- 1.10 Matrix Elements of H
- 1.11 General Hermitian Property
- 1.12 Optimization of Nonlinear Parameters
- 1.13 The Screened Hydrogenic Term
- 1.14 Small Corrections
Notes on solving the Schrödinger equation in Hylleraas coordinates for heliumlike atoms
Gordon W.F. Drake
Department of Physics, University of Windsor
Windsor, Ontario, Canada N9B 3P4
(Transcribed from hand-written notes and edited by Lauren Moffatt. Last revised July 20, 2016.)
Introduction
These notes describe some of the technical details involved in solving the Schrödinger equation for a heliumlike atom of nuclear charge Z in correlated Hylleraas coordinates. The standard for computational purposes is to express all quantities in a form that is valid for electronic states of arbitrary angular momentum, and to express matrix elements of the Hamiltonian in a manifestly Hermitian symmetrized form. Further information and derivations of the mathematical techniques for angular integrals and radial recursion relations can be found in the paper G.W.F. Drake, Phys. Rev. A 18, 820 (1978).
The starting point is the two-electron Schrödinger equation for infinite nuclear mass [−ℏ22m(∇21+∇22)−Ze2r1−Ze2r2+e2r12]ψ=Eψ where m is the electron mass, and r12=|r1−r2| (see diagram below).
Begin by rescaling distances and energies so that the Schrödinger equation can be expressed in a dimensionless form. The dimensionless Z-scaled distance is defined by ρ=Zra0 where a0=ℏ2me2 is the atomic unit (a.u.) of distance equal to the Bohr radius 0.52917721092(17)×10−10 m. Then
[−ℏ22mZ2(me2ℏ2)2(∇2ρ1+∇2ρ2)−Z2e2a0ρ−11−Z2e2a0ρ−12+e2a0Zρ−112]ψ=Eψ
But ℏ2m(me2ℏ2)2=e2a0 is the hartree atomic unit of of energy (Eh=27.21138505(60) eV, or equivalently Eh/(hc)=2194.746313708(11) m−1). Therefore, after multiplying through by a0/(Ze)2, the problem to be solved in Z-scaled dimensionless units becomes
[−12(∇2ρ1+∇2ρ2)−1ρ1−1ρ2+Z−1ρ12]ψ=εψ
where ε=Ea0(Ze)2 is the energy in Z-scaled atomic units. For convenience, rewrite this in the conventional form Hψ=εψ where (using r in place of ρ for the Z-scaled distance) H=−12(∇21+∇22)−1r1−1r2+Z−1r12 is the atomic Hamiltonian for infinite nuclear mass.
The Hartree Fock Method
For purposes of comparison, and to define the correlation energy, assume that ψ(r1,r2) can be written in the separable product form ψ(r1,r2)=1√2[u1(r1)u2(r2)±u2(r1)u1(r2)] for the 1s2 1S ground state. Because of the $\frac{1}{r_{12}}$ term in the Schrödinger equation, the exact solution connot be expressed in this form as a separable product. However, the Hartree-Fock approximation corresponds to finding the best possible solution to the Schrödinger equation Hψ(r1,r2)=Eψ(r1,r2) that can nevertheless be expressed in this separable product form, where as before H=−12(∇21+∇22)−1r1−1r2+Z−1r12 is the full two-electron Hamiltonian. To find the best solution, substitute into ⟨ψ|H−E|ψ⟩ and require this expression to be stationary with respect to arbitrary infinitesimal variations δu1 and δu2 in u1 and u2 respectively; i.e.
12⟨δu1(r1)u2(r2)±u2(r1)δu1(r2)|H−E|u1(r1)u2(r2)±u2(r1)u1(r2)⟩
=∫δu1(r1)dr1{∫dr2u2(r2)(H−E)[u1(r1)u2(r2)±u2(r1)u1(r2)]}
=0
for arbitrary δu1(r1). Therefore {∫dr2…}=0.
Similarly, the coefficient of δu2 would give
∫dr1u1(r1)(H−E)[u1(r1)u2(r2)±u2(r1)u1(r2)]=0
Define
I12=I21=∫dru1(r)u2(r),
Hij=∫drui(r)(−12∇−1r)uj(r),
Gij(r)=∫dr′ui(r′)1|r−r′|uj(r′)
Then the above equations become the pair of integro-differential equations
[H0−E+H22+G22(r)]u1(r)=∓[I12(H0−E)+H12+G12(r)]u2(r)
[H0−E+H11+G11(r)]u2(r)=∓[I12(H0−E)+H12+G12(r)]u1(r)
These must be solved self-consistently for the "constants" I12 and Hij and the function Gij(r).
The Hartree Fock energy is E≃−2.87…a.u. while the exact energy is E=−2.903724…a.u.
The difference is called the "correlation energy" because it arises from the way in which the motion of one electron is correlated to the other. The Hartree Fock equations only describe how one electron moves in the average field provided by the other (mean-field theory).
Configuration Interaction
Expand ψ(r1,r2)=C0u(s)1(r1)u(s)1(r2)+C1u(P)1(r1)u(P)1(r2)Y01,1,0(ˆr1,ˆr2)+C2u(d)1(r1)u(d)2(r2)Y02,2,0(ˆr1,ˆr2)+...±exchange where YMl1,l2,L(ˆr1,ˆr2)=∑m1,m2Ym1l1(r1)Ym2l2(r2)×<l1l2m1m2∣LM>
This works, but is slowly convergent, and very laborious. The best CI calculations are accurate to 10−7 a.u.
Hylleraas Coordinates
[E.A. Hylleraas, Z. Phys. 48,469(1928) and 54,347(1929)] suggested using the co-ordinates r1, r2 and r12 or equivalently
s=r1+r2,t=r1−r2,u=r12
and writing the trial functions in the form
Ψ(r1,r2)=i+j+k≤N∑i,j,kci,j,kri+l11rj+l22rk12e−αr1−βr2YMl1,l2,L(ˆr1,ˆr2)±exchange
Diagonalizing H in this non-orthogonal basis set is equivalent to solving ∂E∂ci,j,k=0 for fixed α and β.
The diagonalization must be repeated for different values of α and β in order to optimize the non-linear parameters.
Completeness
The completeness of the above basis set can be shown by first writing r212=r21+r22−2r1r2cosθ12 and cos(θ12)=4π31∑m=−1Ym∗1(θ1,φ1)Ym1(θ2,φ2) Consider first the S-states. The r012 terms are like the ss terms in a CI calculation. The r212 terms bring in p-p type contributions, and the higher powers bring in d-d, f-f etc type terms. In general
Pl(cosθ12)=4π2l+1l∑m=−lYml∗(θ1,φ1)Yml(θ2,φ2)
For P-states, one would have similarly
r012(sp)Pr212(pd)Pr412(df)P⋮⋮
For D-states
r012(sd)D(pp′)Dr212(pf)D(dd′)Dr412(dg)D(ff′)D⋮⋮⋮
In this case, since there are two "lowest-order" couplings to form a D-state, both must be present in the basis set, i.e.
Ψ(r2,r2)=∑cijkri1rj+22rk12e−αr1−βr2YM022(ˆr1,ˆr2)+∑dijkri+11rj+12r12e−α′r1−β′r2YM112(ˆr1,ˆr2)
For F-states, one would need $(sf)F$ and $(pd)F$ terms.
For G-states, one would need $(sg)G$, $(pf)G$ and $(dd^\prime)G$ terms.
Completeness of the radial functions can be proven by considering the Sturm-Liouville problem
(−12∇2−λrs−E)ψ(r)=0
or
(−121r2∂∂r(r2∂∂r)−l(l+1)2r2−λr−E)u(r)=0.
For fixed E and variable $\lambda$ (nuclear charge).
The eigenvalues are $\lambda_n = (E/E_n)^{1/2}$, where $E_n =- \frac{1}{2n^2}$
and the eigenfunctions are
unl(r)=1(2l+1)!((n+l)!(n−l−1)2!)1/2(2α)3/2e−αr× (2αr)l1F1(−n+l+1,2l+2;2αr)
with α=(−2E)1/2 and n≥l+1. The confluent hypergeometric function 1F1(a,b,;z) then denotes a finite polynomial since a=−n+l+1 is a negative integer or zero.
Unlike the hydrogen spectrum, which has both a discrete part for E<0 and a
continuous part for E>0, this forms an entirely discrete set of finite
polynomials, called Sturmian functions. They are orthogonal with respect to
the potential, i.e.
∫∞0r2dr(un′l(r)1runl(r))=δn,n′
Since they become complete in the limit n→∞, this assures the
completeness of the variational basis set.
[See also B Klahn and W.A. Bingel Theo. Chim. Acta (Berlin) 44, 9 and 27 (1977)].
Solutions of the Eigenvalue Problem
For convenience, write
Ψ(r1,r2)=N∑m=1cmφm
where $m$ represents the $m$'th combination of $i,j,k$ and
φijk=ri1rj2rk12e−αr1−βr2YMl1,l2,L(ˆr1,ˆr2)±exchange
\begin{align}
&
\left( \begin{array}{lr}
\cos(\theta) & \sin(\theta)\\
-\sin(\theta) & \cos(\theta)
\end{array} \right)
\left( \begin{array}{lr}
H_{11} & H_{12} \\
H_{12} & H_{22}
\end{array}\right)
\left( \begin{array}{lr}
\cos(\theta) & -\sin(\theta)\\
\sin(\theta) & \cos(\theta)
\end{array} \right)
\\
= & \left(\begin{array}{lr}
cH_{11}+sH_{12} & cH_{12} + sH_{22}\\
-sH_{11} + cH_{12} & -sH_{12} + cH_{22}
\end{array}\right)
\left( \begin{array}{cc}
c & -s \\
s & c
\end{array}\right)
\\
= & \left(c2H11+s2H22+2csH12(c2−s2)H12+cs(H22−H11)(c2−s2)H12+cs(H22−H11)s2H11+c2H22−2csH12\right) \end{align}
Therefore
(cos2(θ)−sin2(θ))H12=cos(θ)sin(θ)(H11−H22)
and
tan(2θ)=2H12H11−H22
i.e.
cos(θ)=(r+ω2r)1/2
sin(θ)=−sgn(H12)(r−ω2r)1/2
where
ω=H22−H11
r=(ω2+4H212)1/2
E1=12(H11+H22−r)
E2=12(H11+H22+r)
Brute Force Method
- Gives all the eigenvalues and eigenvectors, but it is slow
- First orthonormalize the $N$-dimensional basis set; i.e. form linear combinations
Φm=N∑n=1φnRnm
such that ⟨Φm|Φn⟩=δm,n
This can be done by finding an orthogonal tranformation T such that
TTOT=I=(I100…00I20…000I3…0⋮⋮⋮⋱⋮000…IN);
Omn=⟨φm|φn⟩
and then applying a scale change matrix
S=(1I1/2100…001I1/220…0001I1/23…0⋮⋮⋮⋱⋮000…I1/2N)=ST
Then
STTTOTS=1
i.e.
RTOR=1
with R=TS.
If H is the matrix with elements $H_{mn} = \langle \varphi_m | \varphi_n \rangle $, then H expressed in the Φm basis set is
H′=RTHR.
We next diagonalize $H^\prime$ by finding an orthogonal transformation W such that
WTH′W=λ=(λ10…00λ2…0⋮⋮⋱⋮00…λN)
The $q$'th eigenvector is
Ψ(q)=N∑n=1ΦnWn,q=∑n,n′φn′Rn′,nWn,q
i.e.
c(q)n′=N∑n=1Rn′nWn,q
The Power Method
- Based on the observation that if H has one eigenvalue, $\lambda_M$, much bigger than all the rest, and $\chi = \left(\nonumber a1a2⋮\right)\nonumber$ is an arbitrary starting vector, then $\chi = \sum_{q=1}^N x_q\Psi^{(q)}\nonumber$.
(H)nχ=N∑q=1xqλnqΨ(q)→xMλnMΨ(M)
provided $x_M\neq 0$.
To pick out the eigenvector corresponding to any eigenvalue, write the original problem in the form
HΨ=λOΨ(H−λqO)Ψ=(λ−λq)OΨ
Therefore,
GΨ=1λ−λqΨ
where $G=(H-\lambda_qO)^{-1}O\nonumber$ with eigenvalues $\frac{1}{\lambda_n - \lambda_q}\nonumber$.
By picking $\lambda_q$ close to any one of the $\lambda_n$, say $\lambda_{n^\prime}$, then $\frac{1}{\lambda_n-\lambda_q}$ is much larger for $n=n^\prime\nonumber$ than for any other value. The sequence is then
χ1=Gχχ2=Gχ1χ3=Gχ2⋮
until the ratios of components in $\chi_n$ stop changing.
- To avoid matrix inversion and multiplication, note that the sequence is equivalent to
Fχn=(λ−λq)Oχn−1
where $F = H-\lambda_qO$. The factor of $\left( \lambda - \lambda_q \right)$ can be dropped because this only affects the normalization of $\chi_n$. To find $\chi_n$, solve
Fχn=Oχn−1
(N equations in N unknowns). Then
λ=⟨χn|H|χn⟩⟨χn|χn⟩
Matrix Elements of H
H=−12∇21−12∇22−1r1−1r2+Z−1r12
Taking r1,r2 and r12 as independent variables,
∇21=1r21∂∂r1(r21∂∂r1)+1r212∂∂r12(r212∂∂r12)−l1(l1+1)r21+2(r1−r2cos(θ12))1r12∂2∂r1∂r12−2(∇Y1⋅r2)1r12∂∂r12
where ∇Y1 acts only on the spherical harmonic part of the wave function and the diagram
defines the complete set of 6 independent variables is $r_1$, $r_2$, $r_{12}$, $\theta_1$, $\varphi_1$, $\chi$.
If $r_{12}$ were not an independent variable, then one could take the volume element to be
dτ=r21dr1sin(θ1)dθ1dφ1r22dr2sin(θ2)dθ2dφ2.
However, $\theta_2$ and $\varphi_2$ are no longer independent variables. To eliminate them, take the point ${\bf r}_1$ as the origin of a new polar co-ordinate system, and write
dτ=−r21dr1sin(θ1)dθ1dφ1r212dr12sin(ψ)dψdχ
and use
r22=r21+r212+2r1r12cos(ψ)
Then for fixed $r_1$ and $r_{12}$,
2r2dr2=−2r1r12sin(ψ)dψ
Thus
dτ=r1dr1r2dr2r12dr12sin(θ1)dθ1dφ1dχ
The basic type of integral to be calculated is
I(l1,m1,l2,m2;R)=∫sin(θ1)dθ1dφ1dχYm1l1(θ1,φ1)∗Ym2l2(θ2,φ2)×∫r1dr1r2dr2r12dr12R(r1,r2,r12)
Consider first the angular integral. $Y^{m_2}_{l_2} \left( \theta_2, \varphi_2 \right)$ can be expressed in terms of the independent variables $\theta_1, \varphi_1, \chi$ by use of the rotation matrix relation
Ym2l2(θ2,φ2)=∑mD(l2)m2,m(φ1,θ1,χ)∗Yml2(θ12,φ)
where $\theta, \varphi$ are the polar angles of ${\bf r}_2$ relative to ${\bf r}_1$. The angular integral is then
Iang=∫2π0dχ∫2π0dφ1∫π0sin(θ1)dθ1Ym1l1(θ1,φ1)∗×∑mD(l2)m2,m(φ1,θ1,χ)∗Yml2(θ12,φ)
Use
Ym1l1(θ1,φ1)∗=√2l1+14πD(l1)m1,0(φ1,θ1,χ)
together with the orthogonality property of the rotation matrices (Brink and Satchler, p 147)
∫D(j)∗m,m′D(J)M,M′sin(θ1)dθ1dφ1dχ=8π22j+1δjJδmMδm′M′
to obtain
Iang=√2l1+14π8π22l1+1δl1,l2δm1,m2Y0l2(θ12,φ)=2πδl1,l2δm1,m2Pl2(cosθ12)
since
Y0l2(θ12,φ)=√2l1+14πPl2(cos(θ12))
Note that $P_{l_2} \left( \cos \theta \right)$ is just a short hand expression for a radial function because
cosθ12=r21+r22−r2122r1r2
The original integral is thus
I(l1,m1,l2,m2;R)=2πδl1,l2δm1,m2∫∞0r1dr1∫∞0r2dr2∫r1+r2|r1−r2|r12dr12R(r1,r2,r12)Pl2(cosθ12)
where again
cosθ12=r21+r22−r2122r1r2
is a purely radial function.
The above would become quite complicated for large $l_2$ because $P_{l_2}(\cos \theta_{12})$ contains terms up to $\left( \cos \theta_{12} \right)^{l_2}$. However, recursion relations exist which allow any integral containing $P_l \left( \cos \theta_{12} \right)$ in terms of those containing just $P_0 \left( \cos \theta_{12} \right) = 1$ and $P_1 \left( \cos \theta_{12} \right) = \cos \theta_{12}$.
Radial Integrals and Recursion Relations
The basic radial integral is
I0(a,b,c)=∫∞0r1dr1∫∞r1r2dr2∫r1+r2r2−r1r12dr12ra1rb2rc12e−αr1−βr2+∫∞0r2dr2∫∞r2r1dr1∫r1+r2r1−r2r12dr12ra1rb2rc12e−αr1−βr2=2c+2[(c+1)/2]∑i=0(c+22i+1){q!βq+1(α+β)p+1q∑j=0(p+j)!j!(βα+β)j+q′!αq′+1(α+β)p′+1q′∑j=0(p′+j)!j!(αα+β)j}
where
p=a+2i+2p′=b+2i+2q=b+c−2i+2q′=a+c−2i+2
The above applies for $ a, b \geq -2, c\geq -1$. $[x]$ means greatest integer in $x$.
Then
I1(a,b,c)=∫dτrra1rb2rc12e−αr1−βr2P1(cosθ)=12(I0(a+1,b−1,c)+I0(a−1,b+1,c)−I0(a−1,b−1,c+2))
The Radial Recursion Relation
Recall that the full integral is
I(l1m1,l2m2;R)=2πδl1,l2δm1,m2Il2(R)
where
Il2(R)=∫dτrR(r1,r2,r12)Pl2(cosθ12)
To obtain the recursion relation, use
Pl(x)=[P′l+1(x)−P′l−1(x)]2l+1
with
P′l+1(x)=ddxPl+1(x)
Here $x=\cos\theta_{12}$ and
ddcosθ12=dr12dcosθ12ddr12=−r1r2r12ddr12
Then,
Il(R)=−∫dτrRr1r2r12ddr12[Pl+1(cosθ12)−Pl−1(cosθ12)]2l+1
The $r_{12}$ part of the integral is
∫r1+r2|r1−r2|r12dr12Rr1r2r12ddr12[Pl+1−Pl−1]=Rr1r2[Pl+1−Pl−1]r1+r2|r1−r2|−∫r1+r2|r1−r2|r12dr12(ddr12R)r1r2r12[Pl+1−Pl−1]2l+1
The integrated term vanishes because
cosθ12=r21+r22−r2122r1r2=−1whenr212=(r1+r2)2=1whenr212=(r1−r2)2
and $P_l(1)=1$, $P_{l}(-1)=(-1)^{l}$
Thus,
Il+1(r1r2r12R′)=(2l+1)Il(R)−Il−1(r1r2r12R′)
If
R=ra−11rb−12rc+212e−αr1−βr2
then
Il+1(ra1rb2rc12)=2l+1c+2Il(ra−11rb−22rc+212)+Il−1(ra1rb2rc12)
For the case $c=-2$, take
R=ra−11rb−12lnr12e−αr1−βr2
Then
Il+1(ra1rb2rc12)=Il(ra−11rb−12lnr12)(2l+1)−Il−1(ra1rb2r−212)
This allows all $I_{l}$ integrals to be calculated from tables of $I_0$ and $I_1$ integrals.
The General Integral
The above results for the angular and radial integrals can now be combined into a general formula for integrals of the type
I=∫∫dr1dr2R1YM′l′1l′2L′(ˆr1,ˆr2)TQk1k2K(r1,r2)R2YMl1l2L(ˆr1,ˆr2)
where
YMl1l2L(ˆr1,ˆr2)=∑m1,m2⟨l1l2m1m2|LM⟩Ym1l1(ˆr1)Ym2l2(ˆr2)
and
TQk1k2K(r1,r2)=∑q1,q2⟨k1k2q1q2|KQ⟩Yq1k1(ˆr1)Yq2k2(ˆr2)
The basic idea is to make repeated use of the formula
Ym1l1(ˆr1)Ym2l2(ˆr1)=∑lm((2l1+1)(2l2+1)(2l+1)4π)1/2×(l1l2lm1m2m)(l1l2l000)Ym∗l(ˆr1)
where
Ym∗(ˆr)=(−1)mY−ml(ˆr)
and
(l1l2lm1m2m)=(−1)l1−l2−m(2l+1)1/2(l1l2m1m2|l,−m)
is a 3-$j$ symbol. In particular, write
Ym′1l′1(ˆr1)∗Yq1k1(ˆr1)Ym1l1(ˆr1)⏟=∑ΛM(…)YMΛ(ˆr1) ∑λ1μ1Yμ1λ1(ˆr1)Ym′2l′2(ˆr2)Yq2k2(ˆr2)Ym2l2(ˆr2)⏟=∑Λ′M′(…)YM′∗Λ′(ˆr2) ∑λ2μ2Yμ2λ2(ˆr2)
The angular integral then gives a factor of $2 \pi \delta_{\Lambda, \Lambda^{\prime}} \delta_{M, M^{\prime}} P_{\Lambda} \left( \cos \theta_{12} \right)$. The total integral therefore reduces to the form
I=∑ΛCΛIΛ(R1R2)
where $C_{\Lambda} = \sum_{\lambda_{1}, \lambda_{2}} C_{ \lambda_{1}, \lambda_{2}, \Lambda}$. For further details and derivations, including graphical representations, see G.W.F. Drake, Phys. Rev. A 18, 820 (1978).
Graphical Representation
Matrix Elements of H
Recall that
H=−12∇21−12∇22−1r1−1r2+Z−1r12
Consider matrix elements of
∇21=1r21∂∂r1(r21∂∂r1)+1r2∂∂r(r2∂∂r)−(→lY1)2r21+2(r1−r2cosθ)r∂2∂r1∂r−2(∇Y1⋅r2)1r∂∂r
where $r \equiv r_{12}$ and $\cos\theta \equiv \cos\theta_{12}$. Also in what follows, define $\hat{\nabla}^Y_1 \equiv r_1\nabla^Y_1$ where $\nabla^Y$ operates only on the spherical harmonic part of the wave function, and similarly for $\vec{l}_1^Y$.
A general matrix element is
⟨ra′1rb′2rc′12e−α′r1−β′r2YM′l′1l′2L′(ˆr1,ˆr2)|∇21|ra1rb2rc12e−αr1−βr2YMl1l2L(ˆr1,ˆr2)⟩
Since $\nabla^2_1$ is rotationally invariant, this vanishes unless $L=L^\prime$, $M=M^\prime$. Also $\nabla_1^2$ is Hermitian, so that the result must be the same whether it operates to the right or left, even though the results look very different. In fact, requiring the results to be the same yields some interesting and useful integral identities as follows:
Put
F=FYMl1l2L(ˆr1,^r2)
and
F=ra1rb2rc12e−αr1−βr2
Then
∇21F={1r21[a(a+1)−l1(l1+1)]+c(c+1)r2+α22α(a+1)r1+2(r1−r2cosθ)r1r2c[a−αr1]−2cr2(ˆ∇Y1⋅ˆr2)r2r1}F
and
⟨F′|∇21|F⟩=∑Λ∫dτrF′CΛ(1)PΛ(cosθ)×{1r21[a(a+1)−l1(l1+1)]−2α(a+1)r1+c(c+1)r2+α2+2(r2−r2cosθ)r1r2c(a−αr1)}F+∑Λ∫dτrF′CΛ(ˆ∇Y1⋅ˆr2)PΛ(cosθ)(−2cr2r1r2)F
where
∫dτr=∫∞0r1dr1∫∞0r2dr2∫r1+r2|r1−r2|rdr
For brevity, let the sum over $\Lambda$ and the radial integrations be understood, and let $\left( \nabla_1^2 \right)$ stand for the terms which appear in the integrand. Then operating to the right gives
(∇21)R=1r21[a(a+1)−l1(l1+1)]+c(c+1)r2+α2−2α(a+1)r1+2(r1−r2cosθ)r1r2c(a−αr1)−2cr2r1r2(ˆ∇Y1⋅ˆr2)
Operating to the left gives
(∇21)L=1r21[a′(a′+1)−l′1(l′1+1)]+c′(c′+1)r2+α′2−2α′(a′+1)r1+2(r1−r2cosθ)r1r2c′(a′−α′r1)−2c′r2r1r2(ˆ∇Y′1⋅ˆr2)
Now put
a+=a+a′,ˆ∇+1=ˆ∇Y1+ˆ∇Y′1a−=a−a′,ˆ∇−1=ˆ∇Y1−ˆ∇Y′1
etc., and substitute $a^\prime = a_+ - a$, $c^\prime = c_+ - c$, and $\alpha^\prime = \alpha_+ - \alpha$ in $(\nabla^2_1)_L$. If $a_+$, $c_+$ and $\alpha_+$ are held fixed, then the equation
(∇21)R=(∇21)L
must be true for arbitrary $a$, $c$, and $\alpha$. Their coefficients must thus vanish.
This yields the integral relations
(r1−r2cosθ)r1r2=1c+(−(a++1)r21+α+r1)
from the coefficient of $a$, and
(r1−r2cosθ)(a+−α+r1)r1r2=r2r1r2(ˆr2⋅ˆ∇+1)−(c++1)r2
from the coefficient of $c$. The coefficient of $\alpha$ gives an equation equivalent to $(I)$.
Furthermore, if can be show that (see problem)
∑Λ∫dτrrcr2CΛ(ˆr2⋅ˆ∇Y1)PΛ(cosθ)=∑Λ∫dτrrccr1r2CΛ(1)PΛ(cosθ)×(l′1(l′1+1)−l1(l1+1)−Λ(Λ+1)2)
and similarly for $\left( \hat{r}_2 \cdot \hat{\nabla}^{Y\prime}_1 \right)$ with $l_1$ and $l_1^\prime$ interchangeable, then it follows that
rc+r2(ˆr2⋅ˆ∇+1)=−rc+c+r1r2Λ(Λ+1)
and
rc+r2(ˆr2⋅ˆ∇−1)=rc+c+r1r2[l′1(l′1+1)−l1(l1+1)]
where equality applies after integration and summation over $\Lambda$.
Thus $(II)$ becomes
(r1−r2cosθ)(a+−α+r1)r1r2=−Λ(Λ+1)c+r21−(c++1)r2
Problem
Prove the integral relation
∑Λ∫dτrf(r1,r2)1r(ddrq(r))CΛ(ˆr2⋅ˆ∇Y1)PΛ(cosθ)=∑Λ∫dτrf(r1,r2)r1,r2q(r)CΛ(1)PΛ(cosθ)(l′1(l′1+1)−l1(l1+1)−Λ(Λ+1)2)
where $C_\Lambda \left( 1 \right)$ are the angular coefficients from the overlap integral
∫dΩYM∗l′1l′2L(ˆr1,ˆr2)YMl1l2L(ˆr1,ˆr2)=∑ΛCΛ(1)PΛ(cosθ12).
Hint: Use the fact that l21 is Hermitian so that ∫dτ(l21Y′)∗q(r)Y=∫dτY′∗l21(q(r)Y) with →l1=1i→r1×∇1.
It is also useful to use (cos2θ−1)PL(cosθ)=L(L−1)(2L−1)(2L+1)PL−2(cosθ)−2(L2+L−1)(2L−1)(2L+3)PL(cosθ)+(L+1)(L+2)(2L+1)(2L+3)PL+2(cosθ)=L(L−1)(2L−1)(2L+1)[PL−2(cosθ)−PL(cosθ)]+(L+1)(L+2)(2L+1)(2L+3)[PL+2(cosθ)−PL(cosθ)]
together with a double application of the integral recursion relation IL+1(1rddrq(r))=(2L+1)IL(1r1r2q(r))+IL−1(1rddrq(r))
Of course l21YMl1l2L(ˆr1,ˆr2)=l1(l1+1)YMl1l2L(ˆr1,ˆr2).
General Hermitian Property
Each combination of terms of the form <f>R=a2f1af2+abf3+b2f4+bf5+af6∇y1+bf7∇y2+f8(y) can be rewritten <f>L=(a+−a)2f1+(a+−a)f2+(a+−a)(b+−b)f3+(b+−b)2f4+(b+−b)f5+(a+−a)f6∇y′1+(b+−b)f7∇y′2+f8)y′
Since these must be equal for arbitrary a and b, a2+f1+a+f2++b+f3+b+f4+b+f5+a+f6∇y′1+b+f7∇y′2+f8(y′)−f8(y)=0
Adding the corresponding expression with y and $y^\prime$ interchanged yields a2+f1+a+f2+a+b+f3+b2+f4+b+f5+12a+f6∇+1+12b+f7∇+2=0
Subtracting gives f8(y)−f8(y′)=−12[a+f6∇−1+b+f7∇−2] a[−2a+f1−2f2−b+f3−f6=∇+1]=0 b[−2b+f4−2f5−a+f3−f7∇+2]=0
Adding the two forms gives
<f>R+<f>L=12(a2++a2−)f1+a+f2+12(a+b++a−b−)f3 +12(b2++b2−)f4+b+f5+12f6(a+∇+1+a−∇−1) +12(b+∇+2+b−∇−2)+f8(y)+f8(y′)
Subtracting x2×(I) gives
<f>R+<f>L=12[(1−x)a2++a2−]f1+(1−x2)a+f2+12[(1−x)a+b++a−b−]f3+12[(1−x)b2++b2−]f4+(1−x2)f5b++12f6[(1−x2)a+∇+1+a−∇−1]+12[(1−x2)b+∇+2+b−∇−2]+f8(y)+f8(y′)
If x=1, <f>R+<f>L=12[a2−f1+a+f2+a−b−f3+b2−f4+b+f5+(12a+∇+1+a−∇−1)f6+(12b+∇+2+b−∇−2)f7]+f8(y)+f8(y′)
The General Hermitian Property for arbitrary x gives ∇21=14{1r21[(1−x)a2++a2−+2(1−x2)a+−2[l1(l1+1)+l′1(l′1+1)]−2r1[(1−x)α+a++α−a−+2(1−x2)α+]+(1−x)α2++α2−]+2(r1−r2cosθ)r1r2[(1−x)(a+−α+r1)c++(a−−α−r2)c−]−2r2r1r2[(1−x2)c+ˆr2⋅ˆ∇+1+c−ˆr2ˆ∇−1]+(1−x)c2++c2−+2(1−x2)c+r2}.
Use −2r2r1r2[(1−x2)c+ˆr2ˆ∇+1+c−ˆr2ˆ∇−1]=2r21[(1−x2)Λ(Λ+1)−c−c+[l′1(l′1+1)+l1(l1+1)]],
2(r1−r2cosθ)r1r2(a−−α−r1)c−=2c−c+[−1r21[a−(a++1)]+1r1[a−α++α−(a++2)]−α−α+],
and 2(r1−r2cosθ)r1r2(a+−α+r1)c+=2[−1r21[a+(a++1)]+1r1[a+α++α+(a++2)]−α2+].
Substituting into ∇21 gives
∇21=14{1r21[−(1−x)a2++a2−+xa++2(1−x2)Λ(Λ+1)−2l1(l1+1)(1−c−c+)−2l′1(l′1+1)(1+c−c+)−2c−a−c+(a++1)]−2r1[−(1−x)α+(a++2)+α−a−+2(1−x2)α+−c−c+[a−α++α−(a++2)]]−(1−x)α2++α2−−2c−c+α−α++1r2[(1−x)c2++c2−+2(1−x2)c+]}
This has the form ∇21=14[A1r21+B1r1+C1r2+D1] with
A1=−(1−x)a2++a2−+xa++2(1−x2)Λ(Λ+1)−2l1(l1+1)(1−c−c+) −2l′1(l′1+1)(1+c−c+)−2c−a−c+(a++1)
B1=2[(1−x)α+(a++2)−α−a−−2(1−x2)α++c−c+[a−α++α(a++2)]]
C1=(1−x)c2++c2−+2(1−x2)c+
D1=−1(1−x)α2++α2−−2c−c+α−α+
The complete Hamiltonian is then (in Z-scaled a.u.)
H=−12∇21−12∇22−1r1−1r2+Z−1r=−12∇21−12∇22−1r1−Z−1Zr2+Z−1(1r−1r2) =−18[A1r21+B1+8r1+C1r2+D1+D2+A2r22+B2+8(Z−1)/Zr2+C2r2]+Z−1(1r−1r2)
The screened hydrogenic energy is ESH=−12[1n21+(Z−1Z)21n22] so that
H−ESH=−18[A1r21+B1+8r1+C1r2+D1+D2−4n21−(Z−1Z)24n22+A2r2+B2+8(Z−1)/Zr2+C2r2]+Z−1(1r−1r2)
Optimization of Nonlinear Parameters
- The traditional method of performing Hylleraas calculations is to write the basis set in the form
Ψ=∑i,j,kci,j,kφi,j,k(α,β)±exchange
with
φi,j,k(α,β)=ri1rj2rk12e−αr1−βr2YMl1l2L.
The usual procedure is to set α=Z so that it represents the inner 1s electron, and then to vary β so as to minimize the energy.
Since β appears in Ψ as a non-linear parameter, the entire calculation must be repeated for each value of β. However, the minimum becomes progressively smaller as the basis set is enlarged.
- Difficulties
1. If the basis set is constructed so that i+j+j≤N, the the number of terms is (N+1)(N+2)(N+3)/6 N=14 already gives 680 terms and an accuracy of about 1 part in 1010 for low-lying states. A substantial improvement in accuracy would require much larger basis sets, together with multiple precision arithmetic to avoid loss of significant figures when high powers are included.
2. The accuracy rapidly deteriorates as one goes to more highly excited states - about 1 significant figure is lost each time the principle quantum number is increased.
- Cure
We have found that writing basis sets in the form Ψ=∑i,j,k(c(1)i,j,kφijk(α1β1)+c(2)ijkφijk(α2β2))±exchange=ψ(r1,r2)±ψ(r2,r1) so that each combination of powers is included twice with different nonlinear parameters gives a dramatic improvement in accuracy for basis sets of about the same total size.
However, the optimization of the nonlinear parameters is now much more difficult, and an automated procedure is needed.
If E=<Ψ|H|Ψ><Ψ|Ψ> and we assume <Ψ|Ψ>=1, then
∂E∂αt=−2<Ψ|H−E|r1ψ(r1,r2;αt)±r21ψ(r2,r1;αt)> where ψ(r1,r2;αt)=∑i,j,kc(t)ijkφijk(αtβt)
- Problem
1. Prove the above.
2. Prove that there is no contribution from the implicit dependence of c(t)ijk on αt if the linear parameters have been optimized.
The Screened Hydrogenic Term
If the Hamiltonian is written in the form (in Z-scaled a.u.) H=H0(r1,Z)+H0(r2,Z−1)+Z−1(1r−1r2) with H0(r1,Z)=−12∇21−1r1 and H0(r2,Z−1)=−12∇22−Z−1Zr2, then the eigenvectors of H0(r,Z)+H0(r2,Z−1) are products of hydrogenic orbitals Ψ0=ψ0(1s,Z)ψ0(nl,Z−1) and the eigenvalue is ESH=[−12−(Z−1Z)212n2]Z2a.u. called the screened hydrogenic eigenvalue. From highly excited states, ESH and Ψ0are already excellent approximations.
For example, for the 1s8d states, the energies are
E(1s8d1D)=−2.00781651256381a.u.E(1s8d3D)=−2.00781793471171a.u.ESH=−2.0078125
It is therefore advantageous to include the screened hydrogenic terms in the basis set so that the complete trial function becomes Ψ=c0Ψ0+∑ijk[c(1)ijkφijk(α1,β1)+c(2)ijkφ(α2,β2)]±exchange
Also, the variational principal can be re-expressed in the form E=ESH+<Ψ|H−ESH|Ψ><Ψ|Ψ> so that the ESH term can be cancelled analytically from the matrix elements.
For example <Ψ0|H−ESH|Ψ0>=0
and <φijk|H−ESH|Ψ0>=Z−1<φijk|1r−1r2|Ψ0>
Recall that I0(a,b,c)=2c+2[c+1]/2∑i=0(c+22i+1)[f(p,q;β)+f(p′,q′;α)] where
f(p,q;x)=q!xq+1(α+β)p+1p∑j=0(p+j)!j!(xα+β)j
p=a+2i+2p′=b+2i+2q=b+c−2i+2q′=a+c−2i+2
If β≪α, the f(p,q;β)≫f(p′,q′;α) and the i=0 term is the dominant contribution to f(p,q;β). However, since this term depends only on the sum of powers b+c for r2, it cancels exactly from the matrix element of \begin{eqnarray}\frac{1}{r_{12} - \frac{1}{r_2}\end{eqnarray} and can therefore be omitted in the calculations of integrands, there by saving many significant figures. This is especially valuable when b+c is large and a is small.
Small Corrections
- Mass Polarization
If the nuclear mass is not taken to be infinite, then the Hamiltonian is H=P2N2M+∑i=1p2i2m+V
where M=MA−nm is the nuclear mass and m is the electron mass.
Change variables to the C of M R=1M+nm[MrN+m(r1+r2+…)]
and relative variables
si=ri−rN.
Then
H=12(M+nm)p2R+12mn∑i=1p2si+12M∑i,kpsi⋅psk+V(s1,s2,…)=1M+nmp2R+12μn∑i=1p2si+12M∑i≠kpsi⋅psk+V(s1,s2,…)
where μ=(1M+1m)−1
If n=2, the Schroedinger equation is [−ℏ22μ(∇2s1+∇2s2)−ℏ2M∇s1∇s2−Ze2s1−Ze2s2+e2s12]ψ=Eψ.
Define the reduced mass Bohr radius aμ=hbar2μe2=mμa0 and
ρi=Zsiaμε=EZ2(e2/aμ)
The Schroedinger equation then becomes [−12(∇2ρ1+∇2ρ2)−μM→∇ρ1⋅→∇ρ2−1ρ1−1ρ2+Z−1ρ12]ψ=εψ
The effects of finite meass are thus
1. A reduced mass correction to all energy levels of (e2/aμ)/(e2/a0)=μ/m, ie. EM=μmE∞
2. A specific mass shift given in first order perturbation theory by Δε=−μM<ψ|∇ρ1⋅∇ρ2|ψ> or
Δε=−μM<ψ|∇ρ1⋅∇ρ2|ψ>e2aμ=−μ2mM<ψ|∇ρ1⋅∇ρ2|ψ>e2a0
Since ∇ρ1⋅∇ρ2 has the same angular properties as ρ1⋅ρ2, the operator is like the product of two dipole operators. For product type wave functions of the form ψ=ψ(1s)ψ(nl)±exchange the matrix element vanishes for all but p-states.