Hückel Molecular Orbital Theory

In the chapter on cyanine dyes, you learned several important concepts.  You saw that a simple model, the particle in a box, could be used to understand the spectra of conjugated molecules.  The model clearly showed the relationship of chain length and absorption wavelength and let you compute longest absorption wavelength by using the energy difference between the highest energy occupied molecular orbital, the HOMO, and the lowest unoccupied molecular orbital, the LUMO.  You also learned that certain transitions were allowed and others not allowed and were able to predict these by using symmetry arguments and the transition moment integral.

In this chapter we will explore another model. This new model is called the Hückel model or Hückel Molecular Orbital (HMO) theory.  Through HMO theory we will extend our understanding of molecular orbital methods.  The HMO theory is an excellent vehicle for this extension as it employs some of the mathematical concepts and techniques that are used in current high-level ab initio molecular orbital calculations, which are considered in the next chapter.

We will use linear combination of atomic orbitals (the LCAO method) to form each molecular orbital.  We will create and solve the secular equation for computing the energies of the molecular orbitals.  Matrix methods, like those employed for vibrational normal mode analysis, will be used to solve the secular equation for large molecular systems.  Charge densities for each atom in the molecule and bond orders for bonded pairs of atoms will be computed.  Also, molecular orbitals will be used for computing chemically interesting properties such as the site of electrophilic or nucleophilic attack in a reaction.

 

 


12.1 Application to Conjugated Molecules

Conjugated double bonds are double bonds that alternate with single bonds along a molecular chain or in a molecular ring.  Conjugated molecules are those that contain this pattern of alternating double and single bonds.  These molecules have equivalent resonance structures and interesting chemical properties.  Benzene is the prime example.  Conjugated compounds were the first class of polyatomic molecules studied by molecular orbital theory because of their extended pi system of electrons.  Pi electrons are those electrons found in pi orbitals.  Pi orbitals are composed of pz atomic orbitals that are perpendicular to a plane of symmetry in the molecule.  Pi orbitals change sign upon reflection in this plane, whereas sigma orbitals do not.

Several conjugated molecules are shown in Figure 12.1.1.  Notice that they include both non-cyclic and cyclic molecules and heteroatoms such as nitrogen and oxygen.  Hetero means different from carbon and hydrogen.  One of the most important members of this group of molecules is benzene, which is given a special place in organic chemistry because of its stability.

Figure 12.1.1. Sample conjugated molecules.

We can address a number of questions pertaining to benzene and other conjugated molecules using Hückel molecular orbital theory.  What is the cause of benzene's stability?  Can we quantify this stability?  Since all the bonds in benzene have the same length, can we account for the fact that the bonds in benzene are characterized as intermediate between single and double bonds?  Can we write molecular orbitals for benzene and systematically explain some of its chemistry using a quantitative molecular orbital model?  Can we explain the chemical reactivity and spectroscopic properties of other diverse conjugated molecules with the same model?

Although the Hückel model is simple, is it sufficiently robust to predict trends and produce insights that are not possible through the very simple particle-in-a-box model that was used to describe the cyanine dyes.  We will explore these questions in the following sections of this chapter and along the way introduce some of the mathematical tools needed in this molecular orbital approach and the molecular orbital methods used by scientists today.  Just to keep things in perspective, remember that Hückel theory was developed at the time when computers were not available to chemists, and a simple mathematical approach was a very important development.  Although HMO theory includes some drastic assumptions, it enabled the early calculations to be done with mechanical calculators or by hand.

 

12.2. Describing the Sigma and P Electrons Separately.

One of the important ideas used over and over again in Quantum Mechanics is the separation of variables.  In the case of conjugated molecules, we have a chemical system where the separation of variables naturally falls out of our studies in organic chemistry.  When one considers the pi and sigma electrons in a molecule like ethylene or 1,3-butadiene, it is easy to organize the electrons into sigma electrons and pi electrons.  This separation follows from the molecular symmetry and the fact that hybrid atomic orbitals are used to describe bonds in molecules.  The orbitals that form the sigma bonds are the sp2-hybrid orbitals on carbon and the s-orbital on hydrogen as shown in Figure 12.2.1.  The 2pz orbital on carbon is perpendicular to the plane of these three hybrid orbitals and forms a pi bond through sideways overlap with a pz-orbital on a neighboring atom.  This picture of bonding is very successful when carbon is bonded to three other atoms because it provides three orbitals that are highly directional.

 

Figure 12.2.1.  Diagram of sp2 hybrid orbitals.

 

In quantum chemistry, this picture of hybridization and sigma/pi bonding is translated into a mathematical statement about the molecular orbitals in a molecule.  The total wavefunction of a molecule is written as a product of a sigma part and a pi part. 

                                                                               12.2.1
where  is the wavefunction describing the electrons in sigma orbitals and  is the wavefunction describing the electrons in pi orbitalsSuch a product function would be an eigenfunction of the molecular Hamiltonian if the sigma and pi electrons did not interact.

The wavefunction for the pi electrons then is described as a product of all the pi molecular orbitals.

                                                                 12.2.2
Each , with j = 1…N, represents a molecular orbital, i.e. a wavefunction for one electron moving in the electrostatic field of the nuclei and the other electrons.  Two electrons with different spin are placed in each molecular orbital so that the number of occupied molecular orbitals N is half the number of electrons, n, i.e. N = n/2.

Each molecular orbital, , is written as a linear combination of atomic orbitals (LCAO).

            with  ,                                            12.2.3
where  is the 2pz atomic orbital on atom r of the conjugated pi system.  The number of molecular orbitals that one obtains by this procedure is equal to the number of atomic orbitals.  Consequently, the indices j and r both run from 1 to N.  The cjr are the weighting coefficients for the atomic orbitals in the molecular orbital.  These coefficients are not necessarily equal, or in other words, the orbital on each carbon atom is not used to the same extent to form each molecular orbital.  Different values for the coefficients give rise to different net charges at different positions in a conjugated molecule.  This charge distribution is very important when discussing spectroscopy and chemical reactivity.

The energy of the jth molecular orbital is given by a one-electron Schrödinger equation using an effective one electron Hamiltonian, heff, which expresses the interaction of an electron with the rest of the molecule.

                                                                               12.2.4
 is the energy eigenvalue of the jth molecular orbital, corresponding to the eigenfunction .  The beauty of this method, as we will see later, is that the exact form of heff is not needed.  The total pi energy of the molecule is the sum of the single electron energies.

                                                                               12.2.5
where nj is the number of electrons in orbital j.

The expectation value expression for the energy for each molecular orbital is used to find εj and then Eπ.

          .                                        12.2.6
The notation , which is called a bra-ket, just simplifies writing the expression for the integral.  Note that the complex conjugate now is identified by the left-side position and the bra notation  and not by an explicit *.

After substituting Equation 12.2.3 into 12.2.6, we obtain for each molecular orbital

                                                           12.2.7
which can be rewritten as

                                                             12.2.8
where the index j for the molecular orbital has been dropped because this equation applies to any of the molecular orbitals.

Exercise 12.2.1.  Consider a molecular orbital made up of three atomic orbitals, e.g. the three carbon 2pz orbitals of the allyl radical, where the internuclear axes lie in the xy-plane. Write the LCAO for this MO.  Derive the full expression, starting with Equation 12.2.6 and writing each term explicitly, for the energy expectation value for this LCAO in terms of heff.  Compare your result with Equation 12.2.8 to verify that Equation 12.2.8 is the general representation of your result.

 

Exercise 12.2.2.  Write a paragraph describing how the Variational Method could be used to find values for the coefficients cjr in the linear combination of atomic orbitals.

 

To simplify the notation we use the following definitions.  The integrals in the denominator of Equation 12.2.8 represent the overlap between two atomic orbitals used in the linear combination.  The overlap integral is written as Srs.  The integrals in the numerator of Equation 12.2.8 are called either resonance integrals or coulomb integrals depending on the atomic orbitals on either side of the operator heff as described below.

 is the overlap integral.  because we use normalized atomic orbitals.  For atomic orbitals r and s on different atoms,  has some value between 1 and 0: the further apart the two atoms, the smaller the value of .

 is the Coulomb Integral.  It is the kinetic and potential energy of an electron in, or described by, an atomic orbital, r, experiencing the electrostatic interactions with all the other electrons and all the positive nuclei.

 is the Resonance Integral or Bond Integral.  This integral gives the energy of an electron in the region of space where the functions r and s overlap.  This energy sometimes is referred to as the energy of the overlap charge.  If r and s are on adjacent bonded atoms, this integral has a finite value.  If the atoms are not adjacent, the value is smaller, and assumed to be zero in the Hückel model.

In terms of this notation, Equation 12.2.8 can be written as

                                                          .                                                12.2.9

We now must find the coefficients, the c's.  One must have a criterion for finding the coefficients.  The criterion used is the Variational Principle.  Since the energy depends linearly on the coefficients in Equation 12.2.9, the method we use to find the best set of coefficients is called the Linear Variational Method.   This method is similar to the nonlinear variational method described in Chapter 10 for optimizing the parameters in a wavefunction.

 

12.3. The Linear Variational Method.

The Variational Principle tells us that the energy we compute for any trial wavefunction used in equation 12.2.6 will be greater than the exact energy for that Hamiltonian.  In more mathematical terms, any energy we compute is an upper bound to the true energy for the molecular system.  We write this statement as

                                                                                                                          12.3.1

If the trial wavefunction we use has adjustable parameters, then we can vary the parameters in a systematic way to find their optimum values.  The optimum values give the lowest energy for the trial function.  Initial guess values of the parameters will always give energies that are greater than the best energy that one could obtain from a given function unless you are extremely lucky and happen to guess the optimum values.  This problem is equivalent to finding the minimum in a potential energy function or any other mathematical function.  To accomplish this minimization, rather than making trial guesses, it is more efficient to use the calculus approach of taking the derivative of the energy function with respect to each parameter and setting each derivative equal to zero.  If you have two parameters, you then will have two simultaneous equations to solve.  If you have N parameters you will have N simultaneous equations.

The task is to minimize the energy with respect to all the coefficients by solving the N simultaneous equations produced by differentiating Equation 12.2.9 with respect to each coefficient.
                     for  t = 1, 2, 3, … N                                            12.3.2
Actually we also should differentiate Equation 12.2.9 with respect to the , but this second set of N equations is just the complex conjugate of the first and produces no new information or constants.

To carry out this task, rewrite Equation 12.2.9 to obtain Equation 12.3.3 and then take the derivative of Equation 12.3.3 with respect to each of the coefficients.

                                                          12.3.3
Actually we don't want to do this differentiation N times, so consider the general case where the coefficient is ct.  Here t represents any number between 1 and N.

This differentiation is relatively easy, and the result, which is shown by Equation 12.3.4, is relatively simple because some terms in Equation 12.3.3 don't involve ct and others depend linearly on ct.  The derivative of the terms that don't involve ct is zero (e.g.  ).  The derivative of terms that contain ct is just the constant factor that multiples the ct, (e.g.  ).  Consequently, only terms in Equation 12.3.3 that contain ct contribute to the result, and whenever a term contains ct, that term appears in Equation 12.3.4 without the ct because we are differentiating with respect to ct.  The result after differentiating is

                                                        .                                              12.3.4

If we take the complex conjugate of both sides, we obtain                     
                   .                                                          12.3.5
Since ε = ε*, , and , this equation can be reversed and written as

                                                                       12.3.6
or upon rearranging as

          .                                                           12.3.7
There are N simultaneous equations that look like this general one; N is the number of coefficients in the LCAO.  Each equation is obtained by differentiating Equation 12.3.3 with respect to one of the coefficients.

Exercise 12.3.1.  Explain why the energy ε = ε*, show that  (write out the integral expressions and take the complex conjugate of Srt), and show that  (write out the integral expressions, take the complex conjugate of Hrt, and use the Hermitian property of quantum mechanical operators).