Vibrational spectroscopy by means of first-principles molecular dynamics simulations
Funding information: University of Zurich
Abstract
Vibrational spectroscopy is one of the most important experimental techniques for the characterization of molecules and materials. Spectroscopic signatures retrieved in experiments are not always easy to explain in terms of the structure and dynamics of the studied samples. Computational studies are a crucial tool for helping to understand and predict experimental results. Molecular dynamics simulations have emerged as an attractive method for the simulation of vibrational spectra because they explicitly treat the vibrational motion present in the compound under study, in particular in large and condensed systems, subject to complex intramolecular and intermolecular interactions. In this context, first-principles molecular dynamics (FPMD) has been proven to provide an accurate realistic description of many compounds. This review article summarizes the field of vibrational spectroscopy by means of FPDM and highlights recent advances made such as the simulation of Infrared, vibrational circular dichroism, Raman, Raman optical activity, sum frequency generation, and nonlinear spectroscopies.
This article is categorized under:
- Electronic Structure Theory > Ab Initio Electronic Structure Methods
- Theoretical and Physical Chemistry > Spectroscopy
- Molecular and Statistical Mechanics > Molecular Mechanics
- Electronic Structure Theory > Density Functional Theory
Graphical Abstract
1 INTRODUCTION
Spectroscopy, in general terms, is the study of the interaction of light with matter. This includes absorption and scattering effects observed when a light source is pointed at a sample consisting of a gas, liquid, solid, and mixtures thereof. Such experiments can take place in the whole range of the electromagnetic spectrum, ranging from low-frequency radio and microwave frequencies for nuclear magnetic resonance (NMR) and electron paramagnetic resonance (EPR) experiments, over the infrared region for vibrational spectra, the ultraviolet-visible (UV/VIS) range for electronic excitations, and the XUV/X-ray region for ionization experiments.
From a theoretical point of view, there are two main ways of simulating vibrational spectroscopic experiments: the static approach and the dynamic approach. In the static approach, the nuclear positions are first optimized at a fictitious temperature of 0 K. Then the coordinates are either displaced according to a finite differences scheme or perturbatively, while usually only the electron density response to the radiation is considered for the calculation of properties. Utilizing the harmonic approximation and computing the second derivative of the electronic energy with respect to nuclear positions (the Hessian) around a local minimum nuclear structure, the vibrational spectrum results from derivatives of the system's properties along the vibrational normal modes. In this picture, the eigenvectors of the mass-weighted Hessian matrix are the vibrational normal modes, while the corresponding eigenvalues give access to vibrational frequencies {ωi}. A static calculation, where the nuclear positions are fixed, is usually an appropriate approach for many kinds of spectroscopy, such as X-ray and NMR. However, it can be advantageous to sample the vicinity of the equilibrium structure and then take the averages of the properties of interest retrieved from multiple calculations,1 especially when simulating spectra of nonrigid and large molecules. Generally, for vibrational spectra and spectra probing reaction dynamics, the dynamic approach is superior due to being able to sample the potential energy surface (PES) at finite temperatures. In this approach, the nuclear positions are propagated according to classical or quantum dynamics.
Nuclear quantum effects, can have a large influence on computed spectra depending on the studied system, and have been used to achieved chemical accuracy for small and rigid molecules.2-4 Approaches including nuclear quantum dynamics incur a high computational cost and are thus limited to small system sizes.5 Nuclear quantum effects, which can also be retrieved from path integral molecular dynamics,6 are in general most significant at low temperatures and in the presence of light atoms, such as hydrogen.
The first classical MD simulations, based on empirical force fields, have kicked off the field of (IR) spectra simulations via molecular dynamics.7-10 Since then, more elaborate schemes have appeared to improve the force fields based on chemical intuition and available computational resources. Advances were not only made in force field development but also in methodology. For example, to extract the time-dependent Hamiltonian operator from classical MD trajectories to calculate IR spectra.11
A higher accuracy compared to classical MD can usually be expected from first-principles molecular dynamics (FPMD). Vibrational spectroscopy via FPMD simulations is the focus of this review. We restrict this article to the discussion of vibrational spectra and do not consider rovibrational transitions. We cover the basic theory behind the constituents of molecular dynamics simulations, including the general ideas in the Born–Oppenheimer (BO) and Car–Parrinello approaches. For detailed derivations and explanations on the peculiarities of implementations we refer the reader to the literature.12, 13 We will focus on the advancements and applications of first principles-, especially density functional theory (DFT)-based, MD simulations for vibrational spectra, that were achieved in recent years by leveraging efficient implementations and ever-growing computational power of supercomputers around the world.
The great advantage of FPMD simulations compared to static calculations—where usually no sampling of the PES is carried out and quantum dynamics are neglected—is the possibility of relating spectroscopic signals with the evolution of the microscopic structure of the sample, including rearrangements and the natural inclusion of certain anharmonic effects. Additionally, complex interactions in floppy molecules and condensed phase systems, such as hydrogen-bonding networks, isomerizations, and solvent interactions are directly observable in MD simulations. Thus, especially for large systems, FPMD is the preferred method for the simulation of vibrational spectra. A frequently studied and yet not fully understood system is, for instance, liquid water.14 The complex hydrogen bonding network results in spectroscopic bands that can only be understood by combining experiments with (FP)MD simulations. Chemical reactions and other dynamic processes can also only be understood experimentally and theoretically in a time-dependent FPMD description of the model system. Such theoretical descriptions might also be combined with the solution of the (time-dependent) electronic structure problem via, for example, time-dependent DFT15 or density functional perturbation theory (DFPT)16 for properties needed to calculate specific vibrational spectra.
Our review is divided into two parts. First, we explain the theoretical framework of FPMD, within the BO approximation. Then we shortly explain how the use of time-correlation functions allows for calculating spectra from MD trajectories. In the second part of the review, we highlight applications of FPMD for infrared absorption (IR), vibrational circular dichroism (VCD), Raman, Raman optical activity (ROA), sum-frequency generation (SFG), and nonequilibrium spectroscopy. Finally, we present the conclusion and an outlook highlighting possible improvements and novel techniques that might be applied for vibrational spectroscopy via FPMD in the future.
2 THEORY
2.1 Constituents of first-principles molecular dynamics
2.1.1 General
MD simulations are used to simulate the time-resolved dynamics of molecular, liquid, and solid-state systems at finite temperatures. Time discretization into fixed steps δt is used to propagate the positions and velocities of the atoms in the system to integrate the (classical) equations of motion. The propagation can be governed solely by quantum chemical (QC) calculations of the PES, by a mix of QC and classical physics in semi-classical approaches, or purely by classical MD employing parameterized force fields. For the latter two, the leap-frog or velocity Verlet algorithm17 is used in standard calculations. In this case, at each (half) time step, either the positions or the forces and velocities are updated according to the chosen level of theory.18
For simulations of large systems and time scales, it is advantageous to use force fields fitted to experiments or data acquired from first principles. In this case, the forces are determined by considering bond lengths, bond angles and dihedral angles among other classifiers.19-26 Such force fields can also include information on bond cleavage and formation to simulate reactive processes,27, 28 although the quality of the description of such processes depends on the fitting procedure and training data and is not generally transferable to other systems. Alternatively, the forces acting on the nuclei can be determined on the fly from QC calculations. QC methods are then employed to calculate the gradient of the PES to retrieve accurate forces based on the chosen level of theory. The quality of the forces is the most important factor determining the quality of the physical quantities that are extracted from the MD simulation.29 While ideally, high-level QC methods would be employed, their computational cost—especially for large and condensed systems—makes their application difficult. The electronic structure method employed most often and very successfully is DFT in combination with BO-MD30 or Car–Parrinello molecular dynamics (CPMD).13 While DFT-MD can be regarded as a semi-classical method because of the treatment of the nuclei by Newtonian dynamics, it is free of ad hoc parameterizations compared to force fields that are the result of an empirical fitting procedure.29 Still, the choice of exchange-correlation functional and basis set can, among others, significantly impact the obtained results in Kohn–Sham DFT-MD. Temperature effects are taken into account by coupling to a heat bath via a thermostat. In this way, the temperature of the simulation can be defined and observed during the MD run.31-33 The simulated MD trajectories should, in principle, be as long as possible to ensure sufficient sampling of the phase space. In practice, the calculated spectra are usually monitored to stop the MD run as soon as the spectrum does not change anymore, assuming sufficient sampling. The achieved time scales in MD simulations vary from picoseconds to microseconds depending on the level of theory and number of atoms, where the number of atoms or particles can go up to 1012 when high-performance computing architectures are leveraged.34-37
2.1.2 Born–Oppenheimer approximation
MD calculations are carried out at finite temperatures. A thermostat has to be applied to maintain the ensemble during an MD run. There are multiple choices for thermostats.33, 39, 58-62 Furthermore, one must choose a statistical ensemble according to which to propagate the system in question. In the microcanonical ensemble, the number of particles, the volume, and the energy are kept constant (NVE).63 In the canonical ensemble, instead of the total energy, the temperature is conserved (NVT).63 A special case of the Gibbs canonical ensemble keeps the pressure constant instead of the volume (NPT).64 And in the grand canonical ensemble, which poses the greatest difficulties in numerical simulations, the chemical potential μ is kept constant instead of the particle number (μVT).63 For the simulation of vibrational spectra via FPMD, the NVT ensemble is usually first applied for equilibration, and the spectra are then collected in the NVE ensemble.
2.1.3 Beyond the BO approximation
While the BO approximation holds for many systems, it is in certain cases advantageous to propagate the system not only on the ground state adiabatic PES but also on multiple PESs belonging to different electronic states. The consideration of additional electronic states is crucial in systems where the electronic states are not well separated in energy. In such cases, very fast chemical processes like photodissociation, photoisomerization, and internal conversion can take place.65, 66
A BO treatment of nuclei neglects not only the couplings of energetically close electronic states but also the quantum nature of light atoms, and the vibrational zero-point energy (ZPE).67 Especially for high-frequency motions of groups involving hydrogen atoms, the ZPE might be large compared to the classical vibrational energy at room temperature, leading to frequency shifts of calculated stretching modes compared to experimental results.46 Furthermore, nuclear quantum effects can be included in MD simulations by employing the path-integral approach of Feynman.2, 68-70 We will not go into the details of the treatment of nuclear quantum effects in this review and instead refer the reader to the literature.2, 71
2.2 Analysis of FPMD trajectories for the simulation of vibrational spectra
At the core of MD simulations is the PES, a 3-N-dimensional hypersurface in the configuration space of the investigated system, where N is the number of atoms. The configuration space needs to be sampled such that the statistical ensemble is approximated in order to be able to apply statistical analyses to trajectories resulting from an MD run. From a given trajectory, it is impossible to determine whether all parts of the configuration space are sampled sufficiently.76 In many cases, enhanced sampling techniques like well-tempered metadynamics77 or umbrella sampling78 can be used to mitigate this problem, but they do not apply to vibrational spectroscopy. Although it is impossible to determine whether the time average of a physical observable has converged to the ensemble average, the length of an FPMD run is in any case limited by the available computational resources. Because the employed time step needs to be small enough to resolve vibrational motion, the total run time cannot be extended easily. In the case of vibrational spectroscopy, the change observed in the simulated spectra after each time step can still serve as a signal for the convergence of the simulation, in particular, if no large conformational changes are expected in the simulation. This issue can be further alleviated by taking averages not only of a single MD run but averaging multiple MD runs that have been started from different initial conditions. These initial conditions, in turn, can be found from separate MD runs at the quantum or classical level. In this way probabilistic statements can be made about the simulations so that researchers can provide errors bars and standard deviations when plotting results.21, 79-82
2.2.1 Sampling of the configuration space
The averaging over many points in configuration space is the most important advantage of MD spectroscopy over static calculations. In the harmonic approximation, the PES around the equilibrium geometry is approximated by a harmonic potential. This approximation does, of course, not hold in the limit of dissociation reactions but can also be a source of error in vibrational spectra governed by anharmonic effects.83 Corrections to vibrational modes that are obtained from the Hessian matrix at a local minimum on the PES have been published for different wave function methods, basis sets, and DFT functionals84-86 in the form of empirical linear scaling factors. A nonempirical way of approximating the anharmonicity of vibrational modes and frequencies is the evaluation of higher-order derivatives of the PES by perturbation theory,87 vibrational configuration interaction, and vibrational self-consistent field (VSCF) approaches.88-90 Without perturbation theory, the anharmonic effect due the shape of the PES is accessible in a nonempirical manner via FPMD. Nuclear quantum effects are still not accounted for in this approach91 as are overtones, mode coupling, combination bands, and Fermi resonances,47 reducing the overall accuracy compared to high-level methods based on VSCF. An additional problem of FPMD, and especially DFT-MD, is the difficulty to systematically increase the accuracy of simulations, or to even quantify the error introduced by the choice of functional, basis set, and other approximations. Approaches based on the solution of the vibrational Schrödinger equation, such as the multiconfigurational time-dependent Hartree method,92 do not suffer from this problem, but are not applicable to large systems at all.
In the widely used static approach, where usually only a single geometry on the PES is used for obtaining a vibrational spectrum, IR, and Raman intensities are calculated from the derivatives of the electric dipole moment and electric dipole–electric dipole polarizability, respectively, along the vibrational normal modes. This approach is only tractable for gas-phase molecules or small bulk systems. In the presence of solvent molecules in condensed phase systems, finding minima of the PES is often challenging, or the number of local minima can be vast because of the complex interactions of hydrogen bonds and flexible solvent molecules that can undergo large rearrangements along a shallow PES. In such cases, the calculation of vibrational normal modes becomes increasingly demanding with the size of the system. Additionally, the problem of choosing an appropriate point in configuration space for calculating the Hessian matrix arises. Moreover, using a temperature of 0 K as done in the static approach does not provide a realistic description of the system studied at finite temperature. It is possible to use a Boltzmann distribution to average the contributions of different conformers, corresponding to local minima on the PES, for the calculation of vibrational spectra.93 Geometry optimized snapshots from FPMD trajectories can generally be used to calculate multiple spectra from the Hessian matrix that are averaged afterwards,94 but we limit the discussion in this review to spectra obtained from time-correlation functions.
2.2.2 Time-correlation functions
2.2.3 Mode assignment
The needed molecular properties are usually determined on the fly using an electronic structure method. While the energy contributions, temperature, and nuclear velocities are always accessible during the MD run, the analysis of spectroscopic observables can be carried out either during the FPMD run, using the same electronic structure code for the calculation of properties, or after the trajectory has been generated, by using specialized software108, 109 or other electronic structure codes, taking the nuclear structures of the FPMD run as input for property calculations.
2.2.4 Condensed phase systems
FPMD can provide more accurate results than static calculations, in the gas-phase, by taking into account the effects of the PES shape, temperature, and applied pressure110 on the frequencies, intensities, and associated line shapes.73, 111 The more important application though lies in the investigation of more complex systems. In the case of solvent/solute systems, it is well known that the simulations have to include a sufficient number of solvent molecules. In the QM/MM method, the solvent molecules are usually treated explicitly using a force field, while the solute is treated by a QC method. Alternatively, a solvent continuum model offers a computationally cheap treatment of such systems by including the dielectric constant of the solvent.112 However, this approach usually cannot replicate the crucial dynamics of hydrogen bonding networks of the solvent,113-115 although extensions of continuum models to include such interactions have been published.116
In the treatment of liquids and solid state compounds, periodic boundary conditions can be introduced to avoid artifacts arising from surface effects at the edges of the simulation cell. This poses an additional difficulty because the periodic boundary conditions can introduce unphysical jumps of interatomic distances in the postprocessing of trajectories and the position operator is ill-defined (see Section 3.1 for more details).
3 TYPES OF SPECTROSCOPY
3.1 Infrared absorption
Applications of FPMD for IR spectra range from gas-phase molecules to solid/water interfaces. Marx et al. have studied the THz range of IR spectra focusing on water in bulk124, 125 and simpler environments. Instead of investigating liquid bulk water, it is possible to consider microsolvated systems such as Zundel cations126 or HCl–water aggregates including up to six water molecules127 shedding light onto the influence of individual ions on the IR spectra, while keeping the computational cost low. The protonated methane ion CH5+ is another prototypical system for FPMD since it exhibits a very anharmonic PES compared to the harmonic PES of methane. Studies in gas-phase75, 128 have elucidated the structure and spectroscopic signature thereof. The difficulty of describing the protonated methane ion arises from a flat PES that allows for motions with large amplitudes. In a microsolvated system, the effects of tagging the methane ion with up to three H2 molecules have been elucidated, and individual vibrational modes have been identified.129
It has been shown that it can be beneficial to average trajectories from different initial configurations to increase coverage of the PES further. The (R)-butan-2-ol molecule has nine local minima on the PES, corresponding to different conformations. By averaging nine gas-phase trajectories corresponding to the possible monomer conformations, IR spectra in excellent agreement with experiments are obtained.130 The sampling of the PES, parameterized by two dihedral angles, is reproduced from the study of Kirchner et al. and shown in Figure 1.

In the mentioned study of (R)-butan-2-ol130 a temperature of 400 K has been applied to enhance the sampling, and the study compares results for the bulk and gas-phase, showcasing the influence of intermolecular interactions on IR and VCD spectra. Such an exploration of the conformation space is most important in gas-phase calculations, where no intermolecular energy exchange can mediate conformational changes, which are naturally present in bulk calculations. As a result, an FPMD run in gas-phase will tend only to sample a single conformer.130 The group of Gaigeot studied flexible polypeptides in the gas-phase extensively, computing, among others, IR spectra of a protonated polypeptide. It was shown via FPMD that these peptides do not possess well-defined structures and instead explore the conformational space in contrast to the more rigid butan-2-ol molecule mentioned earlier.131 The spectrum retrieved for the Ala-Ala-H+ ions, obtained by means of a CPMD simulation and from experiment is shown in Figure 2. The simulated spectrum qualitatively matches the experimental one and a mode assignment of the bands was carried out by analyzing the obtained VDOS.131 Other studies focused on the isomerization dynamics of biological systems in gas-phase by applying CPMD.132-134 Further proving a point often made, it has been shown by Gupta et al. that a microsolvated system does not replace a fully periodic treatment in the bulk-phase because even solvation shells far away from the solute can influence the spectroscopic signature in the THz region.135

FPMD has also been applied to study the influence of the solvation shells on THz spectra around small ions136 as well as small molecules such as glycine124, 137, 138 and trimethylamine N-oxide (TMAO) in solution.139 Furthermore, the influence of the pressure exerted on liquid water was studied recently in this context by applying pressure as an external constraint in THz spectra calculations.125 Further, Marx et al. studied the liquid bulk water THz spectrum140 including solvated Na+ and Cl− ions.141 The group of Gaigeot studied flexible polypeptides in solution,142, 143 demonstrating that DFT-MD is an appropriate tool for reproducing and understanding experimental results. A combination of static and dynamical IR spectroscopy144 was also successfully applied to the silica/water interface, characterizing the nature of water in the hydration layer on the surface.145
One often-used way of investigating the origin of spectroscopic signals are localized MOs (e.g., Pipek–Mezey localized functions) in case of nonperiodic systems. The counter part for periodic systems are MLWFs. By localizing the Wannier functions and computing the expectation values of the position operator for each Wannier center, positions and thus also velocities can be assigned to the orbitals. Via the classical connection of the electric dipole moment with positions and charges, local dipole moments can be assigned to each Wannier function.146 Marx et al., for example, decomposed the total electronic contributions to the electric dipole moment of zwitterions into contributions from molecular fragments via MLWFs. In this way, they were able to split the dipole–dipole correlation function into autocorrelation and cross-correlation terms, elucidating which modes are coupled and which modes are independent.127 The same approach has been applied for the mode assignment in the H2 tagged CH5+ ion using MLWFs and was called the maximum entropy method.129
Another way of separating the spectroscopic signals was applied by Luber and is based on periodic subsystem DFT. In this approach, the ill-definition of the position operator in periodic systems can be circumvented by defining subsystems in the total investigated system efficiently using atom-centered basis functions. The electron density is then separated, and a formula for corresponding subsystem electric dipoles was derived. This approach allows for a computationally cheaper treatment while offering the separation of dipole moments and even IR intensities into intra-atomic and inter-atomic contributions as long as the subsystem DFT embedding performs reasonably.146 A recent extension of the formula of local electric dipole moments along this line is based on following the trajectories of Wannier centers during an FPMD run instead of subsystem DFT.147
3.2 Vibrational circular dichroism
The chiral equivalent of IR spectroscopy is VCD148-150 which relies on the different absorption of left and right circularly polarized light by chiral molecules. Its origin lies in the angular momentum of a photon and the interaction thereof with the sample.
In the adiabatic approximation, the ground state electron density is real-valued, whereas the magnetic dipole operator is imaginary and thus vanishes in standard calculations. First-principles methods for retrieving the AATs or magnetic dipoles for the calculation of VCD spectra are the magnetic field perturbation (MFP), which has only been applied in static calculations employing the harmonic approximation150, 153-155 up to now, and only for nonperiodic systems, and nuclear velocity perturbation (NVP) which has also been applied in FPMD simulations.156-160
VCD implementations should take care of the origin dependence due to the definition of the magnetic dipole operator, which results in the definition of a gauge.161 In the common origin gauge, the same origin of the magnetic dipole operator is applied in all response calculations. In the distributed origin gauge, on the other hand, the origin of the magnetic dipole operator is shifted to the nuclear coordinate in each response calculation. In practice, both gauge choices result in identical VCD spectra, and the AATs calculated in one gauge can be translated to the other.155 In the MFP approach, this problem has been solved by utilizing gauge-including atomic orbitals (GIAO), which has resulted in gauge independent rotational strengths in static DFT calculations.155, 162 In the NVP approach, two strategies have been employed to approach this issue.152, 161 MLWFs lead to the definition of local magnetic dipole moments and are related closely to the modern theory of polarization via the definition of the Wannier centers in terms of the Berry phase.152 Jähnigen et al. instead obtained an explicitly gauge-invariant expression for time-correlation VCD spectra by enforcing the nearest image convention on the time-derivative of the electric dipole moment evaluated in the different periodic images of the simulation cell.161
Scherrer et al. implemented the NVP as proposed by Nafie151 in DFT using a plane-wave basis in the CPMD code.160 They subsequently applied the implementation to the FPMD trajectory of a series of model systems, including (S)-d2-oxirane and (R)-propylene-oxide in the gas-phase and solution.152 In this way, it is possible to treat solvent effects such as the chirality transfer from a chiral molecule to an achiral solvent, taking into account the dynamics of the system.152
In another work, the molecular crystal of l-alanine was studied, showing that the coherent vibrations of the solid-state structure enhance the VCD signal, especially in the carbonyl region, reproducing the experimental results well.163 The solvent effects on (S)-1-indanol in nonpolar CCl4 and polar DMSO have been investigated by Barbu-Debus et al. in a combined static and dynamic study.164 An assessment of cluster models, static calculations, classical MD and FPMD was carried out using the flexible (1S,2S)-trans-1-amino-2-indanol molecule in DMSO solution.165 The NVP approach was also extended to fully periodic calculations and applied to the molecular crystal of (1S,2S)-CHDO161 and used to investigate the hydrogen bonding network in an aqueous solution of lactic acid.166
The reference point has been set to the system's center of mass to reduce the origin dependence. In this approach, the electron density is stored at each time step so that finite differences with respect to the time propagation can be applied afterwards.169 They applied their method to the IR and VCD spectra of (R)-2-butanol solution in carbon disulfide130, 169 and glucose in an ionic liquid.170
3.3 Raman

In one of the earliest FPMD studies of Raman scattering, Putrino et al. applied DFPT182 and the Berry phase approach for polarization to the trajectory of ice at different high pressures. In this way, they were able to reproduce the Raman spectra associated with the phase transitions of water in the GPa range, also showing the effects of anharmonicity.183 Similar studies were carried out with a naphthalene crystal, including additionally van der Waals corrections.184
The first Raman FPMD study of the dynamics of liquid water by Wan et al., containing 64 heavy water molecules at 400 K, employed the MLWF approach to decompose the calculated Raman intensities. In this way, the differences of intramolecular and intermolecular contributions and the vibrational signature of intermolecular charge fluctuations were made apparent.179 Luber et al. applied DFPT to liquid (S)-methyl-oxirane and derived a decomposition of the Raman signal into intramolecular and intermolecular contributions exploiting atom-centered basis functions with the mixed Gaussian and plane-wave approach.180 In a study by Mattiat and Luber, a first Raman excitation profile, including the nonresonant and resonant parts for liquid (S)-methyl-oxirane, was calculated in one go by making use of RTP.177 Brehm and Thomas applied RT-TDDFT in a bulk study of uracil in aqueous solution, calculating the electric dipole derivatives with respect to time by Voronoi integration.175
An FPMD study of surface-enhanced Raman scattering employing linear-response TDDFT has also been reported by Aprà et al. The authors set up multiple FPMD trajectories of two aromatic thiols adsorbed to a silver substrate. For each of the trajectories, the molecule was tilted by a different angle (in steps of 20°) and first optimized. By running FPMD simulations starting from each of the structures and computing the corresponding Raman spectra, the authors showed that they could extract the orientation of the molecule relative to the surface, matching the results of X-ray experiments.185
3.4 Raman optical activity
The chiral variant of Raman spectroscopy is ROA spectroscopy. The calculation of vibrational ROA spectra requires in the general case three polarizabilities: the electric dipole–electric dipole polarizability tensor α, the electric dipole–electric quadrupole polarizability tensor A, and the electric dipole–magnetic dipole polarizability tensor G′.
Different methods for the calculation of ROA spectra based on FMPD have been proposed. The first FPMD implementation of ROA used the DFPT approach and was reported by Luber and applied to an S-methyl-oxirane molecule in the gas-phase101 (see Figure 4). Gauge independent results were obtained by using the velocity representation for the electronic part of the electric dipole moment.101

A more approximate and computationally straightforward approach, proposed by Brehm and Thomas, relies on the classical approximation of these tensors via Voronoi tessellation and a finite differences derivative with respect to an external electric field for obtaining the ROA invariants. The method was applied to a bulk system of liquid (R)-propylene oxide. In order to minimize the gauge dependency, the origin of the electric moments was set, as an approximation, to each molecule's center of mass in this study.100
3.5 Sum frequency generation
Sulpizi et al. have shown that the vibrational SFG spectra can be decomposed using MLWFs to extract the contributions of individual molecular electric dipole moments and polarizability tensors.190, 191 For this purpose, the polarizability and electric dipole moments are calculated (classically) from the positions of the Wannier centers, applying a finite differences formula to a weak external field. In their study, they investigated the air–water interface and were able to qualitatively reproduce the experimental results reported for this system—despite a relatively small system size of 128 water molecules, short MD trajectory of 3 ps and neglect of cross-correlation terms between the individual molecules. The computational cost of such studies makes a wide application still very challenging because long trajectories have to be considered while also applying an explicit electric field to the model when using finite differences.190 Liang et al. devised another scheme for reducing the computational cost of SFG simulations of interfacial water. The polarizability tensor was decomposed into molecular polarizabilities of individual water molecules using MLWFs and CCSD calculations. For this purpose, the system was split into a single water molecule and point charges at the Wannier centers of the remaining water molecules. The process was then repeated for each water molecule, and the molecular polarizability was computed at the CCSD level for each water molecule, in an approach akin to embedding.192 The resulting SFG spectrum is shown in Figure 5.

Instead of relying on a finite differences formula including an applied external electric field, DFPT can be applied to calculate SFG spectra.193 This was demonstrated for a FPMD SFG calculation of a solid–gas interface, calculating the polarizability tensor using DFPT. Another advantage is the higher accuracy and reduced computational cost because more calculations are needed when applying finite differences.193 DFPT has been applied to a layer of acetonitrile molecules adsorbed to a rutile (110) TiO2 surface, investigating the asymmetric methyl stretching vibration.193 Wan et al. employed the DFPT treatment for the SFG simulation of a periodic ice system,179 also going beyond the standardly used electric dipole approximation by determining the electric quadrupole contributions using MLWFs and decomposing the spectrum into bulk and interfacial contributions.194
In the quest for understanding water surfaces at the molecular level in even more detail, the partitioning of water surfaces into the binding interfacial layer (BIL), the diffuse layer (DL), and the bulk offers another way of understanding the complex hydrogen bonding network.195 Pezzotti et al. have provided an unambiguous definition of these layers in simulations from the intrinsic structural properties of the water model. The DL structure is adopted by liquid water influenced by interfacial charges, while the BIL is specific to each interface. For this reason it is advantageous to define the SFG spectrum in terms of two contributions, that is, . This approach was applied to bulk water, the air/water interface including KCl, and a α-quartz/water interface196 and the resulting SFG spectra are reproduced in Figure 6. In the same study, the third-order susceptibility χ(3) was additionally calculated from the electric field resulting from the surface charge. The approach was extended to extract information on the surface potential and charge, the isoelectric point, electric double layer formation, and the relationship between pH and surface hydroxylation state in α-quartz.197 The controversial silica/water interface was studied using the same formalism198, 199 also including THz spectroscopy as an additional probe into the structure of the interface.200

Similar to the calculation of Raman spectra,186 the Wannier polarizability method has been applied for on-the-fly simulations of vibrational SFG spectra of the air/water interface.207 While the authors in the studies mentioned above computed the time-averaged vibrational SFG spectra, it is also possible to calculate time-resolved SFG spectra via a pump-probe scheme,208 although to our knowledge, this has not been carried out in a fully first-principles framework.
3.6 Nonequilibrium molecular dynamics
By applying more than one pulse to the investigated system, 2D and time-resolved spectra can be observed. In the frequency domain, a tunable narrow-band IR pump pulse excites the vibrational states, while another broadband IR probe pulse can then be used to record a spectrum for each pump frequency.211, 212 In this way, absorption spectra can be resolved in two dimensions, allowing to resolve the diagonal peaks at equal pump and probe frequency—corresponding to the 1D-IR spectrum—as well as cross-peaks due to coupling of different vibrational modes.211, 212 Sequences with more than two pulses have been used to retrieve even more detailed information, including the time evolution of vibrational dynamics on ultra-fast time scales.213, 214 A first pulse creates vibrational coherence in the system while the subsequent pulses after tunable delays disturb the coherence and finally reveal information on occurring coherence, population transfers, and energy relaxations.215 In complex biological systems such as peptides, the 2D-IR peaks also make it possible to extract structural information and to distinguish conformers.212 A procedure for transient vibrational spectral analysis, combining FPMD and time-integrated modes obtained by means of TDDFT, has been presented, allowing to study the temporal evolution of vibrational signatures that follow from electronic excitations.216 This approach was also applied to transient Raman spectroscopy,217 providing a direct route to assigning vibrational motion to photodynamics.
There have been two main approaches to calculate such spectra in theoretical studies: the vibrational modes of interest can be excited in a targeted, albeit “classical” way. In this scheme, additional kinetic energy contributions are applied to atoms involved in the vibrational modes of interest, modeling the excitation arising from a pump pulse without having to explicitly treat the different pulses necessary in experimental setups.218
Alternatively, the third-order response functions can be calculated from first-principles by evaluating Equation (43). FPMD studies of 2D-IR spectroscopy have only emerged in the past decade, while previous studies sometimes combined classical trajectories with first-principles methods to calculate electronic properties. The classical MD studies have focused mainly on spectra obtained from bulk water.11, 215, 219 The nonlinear response functions, which are functions of the time delays and pulse frequencies, are 2D-Fourier transformed to retrieve the frequency correlation spectra.220 Only recently, DFT-MD for 2D-IR spectroscopy has been applied for photon echo spectroscopy. The relevant observables are then determined from FPMD trajectories combined with the previously mentioned wavelet approach, while the explicit calculation of the third-order response function gives the time dependence of the observables.220 This approach was applied to elucidate the hydrogen bonding networks in neat D2O,220 solvation shells of ions221 and small molecules in solution222, 223 including the temperature dependence of the dynamics.224
Nagata et al. applied the reverse nonequilibrium molecular dynamics (RNEMD) in an FPMD framework218 for studying the dynamics of bulk water. In this scheme, the O–H stretch vibrations are excited, and then the excess velocity correlation function is determined from the difference of the RNEMD and equilibrium MD trajectories. In this way, the effect of delocalized OH stretch modes on the spectral diffusion has been studied, as well as the contributions of intramolecular and intermolecular energy transfers by tracking specific OH chromophore subgroups.225 In the same NE-FPMD framework, the interfacial OH stretching modes on a water-covered α-alumina (0001) structure were excited, elucidating the coupling to surface phonons and the water adlayer.226 The corresponding VDOS at different time delays is shown in Figure 7. Furthermore, the air–water interface227 and the difference of OH stretching energy relaxation in neat bulk water and deuterated bulk water were investigated in the NEMD framework.228 Similarly, by classically exciting the vibrational states, Lesnicki and Sulpizi investigated the energy relaxation via FPMD in bulk water229 and at the CaF2/water interface.230

4 CONCLUSION AND OUTLOOK
We reviewed the theoretical foundations of FPMD and various vibrational spectroscopies from FPMD, among them, 1D absorption spectroscopy in the non-chiral and chiral variant (IR and VCD), chiral and non-chiral 1D scattering spectroscopies (Raman and ROA) as well as 2D (SFG) and nonequilibrium spectroscopies (pump-probe).
The essential components for FPMD spectroscopy are, the power spectrum obtained from trajectories that need to correctly and sufficiently sample the configuration space using accurate force calculations, and the intensities that usually result from the same electronic structure calculation as used for the forces. The growing availability of high-performance computing facilities and the improvement of consumer-grade hardware in recent years have enabled more groups to collect FPMD trajectories. The analysis of power spectra has been aided by identifying vibrational modes and assigning them to spectral bands,96, 104 whereas MLWFs,47 subset embedding approaches,146 and other local (polarization) methods180 are powerful tools for the in-depth analysis of obtained intensities.
While the power spectra and vibrational absorption spectra can be generated rather cheaply, advances in theoretical methods have opened the possibilities for investigating more kinds of spectroscopies and systems. This includes the recently published implementation of FPMD NVPT for VCD,159 the application of perturbation theory and real-time propagation techniques for (resonance) Raman and ROA spectra,175-177 and the application of perturbation theory and decomposition techniques for SFG spectra.190, 192, 193 Nonequilibrium MD simulations, including 2D IR spectroscopy, pump-probe schemes, and stimulated photon echo studies, have been studied only recently.218, 223 The electronic properties beyond the electric dipole moment have been obtained mainly by applying simple finite difference formulas, DFPT, or real-time and linear response TD-DFT. In principle, more advanced techniques or other electronic structure methods besides DFT can still be employed to retrieve accurate electric and magnetic properties of the studied systems. Worth mentioning is also the simplified Tamm–Dancoff (TD)-DFT formulation that allows for a speed-up of calculations by several orders of magnitude in some cases.231
In the other direction, approximative or more classical methods such as Voronoi tessellation100, 169, 187 or the treatment of Wannier centers as point charges146, 147, 152, 186 have been applied to reduce the computational cost. Additionally, specialized analysis methods have emerged that circumvent some of the complexity arising from high-level QC calculations. Among those is the Wannierization approach186 for the calculation of polarizabilities in the context of Raman spectra.
The wavelet approach allows for the calculation of time-dependent spectra.117, 232 This approach has already proved invaluable in other fields of science and offers great advantages also in the context of FPMD. A more empirical but useful approach lies in leveraging our understanding of vibrational modes in many materials to focus analyses on the main contributors to spectroscopic signals. Ohto et al. specified the widely used time-correlation functions to the OH stretching motions at water surfaces and coined the term of surface-specific velocity–velocity autocorrelation functions (ssVVAF) for this purpose in SFG simulations.201
Many improvements can still be made. This includes the calculation of higher-order response functions for multidimensional spectroscopy220 as well as tweaks to the BO-FPMD scheme as, for example, Nagata et al. have proposed for their reverse nonequilibrium molecular dynamics (RNEMD), that allows for the excitation of vibrational states in a direct way.218, 233 The acceleration of FPMD simulations is also an important goal since it will allow to study larger systems, for longer trajectories that ensure better convergence of the accumulated spectra and will make it possible to apply more expensive electronic structure methods for retrieving properties. Polack et al. for instance, recently published an efficient method for extrapolating the density matrix guess between MD steps for faster convergence, especially with tight convergence requirements.234 Other extrapolation techniques have been proposed, for instance, by Herr and Steele.235 Another example for accelerating FPMD simulations is the use of multiple time steps.236 Second-generation CPMD is another approach that combines aspects of CPMD and BOMD to extend the size of possible simulations,3 and linear scaling DFT offers an important way of investigating systems that are orders of magnitude larger than standard calculations.237 Hybrid functionals for DFT have rarely been employed for calculating vibrational spectra via PFMD because of the large computational cost associated with evaluating the exchange-correlation potential. Advances such as the auxiliary density matrix method (ADMM),238 the resolution of identity approach239 or machine learning could provide speed-ups making the use of these functionals viable.
Machine learning is a field of research that has attracted significant interest recently, also for applications to vibrational spectroscopy. Machine learning approaches have been applied to calculate electric dipoles efficiently for the calculation of IR spectra240-243 also in bulk-phase systems.244 In combined schemes, the electric dipole and molecular polarizabilities have been obtained243, 245-247 and machine learning can generally be applied to accelerate the MD simulations themselves (see, e.g., Refs [242, 248, 249]) and to generate accurate force fields.250 Since, to the best of our knowledge, machine learning has only been used for IR and Raman FPMD simulations, there is still plenty of room to advance other FPMD spectroscopies as well.
AUTHOR CONTRIBUTIONS
Edward Ditler: Methodology (equal); visualization (lead); writing – review and editing (equal). Sandra Luber: Methodology (equal); resources (lead); supervision (lead); visualization (supporting); writing – review and editing (equal).
ACKNOWLEDGMENTS
This work is supported by the University of Zurich. Open Access Funding provided by Universitat Zurich.
CONFLICT OF INTEREST
The authors have declared no conflicts of interest for this article.
RELATED WIREs ARTICLES
Open Research
DATA AVAILABILITY STATEMENT
Data sharing is not applicable to this article as no new data were created or analyzed in this study.