Exploration and Optimization of the One- and Many-particle Basis Sets in Modern Explicitly Correlated Electronic Structure
| dc.contributor.author | Powell, Samuel Ryan | en |
| dc.contributor.committeechair | Valeyev, Eduard Faritovich | en |
| dc.contributor.committeemember | Crawford, Thomas Daniel | en |
| dc.contributor.committeemember | Mayhall, Nicholas | en |
| dc.contributor.committeemember | Welborn, Valerie | en |
| dc.contributor.department | Chemistry | en |
| dc.date.accessioned | 2026-02-17T09:00:08Z | en |
| dc.date.available | 2026-02-17T09:00:08Z | en |
| dc.date.issued | 2026-02-16 | en |
| dc.description.abstract | Basis set incompleteness error (BSIE) remains one of the largest sources of error in modern electronic structure calculations. Despite the many development producing methods that circumvent the issue by using grids or real-space methods, the linear combination of atomic orbitals (LCAO) approach using Gaussian atomic basis sets is still the most popular method for the simplicity of the underlying technology and the low computational cost. Gaussian basis sets require either basis set extrapolation or the use of explicitly correlated wavefunctions that address the deficiencies of standard determinantal expansions at short interelectron distances in order to yield accurate electron correlation energies in molecules. Practical and robust explicitly correlated F12 methods require the use of standard or specialized atomic orbital (AO) basis sets that include diffuse AOs, even for neutral species. Although modern reduced-scaling formulations of explicitly correlated many-body methods have become routinely applicable to systems with hundreds of atoms, application of F12 methods to large molecular systems can be severely hampered due to the onset of poor numerical conditioning brought on by the presence of diffuse AOs in the F12-appropriate orbital basis sets. The development of practical explicitly correlated methods during the 1980s and 1990s was greatly accelerated in the following decade, consisting of a number of significant and often unrelated breakthroughs published by a number of researchers. The current state of these methods is an amalgam of the best approximations and adaptations of many of these contributing research directions. The determination of appropriate basis sets and development of basis sets specifically designed for F12 methods lead to empirical observations about the ideal composition of these basis sets, many of which remain relatively unexplored. The prodigious use of such methods in the modern context, especially for less well-behaved systems or for pushing the limits of attainable accuracy for thermodynamic data have revealed areas for growth in the most efficient use of basis sets for explicit correlation. First, we reexamine the optimal geminal exponent used in the correlation factor most widely used among explicitly correlated methods. These Slater-type F12 geminal lengthscales, originally tuned for the second-order Møller-Plesset F12 method are too large for higher-order F12 methods formulated using the SP (diagonal fixed-coefficient spin-adapted) F12 ansatz. The new geminal parameters reported herein reduce the basis set incompleteness errors (BSIE) of absolute coupled-cluster singles and doubles F12 correlation energies by a significant — and increasing with the cardinal number of the basis — margin. The effect of geminal reoptimization is especially pronounced for the cc-pVXZ-F12 basis sets (those specifically designed for use with F12 methods) relative to their conventional aug-cc-pV$X$Z counterparts. The BSIEs of relative energies are less affected but substantial reductions can be obtained, especially for atomization energies and ionization potentials with the cc-pVXZ-F12 basis sets. The new geminal parameters are therefore recommended for all applications of high-order F12 methods, such as the coupled-cluster F12 methods and the transcorrelated F12 methods. The geminal exponent relates to the many-particle basis, but the composition of the single-particle atomic basis is also important, specifically the use of diffuse AOs. Here we re-examine why diffuse AOs are necessary for application of F12 methods. To this end we develop a dual-basis formulation of traditional and F12 coupled-cluster singles and doubles (CCSD) methods in which the occupied and virtual (correlating) orbitals were expanded in separate AO basis sets. This indicates that the diffuse AOs are fundamentally important for the traditional (non-F12) description of dynamical correlation; the necessity of diffuse AOs in F12 calculations arises indirectly due to the dramatic reduction of the basis set error by the F12 terms such that the error due to the lack of diffuse AOs becomes comparable to the residual basis set incompleteness. The dual-basis CC methods are suggested as an important candidate formalism for accurate (in particular, F12) reduced-scaling many-body methods in molecules. | en |
| dc.description.abstractgeneral | The growth of computer technology, in terms of increased computational power and reduced cost of computations, has opened up a great variety of avenues for the advance of scientific progress. In the field of chemistry, one important advance has been the growth of computational simulation methods that enable more rapid development of chemical technologies such as new drugs and functional materials. While physical experiments are still central to the scientific community's understanding of important chemical systems and their behavior, the use of computational simulation provides important insights and are a valuable tool for accelerating discovery and functional optimization. The rapid increase in the availability and power of computers has lead to the use of computational studies for drug docking, investigating materials for quantum computing, and exploring technologies related to environmental rehabilitation technologies. Computational methods can be used to screen molecules for desirable properties at a scale and speed unattainable for laboratory experiments and are sometimes capable of probing sophisticated quantum mechanical behaviors more precisely than experiments. The growth of the capabilities of computational methods has been driven by a broad and active research community over nearly a hundred years. While the quantum mechanical rules governing chemical behavior are well known, the resulting equations needed to evaluate chemical properties are too complex to be solved directly. The field of theoretical chemistry is largely focused on finding efficient and helpful approximations to achieve maximal accuracy with minimal cost and accurate physics. The use of basis sets composed of simple functions approximating atomic behavior simplifies the problem from a complex 3-$N$ dimensional problem, where $N$ is the number of electrons in the system, to a much simpler linear algebra problem. With a finite set of these functions (finite basis set), there is always some error. In fact, the error associated with a finite and imperfect basis set is one of the largest sources of error in modern computational chemistry calculations. The size and composition of the atomic basis sets is only one direction of research in computational chemistry, it is an important one. During the 1980s, 1990s, and 2000s a class of methods, called F12 theory or explicit correlation, were developed that used a more sophisticated approximation to the model of the atomic system that explicitly involves the distance between each pair of electrons. The F12 methods were designed to accelerate basis set convergence, and they fulfill that purpose remarkably well. The particular structure and implementation of the F12 method varies, and many developments were made over the most active time of research in the field between 2000 and 2012, but a modern convention has emerged over time, with well-defined lengthscale parameters of the F12 interelectron interaction coupled with the use of basis sets including long-range diffuse functions. Herein we report the reoptimization of the lengthscale attenuation parameter, $beta$, for application to modern methods. This reoptimization was necessary, as the lengthscale parameters were originally optimized for older, less accurate computational methods that are not as widely used for high-accuracy calculations. The new recommended parameters suggested here can reduce the basis set incompleteness error by several times for modern methods with some basis sets. We present results representing some of the most accurate nonextrapolative results to date. Additionally, we explore the role of diffuse functions in F12 methods. We report a dual-basis implementation of the coupled cluster method with singles and doubles, a common correlation method. The dual-basis approach allows us to separate the action of diffuse functions on the mean-field orbitals, correlation energy, and explicit correlation energy, showing that diffuse functions can be safely relegated to the correlation portion only, significantly reducing the cost of the computation and improving numerical conditioning. The dual-basis approach is shown to be a useful method to compute highly accurate energies with better numerical conditioning, with applications to difficult systems such as those with strained bonds and solids. | en |
| dc.description.degree | Doctor of Philosophy | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:45537 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/141277 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | Creative Commons Attribution-NonCommercial 4.0 International | en |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | en |
| dc.subject | electronic structure | en |
| dc.subject | basis set | en |
| dc.subject | explicit correlation | en |
| dc.title | Exploration and Optimization of the One- and Many-particle Basis Sets in Modern Explicitly Correlated Electronic Structure | en |
| dc.type | Dissertation | en |
| thesis.degree.discipline | Chemistry | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | doctoral | en |
| thesis.degree.name | Doctor of Philosophy | en |
Files
Original bundle
1 - 1 of 1