Macroscopic and Microscopic pKa

by Corin Wagen · Aug 11, 2025

"Analogy of the Microcosm and Macrocosm of Alchemy" from Opus Medico-Chymicum, Johann Daniel Mylius (1618)

If you've taken a general or organic chemistry class, you've almost certainly run into the concept of pKa: the negative logarithm of the acid dissociation constant, Ka. The pKa of a molecule is a way to quantify how acidic or basic it is, which in turn can be used to predict its stability, reactivity, and ionization state in a variety of downstream environments.

For simple monoprotic acids and bases, a single pKa value is all you need to fully characterize a molecule's response to changing pH. But almost all complex molecules—like amino acids, nucleotides, peptides, polycarboxylic acids—and drugs, have multiple ionizable sites. This is where the distinction between macroscopic and microscopic pKa values becomes crucial.

In this post, we'll explore the difference between these two pKa-prediction paradigms, when they break down, and when to use each method.

Definitions

The "textbook" acid–base equilibrium for a monoprotic acid HA is:

HAA+H+\text{HA} \rightleftharpoons \text{A}^- + \text{H}^+

The equilibrium constant is:

Ka=[A][H+][HA]K_a = \frac{[\text{A}^-][\text{H}^+]}{[\text{HA}]}

and

pKa:=log10KapK_a := -\log_{10} K_a

In this situation, there's only one proton that can be lost, so the pKa you measure is unambiguous. Each of the above species, HA and A-, is a "microstate" which corresponds to a given arrangement of protons on the molecule. A molecule with one ionizable site has two protonation microstates, a molecule with two ionizable sites has four protonation microstates, and more generally a molecule with NN ionizable sites has 2N2^N protonation microstates.

For molecules with more than one microstate, different forms of pKa become possible:

The two definitions coincide only for a monoprotic solute with a single dominant tautomer at each charge.

Macroscopic and microscopic pKa prediction strategies compared. Individual circles represent microstates. Figure from our macroscopic pKa work.

Almost all experimental pKa measurement techniques—like NMR spectroscopy, potentiometric or colorimetric titration, and UV-Vis-spectroscopy-based methods—measure macroscopic pKa values. This is because these measurements are bulk measurements, averaged over all molecules in solution, and thus record the average protonation state of the molecule and not the specific location of protonation or deprotonation. Measuring microscopic pKa values experimentally is possible but typically requires unique and well-resolved signals for individual microstates.

Case Study: Glycine

Glycine, the simplest amino acid, has two ionizable sites: a carboxylic acid and an amine. This gives rise to four microstates across three charge states (+1, 0, and -1). There are four microscopic pKa values for glycine:

The microscopic pKa values for glycine.

Microscopic pKa values for glycine; figure from Zheng et al (2024).

These microstates can be combined into three protonation states:

  1. Fully protonated: H2A+ (left microstate above)
  2. One proton lost: HA(1) and HA(2) (center microstates above)
  3. Two protons lost: A- (right microstate above)

The two macroscopic equilibria, which can be measured experimentally, are:

  1. H2A+HA+H+pKa(1)2.4\text{H}_2\text{A}^+ \rightleftharpoons \text{HA} + \text{H}^+ \quad\quad pK_a^{(1)} \approx 2.4

  2. HAA+H+pKa(2)9.8\text{HA} \rightleftharpoons \text{A}^- + \text{H}^+ \quad\quad pK_a^{(2)} \approx 9.8

Notice that the "HA" species in the middle is actually a mixture of two different microstates: one with the acid group deprotonated, and one with the amino group deprotonated. The macroscopic pKa value treats them as a single "bucket" containing all species with the same net charge.

In this case, the zwitterionic neutral microstate dominates and so the microscopic pKa values corresponding to proton gain/loss from the zwitterionic microstate become the observed macroscopic pKa values. Trying to predict the experimental pKa values from the neutral (canonical) microstate, in contrast, is a fool's errand—the microscopic pKa values will not only be quantitively wrong (4.3 and 7.6, not 2.4 and 9.8) but also qualitatively incorrect, since the location of the "acidic" and "basic" sites will be backwards.

Case Study: Piperidine

Things become more complex when a molecule doesn't have a single dominant microstate. Piperazine is a symmetric diamine with experimental macroscopic pKa values of 5.35 and 9.37 (ref). But starting from the neutral microstate, a microscopic pKa calculation predicts identical pKa values of approximately 9 for the two amines:

This makes a certain amount of sense: protonation of either amine generates the same symmetric monoprotonated structure. But this calculation does not match the experimental data at all, because it never even considers the diprotonated microstates (which is two seperate protonation steps away from the starting structure). Only by running a second calculation from the output monoprotonated structure can we find a pKa value that's even close to the experimental value of 5.35:

In contrast, a macroscopic pKa calculation automatically takes into account all microstates and predicts values of 5.64 and 9.46, in good qualitative agreement with experimental values:

In this case, it's clear that macroscopic pKa prediction is necessary for sane results.

When To Use Each Method

To summarize, macroscopic and microscopic pKa calculations are answering different questions:

If you want to compare computed pKa values to experimental measurements, you should almost always be using macroscopic pKa values—for all but the simplest cases, these are significantly more physically accurate and reliable. Particularly for large druglike molecules with dozens or hundreds of potential microstates, it's almost impossible to get meaningful agreement with experimental data when using microscopic pKa calculations.

However, macroscopic pKa calculations have their downsides. The predicted pKa values can no longer be assigned to a single functional group, making them less interpretable and chemically intuitive. For many use cases—like tracking the protonation of specific moieties or residues—microscopic pKa values may be more useful. Macroscopic pKa calculations are also more complex and frequently more costly to run, since the number of microstates grows exponentially with the number of ionizable sites. (We've spent a considerable amount of time and energy optimizing our implementation of macroscopic pKa here at Rowan—smart beam-search-based microstate pruning, ML-based scoring functions, and so on—and it's still quite slow for large molecules.)

Regardless of which paradigm is better for your scientific question, knowing the difference between macroscopic and microscopic pKa is crucial.

Further Reading

Banner background image

What to Read Next

Macroscopic and Microscopic pKa

Macroscopic and Microscopic pKa

Two different ways to calculate acidity, what they mean, and when to use them.
Aug 11, 2025 · Corin Wagen
Modeling Thia-Michael Reactions

Modeling Thia-Michael Reactions

In which the addition of a thiolate to an enone proves to be unexpectedly difficult to model.
Jul 25, 2025 · Corin Wagen
API v2, New BDE Methods, MCP, And More

API v2, New BDE Methods, MCP, And More

new API philosophy; streamlined interfaces for workflows; using NNPs and g-xTB to predict bond strength; an MCP server; .sdf files; benchmarking protein–ligand interactions; Diels–Alder visualizations
Jul 21, 2025 · Spencer Schneider, Corin Wagen, Ari Wagen, Jonathon Vandezande, Ishaan Ganti, and Isaiah Sippel
ExpBDE54: A Slim Experimental Benchmark for Exploring the Pareto Frontier of Bond-Dissociation-Enthalpy-Prediction Methods

ExpBDE54: A Slim Experimental Benchmark for Exploring the Pareto Frontier of Bond-Dissociation-Enthalpy-Prediction Methods

ExpBDE54 is a benchmark dataset of experimental homolytic bond-dissociation enthalpies (BDEs) for 54 small molecules, used for benchmarking DFT, semiempirical methods, and NNPs.
Jul 17, 2025 · Jonathon E. Vandezande, Corin C. Wagen
Benchmarking Protein–Ligand Interaction Energy

Benchmarking Protein–Ligand Interaction Energy

How new low-cost computational methods perform on the PLA15 benchmark.
Jul 11, 2025 · Ishaan Ganti
Efficient Black-Box Prediction of Hydrogen-Bond-Donor and Acceptor Strength

Efficient Black-Box Prediction of Hydrogen-Bond-Donor and Acceptor Strength

Here, we report a robust black-box workflow for predicting site-specific hydrogen-bond basicity and acidity in organic molecules with minimal computational cost.
Jul 1, 2025 · Corin C. Wagen
Tracking Boltz-2 Benchmarks

Tracking Boltz-2 Benchmarks

Tracking the community's response to the new Boltz-2 model, plus some notes about Chai-2.
Jul 1, 2025 · Corin Wagen
g-xTB, Credit Usage, & More

g-xTB, Credit Usage, & More

the new g-xTB model from Grimme and co-workers; an easy visual overview of credit usage; better credit handling for organizations; bulk PDB download; a new collapsible JSON viewer
Jun 27, 2025 · Jonathon Vandezande, Ari Wagen, Spencer Schneider, and Corin Wagen
Representing Local Protein Environments With Atomistic Foundation Models

Representing Local Protein Environments With Atomistic Foundation Models

A guest post about how to use NNP embeddings for other prediction tasks.
Jun 20, 2025 · Meital Bojan and Sanketh Vedula
Co-Folding Updates

Co-Folding Updates

Boltz-2 FAQ and launch event recap; new visuals for co-folding workflows; new submission options; PDB bugfixes; new credit-management tools
Jun 12, 2025 · Ari Wagen, Spencer Schneider, and Corin Wagen