Competing Interests: The authors have declared that no competing interests exist.
In the case of airborne diseases, pathogen copies are transmitted by droplets of respiratory tract fluid that are exhaled by the infectious that stay suspended in the air for some time and, after partial or full drying, inhaled as aerosols by the susceptible. The risk of infection in indoor environments is typically modelled using the Wells-Riley model or a Wells-Riley-like formulation, usually assuming the pathogen dose follows a Poisson distribution (mono-pathogen assumption). Aerosols that hold more than one pathogen copy, i.e. poly-pathogen aerosols, break this assumption even if the aerosol dose itself follows a Poisson distribution. For the largest aerosols where the number of pathogen in each aerosol can sometimes be several hundred or several thousand, the effect is non-negligible, especially in diseases where the risk of infection per pathogen is high. Here we report on a generalization of the Wells-Riley model and dose-response models for poly-pathogen aerosols by separately modeling each number of pathogen copies per aerosol, while the aerosol dose itself follows a Poisson distribution. This results in a model for computational risk assessment suitable for mono-/poly-pathogen aerosols. We show that the mono-pathogen assumption significantly overestimates the risk of infection for high pathogen concentrations in the respiratory tract fluid. The model also includes the aerosol removal due to filtering by the individuals which becomes significant for poorly ventilated environments with a high density of individuals, and systematically includes the effects of facemasks in the infectious aerosol source and sink terms and dose calculations.
It is well known that some diseases such as influenza, the common cold, Mycobacterium tuberculosis, measles, and Severe Acute Respiratory Syndrome Coronavirus 1 (SARS-CoV-1) are airborne; meaning they can be transmitted by particles (also called liquid droplets, aerosols, or, if completely dried, droplet nuclei) exhaled by infected individuals that stay suspended in the air for some time rather than immediately falling to the ground. These particles come from the fluid of the lungs, vocal chords, mouth, and nose; which hereafter are all noted as “respiratory tract”. While these particles that stay airborne as well as larger ones that tend to fall on the ground and surfaces are all drops/droplets unless they have completely dried out to solid solute and they are technically aerosols (albeit, sometimes large), the literature usually refers to small airborne ones as aerosols and the larger ones that don’t get suspended in the air as drops/droplets, which we shall do here as well. Note that these diseases can have additional transmission pathways, which can be more or less significant depending on the circumstances. Whether Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) is an airborne disease or not and the relative importance of the airborne pathway to the pathway of exhaled droplets too large to stay airborne ballistically getting on susceptible individuals and surfaces have been topics of ongoing discussion and debate throughout the pandemic [1–3]. Due to the possibility that SARS-CoV-2 might be an airborne disease among other transmission pathways, the SARS-CoV-2 pandemic has brought an increased interest in airborne disease transmission dynamics and models.
The risk of getting infected from such airborne particles for an individual or a population has been the subject of numerous studies and analyzes [1, 4–10]. Many of the transmission mitigation strategies rely on results obtained by models that take into account a variety of factors to assess the likelihood of transmission, a good example of which is the World Health Organization’s 2009 guidelines Natural Ventilation for Infection Control in Health-Care Settings [11]. Two well-known families of models are dose-response and Wells-Riley models, which have been extensively used to model the spread of airborne diseases [12].
There are several dose-response models for various diseases in existence which consider the risk of infection for an average dose of pathogen copies, taking full account of the counting statistics [13]. Two common models are the exponential and beta-Poisson models, which are described in great detail by Haas, Rose & Gerba [13]. Many diseases follow the exponential model, which has the added simplicity of having only a single adjustable parameter. Both the exponential and beta-Poisson models assume that the minimum number of pathogen copies required for infection, the threshold, is one; but other models exist for non-unity thresholds. Both models, along with many others, assume that the number of pathogen copies absorbed follows a Poisson distribution; though modification of the exponential model for doses following a beta or gamma distribution has been conducted [5].
The Wells-Riley model, in its original form, takes the steady state balance of sources and sinks of airborne infectious pathogen copies (in units of quanta) over a period of time in a well-mixed indoor environment such as a room or several rooms connected via ventilation (homogeneous concentration assumption) to calculate the average dose received by susceptible individuals over a time period, which is then run through an exponential dose-response model [4]. The original model measures pathogen copies in units of quanta, which is defined as ID63.21 pathogen copies [12]. Sources such as exhalation by infectious individuals in the environment and air exchange with other environments with infectious aerosols and sinks due to fluxes with outside, filtering by the ventilation, filtering by masks, inactivation, settling, and deposition have all been considered as well as full temporal modelling of the infectious aerosol concentration rather than assuming steady-state [1, 4–10]. At the model’s heart, it is essentially a conservation of infectious aerosols model, choosing some sources and sinks to explicitly include and considering others to be negligible, to get the pathogen concentration and then the average inhaled dose, before using a dose-response model (usually the exponen-tial model) for the infection risk. Note, in the literature the term “Wells-Riley model” is some-times used to refer only to when this formulation is used with an exponential model, and the terms “Wells-Riley equation” and “dose-response model” used if other dose-response models are used instead (e.g. [12]). We will use the term “Wells-Riley formulation” to refer to both.
Wells-Riley formulations are a statistical treatment of airborne disease transmission. Underneath its source, sink, and respiratory tract absorption parameters (as well as the choices of which to include and exclude) and its well-mixed assumption and their caveats/limitations are a mix of fluid dynamics with inertial particles (aerosols), the biological processes of the respiratory tract and diseases, thermodynamics, aerosol chemistry, human behavior and safety interventions (e.g. wearing masks), etc. This includes breathing rates for different activities [14–17]; the dynamics of exhaled puffs and the particles within them by breathing, speech, coughing, etc. [18–21]; the generation and ejection of aerosols and larger droplets by breathing, speech, coughing, etc. [16, 19, 22, 23]; aerosol/droplet growth/evaporation in response to temperature and humidity [19, 24–28]; the dynamics of inertial particles in turbulence; mixing and transport [18–21, 29, 30]; ventilation and convection in indoor environments [30]; etc. There have been a number of recent papers that each go into several of these topics written during the course of the ongoing SARS-CoV-2 pandemic [19, 28–31], which while focused on SARS-CoV-2 are also applicable to other airborne diseases. In this manuscript, we will mostly focus on a statistical treatment.
In the past, various generalizations and improvements have been applied to the Wells-Riley formulation for situations beyond its original design and to address its limitations [12]. For example; Nicas, Nazaroff & Hubbard [9] included sink terms for pathogen inactivation, aerosol settling, and deposition as well as less than unitary efficiency of the respiratory tract absorbing infectious aerosols. Wells-Riley formulations have also been combined with SIR (Susceptible-Infectious-Removed) and SEIR (Susceptible-Exposed-Infectious-Removed) models [6, 32]. Noakes & Sleigh [33] made a stochastic model with compartmentalization of the environment into well-mixed subregions that have less mixing with other regions that can work for periods of time longer than the incubation period. Recent Wells-Riley based analyzes during the ongoing SARS-CoV-2 pandemic also include the effects of masks (such as [10]) unless they are investigating scenarios in which individuals are not wearing any mask [1], though including the effect of masks predates the pandemic by decades [5–8].
One of the biggest assumptions of the Wells-Riley formulation is that the indoor environment is sufficiently well-mixed [1, 4, 7–10, 12, 33]. Essentially, it assumes that the infectious aerosol concentration is homogeneous enough that the concentration inhaled by susceptible individuals and at all sinks is approximately equal to the volume average concentration [1, 4, 7–10, 12, 33]. In reality, there can be concentration gradients on both large and small length scales in the environment. For example, the infectious aerosol concentration at close range directly in front of an infectious individual will usually be larger than the volume average of the whole environment since exhaled puffs from the infectious individual will not have dispersed much before inhalation. This means that a susceptible individual located where they can inhale such puffs would be at greater risk of infection than if they were not directly in the exhaled puffs of any infectious individuals. The nature of the ventilation plays a significant role in the validity of the assumption [30] The practice of social distancing, using fans to better mix the room, etc. all improve the quality of this assumption, but room conditions and people’s proximity to each other in real-world situations can be far away from the well-mixed state with everyone inhaling well-mixed air. We will make this same exact assumption in the model presented in this manuscript, and will neither be using nor developing corrections for close proximity between individuals and localized sinks and other sources, though the nature of partial corrections will be briefly discussed.
Besides the well-mixed assumption, there are several other assumptions associated with Wells-Riley formulations, which are not necessarily always true. As an example, there is an additional loss term that has not been fully considered yet that is the loss of the infectious aerosols absorbed by the individuals themselves, though the self-proximity depletion of infectious aerosols in the vicinity of susceptible people has previously been mentioned as an effect to consider [29]. This is despite the fact that this is exactly the reason that susceptible individuals get infected. In some cases this can be safely neglected, e.g. if the combined breathing volume exchange rate of all individuals in the environment is negligible compared to that of ventilation. But in a poorly ventilated room with many individuals inside, this sink term must be taken into account—not incorporating it leads to false risk predictions.
Another large assumption is that the absorbed doses follow a Poisson distribution, which is implicit in the use of the exponential dose-response model even if not stated explicitly [1, 4, 6, 9, 10], though there has been work on doses following beta and gamma distributions [5]. The Poisson distribution assumption requires that the pathogen-carrying aerosols have at most one pathogen inside, i.e. a mono-pathogen assumption. However, this assumption is violated if the pathogen concentration in an infectious individual’s respiratory tract is high. For this poly-pathogen situation the Wells-Riley formulation and the dose-response models must be generalized to consider a larger number of pathogen in an individual aerosol explicitly. We will use the term multiplicity to refer to the number of pathogen copies in an aerosol.
Ignoring multiplicity causes the infection risk to be overestimated even though the expected average pathogen dose does not change. Using a modified version of the worked example later in this manuscript, Fig 1 shows this effect on the time required to reach a 50% infection risk for different pathogen concentrations in the respiratory tract fluid with and without considering multiplicity. For low pathogen concentrations and small infection probabilities per pathogen, ignoring multiplicity has only a small effect. But for high pathogen concentrations and/or pathogen copies with a high infection probability per pathogen, ignoring multiplicity has a significant impact. For a respiratory tract pathogen concentration of 1011 cm-3 where the average number of pathogen copies per aerosol is approximately 6500 for a 50 μm in diameter at production, if the single pathogen infection probability (r) is large enough that multiplicity matters, this means taking into account multiplicities up to approximately 7000.


Effect of ignoring multiplicity.
Ratio of the time required to reach a 50% infection risk when multiplicity is ignored τ50,ignore to when it is fully accounted for τ50,full for single pathogen infection probabilities r (an average dose of r−1 Poisson distributed pathogen copies gives a mean infection risk of 63.21%) and different pathogen concentrations ρp in the respiratory tract fluid of the infectious individual as in the worked example later in the manuscript with a disease following the exponential model, but at steady-state with just the speaking mask-less infectious individual and the risk to a mask-less susceptible individual whose exposure starts after steady state is reached. This is a simplified version of Fig 5.
In this manuscript, we will consider the following generalizations and modifications to the Wells-Riley formulation:
Fully accounting for the multiplicity of pathogen copies in aerosols and the effect on the dose-response models.
Additional sink terms due to the filtering of air by people inhaling and then exhaling it back out, including the effects of masks.
Working exclusively in units of pathogen copies and aerosols instead of quanta (note, quantum is undefinable when accounting for multiplicity).
We will first generalize dose-response models that assume Poisson distributed doses for the distribution that results from poly-pathogen aerosols being present. Then we will develop the general pathogen concentration model that is a generalization of the Wells-Riley formulation. This results in a linear inhomogeneous coupled system of ODEs (Ordinary Differential Equations) for each initial aerosol diameter at production (diameter when exhaled), with one equation for each multiplicity that must be considered. We then derive the general solution, and then simplify the general solution for coefficients that are constant in time. Requirements and heuristics are developed for finding the appropriate cutoff in the multiplicity, Mc. This is important because the number of ODEs to solve is equal to Mc; and the computational effort scales as for the numerical solution, or worse than for
Throughout this manuscript, we will use the Poisson distribution, which describes the probability of counting some number, m, of independent events/objects/etc. as a function of the ensemble mean of the number counted, μ. The Probability Distribution Function (PDF) of the Poisson distribution is

Most dose-response models assume that the number of pathogen copies absorbed follows a Poisson distribution. For the case of a dose-response model, the average number of pathogen copies absorbed over some period of time would be the μ and then PP would give the probability that a person absorbed exactly m pathogen copies. For clarity in the rest of this manuscript, we will now define Δ to be the number of pathogen copies absorbed (instead of m) and the average number of pathogen copies absorbed is 〈Δ〉, where we have used 〈⋅〉 to denote the average. The use of a Poisson distribution for the doses requires that the pathogen copies are independent (i.e. no clumping); and as we will later show, that the number of pathogen copies in aerosols is assumed to be one or zero, which is generally assumed by existing models but won’t be in the model presented in this manuscript.
Let R(Δ) denote the infection probability when exactly Δ pathogen copies are absorbed, and
There are two ways to construct

The other method instead considers the number of pathogen copies that survive to try to infect, Δi, and does a double sum over Δi (starting from the threshold) and Δ of the product of the probability of the dose Δ and the probability of exactly Δi out of Δ surviving to try to infect [13] (this is NOT R(Δ)). The two methods are equivalent, with this extra sum being implicitly included in the definition of R(Δ). This is why R(Δ) is a piece-wise function when the threshold is not one. For some models it may be easier to do this other method explicitly rather than try to construct R(Δ).
The exponential model assumes that all pathogen copies are identical, all people are equally vulnerable to infection, the pathogen copies are acting independently of each other, and that each pathogen has an equal probability of causing infection r [13]. These assumptions implicitly mean that the threshold is one. Each pathogen has a probability 1 − r to not infect. Then the exponential model’s infection risk for an exact dose Δ is just one minus the probability that all Δ pathogen copies did not infect.

If the dose follows a Poisson distribution, then Eq (2) can be calculated for the exponential model [12], yielding

The beta-Poisson model is essentially the exponential model but instead of considering everyone to be equally vulnerable, each person has their own value for r which comes from the beta distribution [12, 13]. The beta distribution PDF [13] is



Wells-Riley formulations, both the original model and many subsequent uses, measure pathogen copies in units of quanta [1, 4, 8–10, 12, 33]. A quanta is defined as ID63.21 pathogen copies [12]. This means that one quantum is equal to D = 1/r pathogen copies. For the case of r = 1 such as Mycobacterium tuberculosis, one quantum is one pathogen [9, 12]. Using these units, the exponential model from Eq (4) becomes

Let NI be the number of infectious individuals, σ be the average production rate of infectious quanta per infectious individual, λ be the volumetric breathing rate of susceptible individuals, Q be the volumetric rate that clean air is brought into the particular indoor environment, and τ be the time period of exposure of susceptible individuals. Then, in its simplest form, the Wells-Riley Model’s infection probability for time periods smaller than the incubation period of the disease [4] is

If the pathogen concentration in an infectious individual’s respiratory tract fluid ρp is low enough, almost all exhaled pathogen copies will be the only pathogen in their aerosols, i.e. mono-multiplicity aerosols, and poly-multiplicity aerosols can reliably be ignored. We will use the tailing subscript k to denote aerosols with k pathogen copies inside them. An aerosol cannot contain more pathogen copies than will fit in its volume, and there is a limit to how large an aerosol/droplet a person can exhale. Let M be the maximum number of pathogen copies that can fit in the largest aerosol/droplet that can possibly be exhaled. This is the hard cutoff/limit on k. There also exists a soft cutoff/limit Mc ≤ M for which contributions of aerosols with k > Mc is negligible. In a worst case Mc = M, but in practice it is much lower since the pathogen volume fraction of respiratory tract fluid is quite low even at the upper pathogen load for some diseases and the largest droplets don’t stay airborne and ballistically fall to the ground. For example, SARS-CoV-2 at the very upper end of its concentration range at 1011 cm-3 [35, 36] would give a volume fraction of approximately 5 × 10−5, if we treat the virus as a 100 nm sphere (approximate size of the SARS-CoV-2 virus [37]). This is important because an aerosol with a diameter of 1 μm could contain up to approximately 740 spherical pathogen copies with diameter 100 nm, if we assume hard-sphere packing (packing fraction of 74%). An aerosol with a diameter of 10 μm could contain up to approximately 7.4 × 105 of the same pathogen copies for the same packing fraction.
To properly account for higher multiplicities, we must consider the separate doses for each multiplicity. Let Δk be the number of pathogen copies absorbed from aerosols with multiplicity k, and let mk be the number of aerosols absorbed with multiplicity k. The aerosol and pathogen doses are related by Δk = kmk. The total pathogen dose from all aerosols is just the sum of the doses for each multiplicity, which is
As long as the aerosols are randomly distributed in space (well-mixed with no clustering nor avoidance), then the PDF of each mk follows a Poisson distribution with mean μk. Since Δk = kmk, the PDF of Δk is not a Poisson distribution for k > 1. It is instead a scaled-Poisson distribution of the form

The deviation from the Poisson distribution is most visible in the fact that this distribution has holes. For example with k = 2, Pk = 0 for all odd Δk. Since Δ is the sum of a Poisson distribution for k = 1 and some number of possibly non-negligible scaled-Poisson distributions, the PDF of Δ will not be a Poisson distribution unless the contributions from k > 1 are negligible compared to k = 1. So we can’t just naively put the expected average dose into dose-response models expecting a Poisson distribution.
Instead, we must change the summation in Eq (2) to get the infection risk


Then, putting RE from Eq (3) into Eq (11), the exponential model mean infection risk is


The integral over r commutes with the sums in Eq (10). So as was with the case when multiplicity is not considered in Eq (7), we can get the moments by taking the result for the exponential model and integrating it times the beta distribution PDF over r. This is

Unfortunately, as is the case for when the dose is Poisson distributed [13], the integral cannot be solved analytically and must be solved numerically or approximated, though now it is harder with the extra terms for Mc > 1.
Now that we have dose-response models corrected for the multiplicity via Eq (11), we must determine the average aerosol doses μk for each multiplicity before the infection risk can be calculated. We now generalize the Wells-Riley formulation for multi-pathogen aerosols to get this. In the following sections, we will describe the environment, people, aerosols, sources, sinks, etc. to get the model equations. Let nk(d0, t) be the concentration density of aerosols with original diameter d0 (diameter at production) and k pathogen copies in them over time, which has units of [L]-4 where [L] is the unit of length since nk(d0, t)dd0 is the concentration of infectious aerosols with original diameters between d0 and d0+ dd0. To get a concentration, nk(d0, t) must be integrated with respect to d0.
In the end, we will get the following system of ODEs (Ordinary Differential Equations) in time t and the original diameter at production d0 for the nk, which is




| Term | Meaning | Form |
|---|---|---|
| βr,k(d0, t) | transport from other rooms | qr(t)nr,k(d0, t) |
| βI,k(d0, t) | production by infectious individuals |
![]() |
| αo(t) | air exchange with outside | qo(t) |
| αr(t) | air exchange with other rooms | qr(t) |
| αv(d0, t) | filtering by ventilation | qv(t)Ev(w(d0, t)d0) |
| αg(d0, t) | gravitational settling |
![]() |
| αd(d0, t) | deposition on surfaces | found elsewhere |
| αI,f(d0, t) | filtering by infectious ind. inhaling |
![]() |
| αS,f(d0, t) | filtering by susceptible ind. inhaling |
![]() |
| αO,f(d0, t) | filtering by other ind. inhaling |
![]() |
Like most Wells-Riley formulations, we consider the infection risk in one sufficiently well-mixed indoor environment such as a room or set of rooms sufficiently coupled together with respect to their air that they have the same infectious aerosol concentration densities. And we assume that sources, sinks, and individuals are far enough apart from each other that the local concentration densities at their locations are approximately equal to the average concentration density in the whole environment. Note that the particular kind of ventilation has an impact on the validity of this assumption [30]. See the Discussion for when this assumption is not valid. The environment could also be split into coupled well-mixed zones with weaker mixing between them [7, 33], but that shall not be considered here.
Let the volume of the environment be V. Air is exchanged with outside, with other rooms, and circulated internally through the ventilation system. Let Qo, Qr, and Qv be the volumetric rate of air exchange with outdoors, other rooms, and the circulating ventilation of the environment (ventilation system that pulls air out of the environment and puts it back in). These will be normalized by the environment volume; yielding qo ≡ Qo/V, qr ≡ Qr/V, and qv ≡ Qv/V since target values of these parameters are often the design goals for HVAC systems.
Consider the concentration of infectious aerosols over time. To be completely accurate, we need to consider the concentration density for each multiplicity k as a function of time, current diameter d while in the environment, and the solute content (including inactivated pathogen copies). We have to consider both d and the solute content because an exhaled aerosol’s equilibrium diameter is a function of its solute content, the humidity, and the temperature [27]. Higher solute concentrations decrease the vapor pressure of the aerosol, which allows equilibrium to be reached as long as the environment isn’t super-saturated or too close to saturated [26, 27]. For higher humidities, an aerosol will continue to grow by condensation indefinitely, though the growth rate slows towards a crawl for d > 20 μm [26, 38]. But such super-saturated conditions can cause clouds/fog, which rarely occur in indoor environments. So we will assume the environment is sub-saturated. If the environment is dry, the aerosols can evaporate at most to the point where they are purely precipitated solid with no water left. Note that as a drop (whether large or a small aerosol) dries, the solute fraction increases, until at some point the solute makes the shape non-spherical (not enough water to spherically encapsulate the insoluable components, solute causing anisotropy and/or inhomogeneity in the surface tension, etc.). This will occur at a humidity no lower than the efflorescence relative humidity of the solute mix, where the soluble solutes will homogeneously nucleate and the water completely evaporates away. Infectious aerosols always have at least two components of the solute (whatever is in the respiratory tract fluid plus the pathogen/s), so there is the possibility of heterogeneous nucleation causing the water to completely evaporate away at a higher humidity.
This means that we have four different diameters to consider, which are
dcurrent diameter in the environment (spherical equivalent diameter if it is completely dry or almost dry and the solute causes a non-spherical shape)
deequilibrium diameter in the environment
d0wet diameter at production (original diameter), which determines the distribution of initial multiplicities
dDspherical equivalent dry diameter when all water is evaporated away and just solute remains (note that the aerosol may no longer be spherical, so the spherical equivalent diameter for the same volume must be used)
For any aerosol; d0 and dD are fixed and never change as long as collisional-coalescence and shattering don’t occur (can be treated as fixed if these processes are negligible), de is dynamic in time if the environment’s temperature and/or humidity changes, and d is dynamic in time unless the environment’s temperature and humidity exactly match those inside the respiratory tract at the point of production.
Small wet/nucleated aerosols respond very quickly to the humidity and temperature, evaporating/condensing to their equilibrium diameter in a very short period of time due to their high surface area to volume ratio [9, 25, 26]. Assuming the environment is well-mixed enough that the time between exhalation from an infectious individual and inhalation by any person is long compared to the evaporation/condensing time scale, we can make the approximation that all aerosols are at their equilibrium diameter when in the environment (d ≈ de). This means that when de increases from de = dN (completely evaporated) to de > dN (wet/nucleated), we are assuming that the time the aerosols require to nucleate and grow to de is short compared to other time scales in the model and therefore also make the approximation d ≈ de even when de increases from de = dN to de > dN. This means that we just need to worry about the equilibrium diameter and its changes, and not the non-immediate response to shifting equilibrium diameters. There is one complication, however. Aerosols will initially stay in the exhaled plume where the humidity is higher, so they won’t reach the well-mixed equilibrium diameter till they leave the plume or the plume is diluted and mixed with the environment, which brings us back to the well-mixed environment assumption.
We will also make the assumption that the temperatures and humidities in different individuals’ respiratory tracts (and the volume under their facemasks if they are wearing any) are similar enough and change negligibly enough over time that the equilibrium diameter in people’s respiratory tracts is d0. If the aerosols have not completely dried out in the environment (de > dD), the aerosols will start to grow inside people’s respiratory tracts back towards d0 and thus d ≈ d0 inside the respiratory tract. But the time scale of breathing is short and for completely dried out aerosols it takes time to nucleate and grow back to d0, which means that some fraction of dry aerosols might not reach d = d0 while in the respiratory tract. However, we will make the assumption/approximation that dry aerosols have returned to their original diameter by the time they are exhaled back out if they were not absorbed in respiratory tract. This last approximation only affects the sink from individuals inhaling aerosols αC,f(d0, t) from Eq (42) if they are wearing masks, which is usually small compared to other sinks. When the individuals in the environment are wearing masks and the αC,f(d0, t) sinks dominate, then a better approximation or an explicit treatment of the diameter when exhaled should be used. Combined, our assumption/approximation is

Let us define ratios between the remaining diameters: the evaporation ratio w, the dilution ratio δ, and the initial solute ratio ζ as



Note that w and δ are potentially functions of time, as well as diameter due to the effect of surface curvature (through surface tension) on equilibrium vapor pressure [26, 27]. Also, different solutes have different molar densities, different practical osmotic coefficients, and maximum concentrations before they precipitate; and therefore different functional relationships between the saturation vapor pressure and the concentration [27]. So different solute compositions will cause w and δ to be different even for aerosols with the same ζ.
But, we will make the assumption that the value of ζ and the solute composition (except for the pathogen copies) is approximately constant from each infectious individual to the next and over time with each infectious individual, and we will ignore the contribution of the pathogen copies (both active and inactivated) to the equilibrium vapor pressure and therefore de. We will also assume that ζ has no diameter dependence (i.e. attraction and repulsion of solutes from the liquid surface at production has a negligible effect on solute fraction and composition). With these approximations, we have a single constant value of ζ and single functions for w and δ, possibly over time and d0 (or equivalently dD), for all infectious aerosols in the environment.
This means we can choose to track one of de, d0, or dD and always know the other two through the ratios that are the same for all infectious aerosols at the same moment of time with the same value of the chosen diameter parameter. Thus we have two independent variables, t and one diameter parameter.
Processes such as gravitational settling, deposition, filtering or exchange by the ventilation, filtering by facemasks when inhaling are all functions of the current diameter, which is approximately de, making de convenient. Additionally, any non-drying aerosol instrument can be used in the environment to measure de. But, because de can change over time for a fixed dD or d0, the equations for the aerosol concentration density in terms of t and de have a flux term (from evaporation/growth) with a partial derivative with respect to de; making the equations PDEs (Partial Differential Equations) which adds complications in the analysis. This can be seen by considering the total time derivative of the aerosol concentration density

Since dD and d0 are fixed for a given aerosol over time regardless of how the temperature or humidity in the environment might be changing, the equivalent flux term is zero and thus the equivalent functions are ODEs, which are much easier to solve. Thus, we eliminate de as a choice for the diameter parameter.
The model in this manuscript can be constructed with either choice of d0 or dD, with w appearing in places if d0 is chosen, and both δ and ζ appearing in places if dD is chosen. We choose d0 because then we only need one of the ratios (w only), the diameter limits are easier to express in it, and the literature on the diameter distributions of exhaled aerosols generally work hard to convert their measurements (vary between whether they are de or dD) into expressions in terms of d0 rather than dD.
Now, nk(d0, t) is the concentration density of aerosols in terms of t and the original diameter d0. Let



Let n0,k(d0) be the initial concentration density in the room for a multiplicity k at the initial time t = t0 and nr,k(d0, t) be the volume averaged concentration density of the air coming in from other rooms. We are assuming that the concentration density outdoors is negligible.
For the model, we will limit ourselves for each multiplicity to the range d0 ∈ [dm,k, dM] where dm,k is the minimum aerosol diameter required to hold k pathogen copies, and dM is a diameter cutoff separating larger aerosols that are more ballistic and gravitationally settle to the ground too quickly to become well mixed and smaller aerosols that more closely follow the flow and mix. Let Km(d0) be the largest number of pathogen copies that can fit in an aerosol at production. We will consider

For a spherical pathogen with diameter dp, we can use the crude approximation of just considering the total pathogen volume and a packing efficiency e = 0.74 (hard pack spheres) with a minimum of 1 and completely neglect the aerosol shape that small number of pathogen copies would force (two pathogen copies, for example, can’t be arranged into a configuration that even vaguely resembles a sphere). We can use the same idea to get Km(d0). Both of them are


At the lower limit near dm,k, the pathogen/s take up a disproportionate amount of the space in the aerosol compared to other solutes and the assumption of approximately equal solute concentrations at production is violated and the evaporation ratio has a strong dependence on d0 and the initial multiplicity, the latter of which we aren’t tracking at all. However, as long as the total liquid volume of exhaled aerosols with diameters close to dm,k (say, those whose diameters are small enough that their volume is only a few times larger) is small compared to total liquid volume of the rest of the range in d0, this problem will have a negligible effect. Additionally, the diameter dependence of many of the sink terms may be much smaller close to dm,k for submicron pathogen copies which means that the effect of assuming the wrong evaporation ratio may be small. The smaller the pathogen, the less issues this will pose. It will be least important for small viruses, and possibly quite important for large bacteria and eukaryotic pathogens.
The upper limit is rather imprecise since there is no single hard separation scale that could be chosen unless the air is completely still in which case one can use a so called “Wells curve” (same Wells as of the Wells-Riley model) for the environment’s humidity to determine the largest size that won’t settle to the ground before evaporating to their equilibrium diameter, such as the original one [24] or newer ones [25]. But mixing of any sort complicates this. One might think that one could just rely on the fact that the gravitational settling sink term keeps growing with diameter and not bother with the problem. But, the well-mixed assumption breaks down and the lifetime of the aerosols converges towards depending solely on the initial diameter and the height of the infectious individual’s mouth and nose from the ground. Additionally, the time to evaporate to the equilibrium diameter increases with increasing size. And from a practical standpoint, it is necessary in order to keep Mc from getting too large since


Both grow quadratically with diameter, which does not lend itself to a well defined cutoff scale. And additionally one must consider that once exhaled, the aerosols will tend to evaporate (relative humidity in the environment is typically lower than in the respiratory tract where it is close to 100%) thereby reducing their inertia and terminal velocities. For 10 μm, 20 μm, and 50 μm diameter aerosols; the terminal velocities at 20°C and atmospheric pressure are 3.0 mm s-1, 1.2 cm s-1, and 7.5 cm s-1 respectively. However, larger aerosols take longer to evaporate/grow to their equilibrium diameter and therefore will settle at a faster rate initially than their final equilibrium diameter suggests, which makes them even more likely to be lost due to settling than smaller aerosols.
The simulations of Chong et al. [21] indicate that 100 μm aerosols are quite ballistic and quickly fall out of the exhaled plume, but 10 μm aerosols are carried along with the plume and stay in the air despite their evaporation being greatly slowed. This suggests that dM should be chosen somewhere in the 10–100 μm range, which is further supported by the Wells curves found by Xie et al. [25]. For lack of a better suggestion; we suggest the use of dM = 50 μm, which will be explored in the Discussion. Before evaporating, the terminal velocity is 7.5 cm s-1. If the evaporation ratio is a typical value in the
We will denote infectious individuals by the subscript I, susceptible individuals by the subscript S, and other individuals by the subscript O. The Other category is all the individuals who are non-infectious non-susceptible. This includes individuals that are immune before they enter the environment (following Jimenez [10]), all of the Removed group in SIR and SEIR models except for the individuals who died or leave the environment, and all of the Exposed group in SEIR models. If one wants to make a full SEIR model from the model presented in this manuscript, the two subgroups (Exposed, and the part of Removed that is still within the environment and breathing plus the previously immune individuals) within this group will have to be treated explicitly. Let the number of individuals in category C be NC. The total number of individuals is N = NI + NS + NO. The subscript A will be used to refer to all individuals in all categories. Each count is potentially a function of time as individuals can come in and out of the environment. Let 〈⋅〉C denote taking the average over all individuals in category C.
Let λC,j(t) be the volumetric breathing rate of the j’th person in category C. Let EC,m,in,j(d) and EC,m,out,j(d) be the filtering efficiency of the mask (if any) of the j’th person in category C for inhalation and exhalation respectively.
The filter efficiencies of most masks vary significantly with aerosol diameter. Note that it is important that the leak rate of the mask be included in its filtering efficiency. These two filtering efficiencies are generally not equal because masks tend to leak more during exhalation than inhalation and aerosols have higher velocities on exhalation than inhalation. We will assume that all infectious aerosols caught by the mask aren’t later re-aerosolized.
Let EC,r,j(d0, w, λC,j) be the filtering/absorption efficiency of the respiratory tract of the j’th person in category C. This term is non-zero, but it is also not equal to one since the respiratory tract does not absorb all infectious aerosols that pass through it [5, 7, 9, 12]. The best example of this is the observation that individuals can inhale smoke (which is composed of many aerosols) and then exhale some of it back out. The filtering efficiency depends on the original diameter of the aerosols and the evaporation ratio in the environment since d0 and w give both the initial diameter on inhalation (d ≈ de = wd0), the diameter the aerosols grow towards (d0) if they are wet on inhalation or nucleate inside the respiratory tract if they are completely dry on inhalation, as well as the time they spend inside the respiratory tract which is inversely proportional to λC,j. It must capture the time it takes for the aerosols to nucleate and grow if they are dried out, the growth process inside the respiratory tract, and the absorption probability as they pass through the respiratory tract. A useful reference for the nucleation and the growth processes would be Pruppacher & Klett [27], and a useful reference for the absorption processes for particles in the respiratory tract would be ICRP [39].
The diameter will be de = wd when passing through the mask on inhalation, and d0 when passing through the mask on exhalation since the humidity between the mouth and nose and the mask is high and the distance is short, so there is little time for evaporation. It is often easier to work with the survival efficiencies rather than the filtering efficiencies, defined as



We will assume that the number of infectious pathogen copies in each exhaled droplet/aerosol follow a Poisson distribution where the mean count is equal to the droplet/aerosol’s initial volume times the pathogen load in respiratory tract fluid at the point of production. This excludes diseases where pathogenic agents stick together and clump. Note that this implicitly means we are assuming that the pathogen volume fraction in the respiratory tract fluid is small. Otherwise, the non-Poissonity caused by there being a maximum number of pathogen copies that can fit in a finite sized drop will NOT be negligible.
Let ρj(d0, t)dd0 be the number density in exhaled air of the aerosols with diameters between d0 and d0 + dd0 exhaled by the j’th infectious individual at time t. Let ρp,j(t) be the pathogen concentration in the j’th person’s respiratory tract fluid where the aerosols are being produced. The mean/expected multiplicity for infectious aerosols produced by the j’th infectious individual for any d0 is

If the pathogen copies are Poisson distributed in the fluid that makes up the aerosols (no clumping, etc.), then

We will denote sources by the symbol β with a subscript denoting the individual source. All of them are normalized by the volume of the environment, V.
First, ventilation with other rooms brings infectious aerosols inside at a rate, normalized by the environment volume, of

The other source is the infectious individuals exhaling aerosols with pathogen copies in them. The total production from the infectious individuals normalized by the environment volume is the sum of the products of the breathing rate, the exhaled aerosol concentration density, and the survival efficiency of the mask [7, 10]; which is

Other than not replacing the average of the product with the product of the averages, following aerosols by multiplicity and diamater, and not using quanta; this term is identical to the equivalent term by Nazaroff, Nicas & Miller [7] and Jimenez [10] and, if masks are removed, that of the original formulation [4].
Now, it may be the case that an infectious person has different respiratory tract pathogen concentrations at different locations where exhaled aerosols are produced (e.g. different concentrations in the lungs and mouth). In this case, one would split the term in Eq (37) for the particular infectious person into separate terms for each location of production and use different ρj(d0, t) and 〈k〉(d0, t)j in nI,j,k(d0, t) from Eq (35).
Sinks are proportional to the concentration density nk. We will denote all sinks divided the concentration density by the symbol α with a subscript denoting the individual source. All of them are normalized by the volume of the environment, V. Unlike the sources, none of the sinks (except inactivation, considered separately) depend on the multiplicity and therefore the subscript k is dropped. Note that inactivation is treated separately later since it is a flux term when considering each multiplicity separately, unlike in the traditional formulation where it is a sink.
The volume normalized loss rate coefficients of infectious aerosols due to exchange of clean air with outdoors and other rooms are just the volume normalized flow rates [9, 33] and are


Let Ev(d) be the filtering efficiency of the circulating ventilation system for aerosols with diameter d. The diameter when an aerosol reaches this filter is d ≈ de = w(d0, t)d0. Then the volume normalized loss rate coefficient from the circulating ventilation system [4] is

Aerosols also gravitationally settle and deposit onto surfaces. We will treat these processes as simple loss rates proportional to their concentration densities just as one does with radioactive decay. The volume normalized loss rates divided by the concentration density, of gravitational settling and deposition are defined to be αg(w(d0, t)d0) and αd(w(d0, t)d0) respectively; which depend on the room geometry, aerosol diameter, and air flow in the room. A possible approximate expression for the settling loss term [9] would be

Unfortunately, when individuals inhale infectious aerosols, some are absorbed thereby causing a risk of infection. While this phenomena is not desired for susceptible individuals, we must consider the loss rate from this process by the susceptible individuals as well as the infectious individuals and the non-infectious non-susceptible individuals. There are three steps to the filtering process for the j’th person of category C: passing through the mask on inhalation, passing through the respiratory tract, and then passing through the mask on exhalation.
The total survival probability of an aerosol going through all three steps is the product of the individual survival rates. The total filtering efficiency is then one minus the total survival rate. But, there is a time delay between when the aerosols are removed from the environment on inhalation and when the survivors are exhaled back out. As long as this time is short compared to all other time scales such as mixing times in the room, the time scales of all other sinks, the time scale of inactivation, etc.; we can ignore this time delay and consider the re-exhalation to occur at the same time. This assumption implies that we can neglect possible changes in multiplicity by inactivation while the aerosols are in the respiratory tract. In most situations, this is a reasonably good assumption. But, at a swimming pool where people regularly hold their breath for long periods of time, this assumption could be violated for the highest multiplicities since the inactivation rate from k to k − 1 is proportional to k.
We assume that the individuals are far enough away from sources and that the environment is well-mixed enough that the concentration density in the air inhaled by each individual is approximately the average concentration density nk(d0, t). See the Discussion for a brief qualitative discussion of what the required corrections would look like when this assumption is not valid. Note that we will make the assumption that the self-proximity correction for infectious individuals is negligible (each infectious individual is by definition in close proximity to an infectious individual, themself), though this could pose an issue when the transport of infectious aerosols in the environment to an individual is weak [29]. Then the number of aerosols that are inhaled by a person is equal to λC,j(t)nk(d0, t). The volume normalized sink coefficient from this filtering is then

When a pathogen in an aerosol with multiplicity k inactivates, the aerosol’s multiplicity changes to k − 1. We will model inactivation of pathogen copies as exponential decay with inactivation rate γ(t), which might depend on time (e.g. dependence on UV light intensity, humidity, etc. that could be fluctuating in time). For aerosols with a multiplicity of k, the volume normalized loss rate to multiplicity k − 1 is just

Two pathogen copies will never inactivate at exactly the same time; so we don’t have to consider flux terms beyond the two neighboring multiplicities.
All of the sources, sinks, and flux terms can be collected to make the system of differential equations describing the infectious aerosol concentration density, which is

We have assumed that shattering and collisional coalescence of infectious aerosols, whether from turbulent induced collisions or differential gravitational settling, is negligible. Collisional coalescence could begin to be important if there are a significant number of very large aerosols and/or nk is very large. Particularly, d > 100 μm aerosols/droplets, even though they will generally settle to the ground/floor before evaporating to their equilibrium diameter [24, 25], can capture smaller aerosols on their way to the ground/floor [26, 27, 38]. This will generally be negligible unless individuals are situated in the environment such that the large aerosols exhaled by one person (who need not be infectious) will fall through the exhaled aerosol plume of an infectious individual, and potentially negligible even then. If the aerosol concentration, including non-infectious aerosols, reach the levels seen in atmospheric clouds, collisional coalescence might also have to be considered along with keeping track of k = 0 aerosols; though this is very unlikely in indoor environments except when there is a lot of smoke or artificial fog machines are in use, like in a discotheque or theater.
Then, putting the flux terms into Eq (44), we have the following system of ODEs to get the concentration density

Luckily this is a system of ODEs rather than PDEs with flux terms in diameter (involving derivatives with respect to diameters). This is the advantage of choosing d0 or dD instead of de. For practical applications, this also means that we can also split the diameter range into bins and solve it for each bin separately since there are no flux terms between bins. (See S3 Appendix for how to bin the model with respect to diameter.).
This is a linear inhomogeneous finite system of coupled ODEs at each d0. The number of equations in the system is finite since k is non-negative and there is the maximum theoretical multiplicity M. Moreover, we don’t even need to care about k = 0 since those aerosols are no longer an infection hazard. Additionally, the system that needs to be solved is smaller if Mc < M. If Mc = 1, then we have only one ODE. This situation occurs if the pathogen load of respiratory tract fluid is low enough that very few aerosols have 2 or more pathogen copies in them.
Note that this model demonstrates superposition with respect to sources since it is linear, as expected intuitively—each aerosol is independent of all others, therefore the response (concentration density and expected dose) from each individual source is independent of all other sources. If nk,1 and nk,2 are solutions for the same α and γ but different sources βk,1 and βk,2 respectively, then the solution for βk = βk,1 + βk,2 is nk = nk,1 + nk,2.
Let μj,k be the average number of aerosols with multiplicity k absorbed by the j’th susceptible individual from time t0 to time t. At any particular instant of time, the average number of such aerosols of each original diameter d0 entering the person’s mask if they are wearing a mask or their mouth and nose if they aren’t is λS,j(t)nk(d0, t). Note that we have assumed that the j’th susceptible individual is not close enough to any sources or filtering sinks that the concentration density of the air they are inhaling deviates significantly from nk(d0, t). For susceptible individuals in close proximity to infectious individuals, close to the output of ventilation, etc.; corrections must be applied. See the Discussion for a qualitative discussion on what the required corrections would look like.
A fraction SS,m,in,j(d0, t) will survive the mask to enter the respiratory tract [5–8, 10, 12]. A fraction ES,r,j(d0) of those survivors will be absorbed by the respiratory tract [5, 7, 9, 12], which contributes to the dose. The expected average aerosol dose is then the double integral of this over the d0 and the time between t0 and t, which is

In order to use the μj,k in the multiplicity-corrected dose-response model for the particular disease of interest
But it also requires that the effect of turbulent inertial clustering is negligible. We will now show that it is negligible except possibly at extremely high aerosol concentrations. It will be negligible if the aerosol Stokes numbers St = τp/τη are very small (St ≪ 1) [41, 42] where τp is the aerosol inertial response time scale from Eq (29) and τη is the Kolmogorov time scale of the turbulence in the environment, which is
There is an analytical solution to Eq (45), though it is not closed form unless the time dependence of α, β, and γ allow it. Eq (45) can be rewritten in matrix-vector form as


The general solution in matrix-vector form, shown in S1 Appendix, is

Working this out using the structure of the diagonalization of A in S1 Appendix, the general solution for each k is

We cannot go further in simplifying the general solution from Eq (50) without knowing the time dependence of α,















It is possible for λS,j to be a function of t but α not be (i.e. there is cancelation). But if λS,j and w are constant, the expected average aerosol dose of multiplicity k for the j’th susceptible individual in Eq (46) becomes

Calculation of




This recursive analytical solution for
In order to reduce the number of equations that have to be solved, we need to find a suitable cutoff Mc < M if at all possible, whether for the whole diameter range or for each diameter bin (advantage of doing a separate one for each bin is that Mc tends to be small for the small diameter bins), such that the contribution of all higher multiplicities is less than a threshold T ∈ (0, 1] fraction of the total contribution from all multiplicities. In many cases, this depends only on the ρp,j of the infectious individuals and one can skip directly to Eq (80) for the value of Mc to use (shown in Fig 2 for a few ρp,j). However, some cases such as when one starts the model after some number of infectious individuals have left the environment, when there is significant transport from other rooms, etc. require additional heuristics. These heuristics are developed below.


Required Mc based on pathogen concentration in infectious individuals.
Mc,I,j required to capture 99% of pathogen production for each diameter at aerosol production d0 from an infectious individual, with each line being a different pathogen concentration in their respiratory tract fluid ρp,j (see legend).
A cutoff is suitable if the total contribution for all k > Mc to the average pathogen dose and therefore infection risk is small compared to the total contribution for k ≤ Mc. It is almost always true that Mc < M, and in many cases it can even be Mc = 1. This depends on the distribution of exhaled aerosol sizes and the pathogen concentration ρp in the respiratory tract fluid where the aerosols are produced. For very low pathogen loading, one can use Mc = 1. Let d− and d+ be the bounds in d0 of the bin (or whole range in which case d− = dm,1 and d+ = dM) being considered.
The most reliable way to determine Mc is to use the model with the cutoff M and determine Mc afterwards using the result, but that defeats the point of finding Mc since the effort one wants to save has already been expended. So we need heuristics to determine Mc ahead of time. All of them consider the dose contribution from high multiplicity aerosols and consider a simplified kμj,k from Eq (46) with a particular concentration density multiplied by the average absorption efficiency of susceptible individuals. For each heuristic, we will define this parameter to be

An equivalent way to express this heuristic is to look at the ratio of the sum of

First, we define the average absorption efficiency of the susceptible individuals as

If the α, β, γ, and w are constant in time; it is a lot less effort to calculate n∞,k(d0) using Eq (65) than nk(d0). Then, each μj,k ∼ AS n∞,k. If qr(t) and nr,k(d0, t) are non-zero, the doses from them have a similar scaling. If the initial concentration density includes a lot of aerosols with high multiplicities, we will need to set Mc to be large enough to include them even if they won’t matter after the initial time. We need to consider this if n0,k ≫ n∞,k for any k > 1, and they will have a similar scaling. These heuristics are



The last heuristic is similar but considers the infectious individuals inside the environment instead of the concentration density. This has the advantage of not needing to determine n∞,k(d0). We essentially take the average over the d0 interval of βI,k(d0) from Eq (36) times the absorption efficiency of the average susceptible individual. We thus define the infectious individuals heuristic parameter

But there are practical difficulties in using it directly. So instead, we will define the heuristic for each individual infectious individual using the largest diameter in the range d+, and one would use the maximum Mc indicated by all of these. This has the advantage that there is a simple form for the required Mc, which is derived in S4 Appendix. It is


Fig 2 shows Mc,I,j as a function of d0 for several different ρp,j. Increasing ρp,j approximately just shifts the curves for Mc,I,j to the left on a log-scale. Notice the very strong effect of ρp,j on Mc, with values a little under 7000 being required for the largest diameter bin for ρp,j = 1011 cm-3 and a value of 2 being required for the same bin for ρp,j = 106 cm-3. Since Mc increases with d0, the vast majority of the effort to determine the concentration density and the infection risk will be spent on the largest bins except for small values of ρp,j.
We consider a hypothetical example based on the ongoing SARS-CoV-2 pandemic—a poorly ventilated seminar room with two infectious individuals with SARS-CoV-2 at the very upper end of viral concentrations (viral load) and one of them continuously coughing. We assume that the room is well-mixed and that the individuals are far enough apart from each other and the ventilation that no corrections to nk(d0, t) need to be applied at any source or sink, nor in the calculated absorbed doses. Let the room have volume V = 200 m-3 with a height of h = 4 m, with ventilation qr = 0, qv = 0, and qo = 0.5 hr-1. We will ignore surface tension’s effects on w. Let the humidity be such that the evaporation ratio is
We use mask filter efficiencies of the functional form

none (no mask) E0 = E∞ = 0.
mask simple1 E0 = 0.2 and E∞ = 0.8.
mask simple2 E0 = 0.95 and E∞ = 0.99.
The filtering efficiencies of both the simple1 and simple2 masks are shown in S1 Fig. The mask parameters were chosen such that they are more efficient at filtering large aerosols/droplets than small ones, with the simple2 mask being better than the simple1 mask. The simple1 and simple2 masks could reasonably correspond to a reasonably well fitted home-made cloth mask and an excellently fitted FFP2 mask, though here we have treated their leak rate to be the same during inhalation as exhalation (not true with most real masks). At the largest sizes, leakage doesn’t matter as much since the aerosols are more ballistic. Let us assume that
We assume that an exponential-dose response model is the correct model to use for SARS-CoV-2 since the exponential model works better than the beta-Poisson model for two other human infecting corona viruses (SARS-CoV-1 and HCoV-229E) [34]. In absence of a good value to use for r, we use the same value of r as found for SARS-CoV-1 in mice which is r = 2.45 × 10−3 and the same value of r as found for HCoV-229E in humans which is r = 5.39 × 10−2 [34]. We use γ = 0.64 hr-1 as the inactivation rate for SARS-CoV-2 [45].
We approximate the SARS-CoV-2 pathogen as a sphere with a diameter of 100 nm, which is close to the correct size and the rough shape with the surface proteins removed (actually an ellipsoid) [37]. We use the aerosol size distributions for speaking and coughing from Johnson et al. [22], but extrapolate them to smaller diameters (from 800 nm to 100 nm). This is used with Eqs (26) and (27) to get the βI,k. They are shown in the top-right panel of Fig 3. The aerosol size distributions have two peaks at approximately 2 μm and 100 μm. This puts dM between the trough (between the two peaks) and the second larger diameter peak.


Model solution for example.
Solution to the example case. (Top-Left) The total pathogen and infectious aerosol concentrations over time. (Top-Right) The infectious aerosol concentration densities in the room as a function of d0 at t = 6 hr compared to the aerosol concentration densities being exhaled by speaking and coughing individuals from Johnson et al. [22] scaled by 10−4 to make them have comparable magnitudes. (Bottom-Left, Bottom-Right) The mean infection risk
We now find the infectious aerosol concentration densities and doses, and mean infection risks
The model is solved for Stage 1 and then the final values used as initial values for Stage 2 because this makes it so that α and βk are constant in time when solving the model (all changes are between stages). For Mc, we used the maximum value of Mc,I,j for each infectious individual present at each Stage with T = 10−3. Note that Mc stayed the same or increased for each bin going from Stage 1 to Stage 2 with the addition of one more infectious individual.
For the i’th bin, the nk|i(t) and μj,k|i(t) are solved analytically if Mc ≤ 500 using the recursive solution and numerically if Mc > 500, both in IEEE-754 binary64 floating point (also known as double precision and float64). This threshold between analytical and numerical solving was chosen to use the analytical solution as much as possible without overflow in Vk (see S5 Appendix). As shown in S5 Appendix, binary64 numbers provide sufficient precision and allowed maximum magnitude. Note that overflow is easy to spot as infinities, which were not seen so this number format was sufficient to prevent overflow. When doing it numerically, Eq (47) along with
The total pathogen concentration is slightly less than double the infectious aerosol concentration in Stage 1, and slightly higher than double in Stage 2. This means that the average multiplicity in both stages is approximately two, and it increases slightly from Stage 1 to Stage 2 which is expected with the higher viral load in the second infectious individual. Also, as expected, increasing r (infection risk of each individual pathogen) increases the infection risk. As expected, susceptible individuals wearing masks decrease their infection risk and increasing exposure increases their infection risk.
Comparing the infectious aerosol concentration density in the room with the aerosol concentration densities exhaled by the infectious individuals as a function of d0 (see top-right panel of Fig 3); we can see how as d0 increases, the probability of an aerosol being infectious increases (infectious aerosol concentration density decreases slower after the first peak than the exhaled aerosol concentration densities) but at the largest d0 > 15 μm the increasing α due to stronger gravitational settling causes the infectious aerosol concentration density to grow slower after the trough than the exhaled aerosol concentration densities from the infectious individuals (including the speaking individual who is not wearing a mask). To see the latter, the strengths of the sinks α and total sinks α+ kγ are shown in Fig 4 and we can see that settling causes α to increase by over a factor of 10 from 100 nm to 50 nm. Fig 4 additionally shows the increase in the total sink strength for the largest multiplicities Mc being considered due to inactivation. The large difference between the total sink strength between k = Mc and k = 1 makes the system of ODEs stiff.


Sink strength by bin.
The strength of the sink terms for each bin with 80 bins, which is α without inactivation, α + γ for k = 1, and α + Mc γ for k = Mc (different values for Stage 1 and 2).
The pathogen concentrations as a function of d0 and k right after the beginning and at the end of each Stage are shown in S2 Fig. For large diameters, the concentrations at the beginning of each Stage are initially in a narrow band around the expected multiplicity in each diameter bin but by the end of each Stage the distributions have widened downward as inactivation fills in the lower multiplicities.
The results of choosing different numbers of bins (5, 20, and 80) is shown in S3 Fig. The difference in the concentration densities between 5 bins and 20 bins is substantial, but the difference between 20 and 80 is small. This means that in our example; for concentration densities, 20 bins is sufficient to capture the variation in α(d0) and βk(d0) with respect to diameter, but 5 is too few and 80 is a lot more effort for little gain. But for the
We consider a few hypothetical examples to ellucidate the importance of multiplicity in the dose-response using the corrected exponential model in Eq (12). Another dose-response model could be chosen and the resulting values would differ, but the general pattern would be the same.
First, let’s reconsider the example case but with all pathogen production forced to be mono-multiplicity. We set the new


Effect of ignoring multiplicity, full version.
Full version of Fig 1 with more ρp and the effect of masks. Plot of the ratio of the time required to reach a 50% infection risk when multiplicity is ignored τ50,ignore to when it is fully accounted for τ50,full for different respiratory tract fluid pathogen concentrations ρp. We are considering the same situation as in the worked example, but at steady-state with just the speaking mask-less infectious individual and the risk to a susceptible individual whose exposure starts after steady state is reached. The ratio is shown for different combinations of mask on the susceptible individual (none and simple2) and for different r. The legend lists the r, mask combinations in the same order as the lines from top to bottom. We assumed a 100 nm diameter spherical pathogen and used 80 diameter bins and chose the Mc (maximum multiplicity considered) heuristic threshold to be T = 0.01 (include 99% of pathogen production).
The underestimation increases with increasing ρp and r, and decreases when wearing a mask that is more efficient at filtering large aerosols than small aerosols. The largest aerosols have the greatest multiplicities, which means that a mask that filters them out better than small aerosols reduces the effect of ignoring multiplicity. As ρp increases, the expected multiplicity range for each d0 increases which makes ignoring multiplicity underestimate τ50 more. For the r values considered here, ρp ≤ 109 cm-3 underestimates τ50 by at most 20% and ρp ≤ 108 cm-3 underestimates it by at most 12%. But for ρp = 1011 cm-3, the underestimation is up to 67%. To better understand these patterns, we need to consider two more hypothetical situations.
Let the average pathogen dose be 〈Δ〉 = r−1 and all infectious aerosols have the exact same multiplicity k. Then, the μ for all other multiplicities is zero and μk = 〈Δ〉/k. Essentially, we are dividing the same number of pathogen copies among fewer and fewer aerosols as we increase the number of pathogen copies in each one. The mean infection risk for this constant average dose is shown on the left side of Fig 6 as a function of k for four different r. As the multiplicity increases, the mean infection risk decreases even though the average dose is the same. For k ≪ r−1, the effect of multiplicity on


Multiplicity’s impact on infection risk.
Plots of mean infection risk (
Another way to see this is to consider another hypothetical. Let’s consider the mean infection risk if all aerosols have multiplicity k as we vary r〈Δ〉 for fixed r. This is shown on the right side of Fig 6 for r = 10−2. For low k ≪ r−1, the infection risk curves are nearly identical. For k ≥ r−1, the infection risk decreases for increasing k.
Overall, this means that if the typical infectious aerosol multiplicity is on the order of or greater than r−1, there can be a significant decrease in the infection probability for the same average dose. This has implications for large aerosols when the respiratory tract fluid pathogen concentration ρp,j is large. Large aerosols where 〈k〉 ≳ r−1 will contribute less to the infection risk than would otherwise be expected from their resulting average pathogen dose 〈Δk〉. While we must have Mc > 〈k〉, Mc is usable as a proxy for which diameters the multiplicity causes a substantial correction to the dose-response. If we were to consider r = 2.45 × 10−3 as was done in the example, Fig 2 shows that this would be important for d0 > 15 μm for a high viral concentration of ρp,j = 1011 cm-3 and d0 > 30 for the lower but still high viral concentration of ρp,j = 1010 cm-3. If we were to consider r = 5.39 × 10−2 as was also done in the example, Fig 2 shows that this would be important for d0 > 5 μm for a high viral concentration of ρp,j = 1011 cm-3 and d0 > 10 for the lower but still high viral concentration of ρp,j = 1010 cm-3.
Going back to the risk overestimation from ignoring multiplicity in Fig 5, decreasing r decreases the underestimation in τ50 because the ratio of the average multiplicity in the larger diameter bins to r−1 is smaller. A mask that filters large aerosols better than small aerosols reduces the effect of ignoring multiplicity because larger aerosols have higher multiplicities.
We introduced the sink terms αC,f for filtering by the individuals in the environment as they inhale aerosols with many being absorbed by their mask or respiratory tract rather than being exhaled back out into the environment. To determine when this sink matters, we need to consider the total volume of air that is filtered, ignore the filtering efficiencies, and compare it to the ventilation. The volumetric rate of air filtration by the individuals normalized by the volume of the environment is

The mean adult breathing rates from sedentary/passive to high intensity activity ranges between 0.25 m3 hr-1 and 3.2 m3 hr-1 [14]. For sitting, it would be hard to get σA to be more than 1 m-2 but it would be possible while standing (some public events) though the well-mixed assumption would be breaking down in either case. For a typical room height of 〈h〉 = 4 m, this density limit would yield max(qp)∈[0.063, 0.8] hr-1. If the environment is poorly ventilated (total ventilation rate qv + qo + qr less than 1 hr-1), this high people density would mean the filtering effect of the people would not be negligible compared to the ventilation. But with even moderate ventilation, the contribution of αC,f would be negligible unless all the ventilation is circulating ventilation (qo = qr = 0) with no filter or a very poor filter. For 1.5 and 2 m social distancing, the maximum σA are 0.14 and 0.080 m-2 respectively. For a typical room height of 〈h〉 = 4 m, this density limit would yield max(qp) ∈ [0.005, 0.11] hr-1 which would be negligible in almost all circumstances. For taller rooms, the contribution would be smaller if the total ventilation rate is held constant.
If the fraction of individuals who are infectious is held constant, then NI ∼ σA. Since βk ∼ NI and αC,f ∼ NC but the non αC,f terms of α stay constant, the source increases faster than the sinks meaning that nk increases and therefore
The filtering effects of masks show up in the source βI,k, the sinks αC,f, and the total dose over time μj,k. Masks can substantially improve the total filtering efficiency of the people in αC,f since aerosols have to pass through the mask twice, once on inhalation and again on exhalation at a larger diameter (many masks are better at filtering larger diameters than small diameters). But unless the ventilation is poor and there are a lot of people, this increase in αC,f will have only a small effect on the total sink α. Instead, the main contribution is to reducing βI,k and μj,k which are both linearly proportional to the mask survival efficiency, which can be seen in the example situation.
In the example during Stage 1, there is one infectious individual in the room who is not wearing a mask and the total pathogen concentration reaches about 40 m-3 after 3 hr (Fig 3). During Stage 2, an addition infectious individual has entered the room. The second infectious individual’s ρp is 10 times greater than the first person’s and they are breathing at 4 times the rate; which would mean 40 times the pathogen exhalation rate by itself. Additionally, they are coughing rather than speaking, with the resulting larger exhaled aerosol concentration density ρj (top-right panel of Fig 3); which increases the number of exhaled pathogen copies further. But, they are wearing a mask which reduces the number of infectious aerosols that survive to reach the environment by a factor of 20–100 depending on the diameter. Due to this, the total pathogen concentration doesn’t increase by a factor of over 40 but instead approximately triples, reaching approximately 140 m-3.
The reduction in the average dose μj,k and therefore infection risk
Let’s consider the case where all infectious individuals have the same mask survival efficiency and all susceptible individuals have the same mask survival efficiency. If the effects of masks on α is negligible (αC,f is generally small compared to the other sinks) and βr,k is negligible; the combined effect of both infectious and susceptible individuals wearing masks on the dose is quadratic in the survival efficiencies, which has shown up in other Wells-Riley formulations in the past [7, 10]. Due to superposition of sources, nk ∼ SI,m,out since βI,k ∼ SI,m,out. Then, μj,k ∼ SS,m,in nk ∼ SS,m,in SI,m,out, which is a quadratic term. Now αC,f ∼ SC,m,in SC,m,out makes the effect stronger (usually only slightly stronger) than quadratic since it only serves to increase α and therefore decrease nk further. If everyone wears masks with the exact same survival efficiency S for both inhalation and exhalation that is constant with respect to d0, then if exposure starts at steady state, μj,k ∼ Snk,∞ ∼ Sβk/α ∼ S2/(1 − cS2) where c ∈ [0, 1) is a constant that depends on the relative importance of the αC,f in the total α. In this form, it is easier to see how μj,k scales super-quadratically in the mask survival efficiency. If just the susceptible or just the infectious individuals wear masks, the reduction drops to being stronger than linear (direct contribution of the mask on reducing βI,k or reducing μj,k plus the effect on αC,f). If only non-susceptible non-infectious individuals wear masks, there is still a reduction in the dose but it is small since αO,f is generally small compared to the other sinks, giving a sublinear reduction.
The biggest limitation to the model presented here, like all Wells-Riley formulations, is the well-mixed environment assumption. In almost all indoor environments, the assumption breaks down to varying degrees—the infectious aerosol concentration densities at the locations of susceptible individuals and all sinks (except possibly inactivation) depend on their locations in the environment relative to the sources and the air flow. Social distancing helps with this assumption (reduces direct inhalation of undiluted exhaled puffs of aerosols from infectious individuals), but the assumption is still often dubious.
In situations where people, other sources, and localized sinks (or their outputs) are located close to each other; corrections to nk(d0, t) must be applied at the location of the individual, other source, or sink. Here, we will qualitatively discuss what simple partial corrections that don’t depend on the history of nk(d0, t) would look like. For proximity to the output of filtering sinks, a multiplicative correction would need to be applied with a factor between the sink’s filtering efficiency and one, inclusive, that depends on the location and the properties of the sink such as the flow rate. For proximity to the output of ventilation, the respective filtering efficiency is Ev(w(d0,t)d0). For proximity to individuals, the respective filtering efficiency is 1−SC,m,in,j SC,r,j,k SC,m,out,j,k. For proximity to sources, the correction would be to use a weighted average of nk(d0, t) and the concentration of the air coming from the source/s with the weights depending on the location and the nature of the source flows and mixing, such as flow rates. For close proximity to ventilation coming from other rooms, this would mean a weighted average with nr,k(d0, t) (if there is more than one room, it would be the concentration coming from the room/s whose air is not yet diluted at the location). For close proximity to infectious individuals, this would mean a weighted average with nI,j,k(d0, t)[1 − EI,m,out,j(d0)]. These partial corrections could be done for specific cases (e.g. susceptible individual 2 is 1 m directly in front of infectious individual 5) or in a statistical way if the pair correlation functions between individuals of each two categories (including in-category) as well as the equivalent correlation functions for relative angles of orientation by distance. More extensive corrections could depend on the history of nk(d0, t) and would turn the system of ODEs into a system of Delay Differential Equations (DDEs) or Integro-Differential Equations (IDEs), which would most likely be much harder to solve. At some point, however, it could be easier to do a full fluid and aerosol dynamics treatment.
Any corrections developed for mono-multiplicity Wells-Riley formulations could either be used as is or could be adapted to the poly-multiplicity model presented in this manuscript. Full fluid dynamics simulations with infectious aerosols simulated as passive scalars or as discrete aerosols such as those done by Löhner et al. [40] are the common way to address this limitation entirely and can be used to develop corrections, which are considerably more difficult. Further investigation is needed to find simple approximate ways to generalize the Wells-Riley formulation presented in this manuscript for non-well-mixed environments that are easier than full fluid dynamics with suspended aerosols simulations.
Another limitation of the model presented here is that it assumes that all infectious aerosols have the same ζ and solute composition, and therefore the same w(d0, t). This is more easily circumvented in one case. If the solute concentration and composition is constant over time for each individual source (reasonable assumption over small time spans), the model can be solved for each source individually and then the resulting nk and μk,j summed over the individual solutions. This would also be the solution if ζ varies in different locations in the respiratory tract where infectious aerosols are produced for an infectious person. If ζ changes over time for the sources but the solute composition is constant, then one could generalize the model to additionally track ζ (or equivalently dD) and initial diameter at production d0 separately.
Another problem is the choice of diameter limits d0 ∈ [dm,k, dM] for each multiplicity. We have neglected the fact that the solute concentration is much greater for d0 near the lower limit dm,k as pathogen copies are taking up a very large fraction of the volume and that surface effects may cause additional deviations in the number of pathogen copies in the aerosol from a Poisson distribution. Further work is needed to lift this limitation; though for small pathogens, the total fluid volume and therefore pathogen content in the smallest aerosols where this matters is much less than that of the larger aerosols (see top-right panel of Fig 3) meaning that the effect could be small for small pathogens.
The upper limit dM is the cutoff where aerosols are so large that they are more ballistic and either settle to the ground before evaporating to equilibrium or still settle too quickly to be mixed even after evaporating to their equilibrium diameter. Based on Xie et al. [25] and Chong et al. [21], we suggested a value dM = 50 μm. To look at it, we took the example case and re-calculated it for 23 equal log-width bins between 100 μm and 100 μm and considered the concentration densities and mean infection risks if the top 0, 2, and 4 bins were discarded, thereby setting decreasing dM to 100 μm, 54.8 μm, and 30.1 μm. The time step for the numerical solution had to be reduced to 5 × 10−6 hr due to the increase in Mc at the larger dM. This is shown in Fig 7. Increasing dM increases the total pathogen concentration being tracked since a lot of exhaled respiratory tract fluid volume is contained in the large diameter aerosols, but the total number concentration does not increase much since these big aerosols are few in number. For the larger r = 5.39 × 10−2, the effect on


Effect of upper diameter limit dM.
The example situation was calculated for different values of the upper diameter limit dM (technically, calculated at the largest and then truncated down as needed). (Left) The total pathogen and infectious aerosol concentration densities over time for each dM. Note that the differences in the total infectious aerosol concentration density are so small that the lines are right on top of each other. The mean infection risk for each combination of masks on a susceptible individual (none, simple1, simple2) for (Middle) r = 2.45 × 10−3 and (Right) r = 5.39 × 10−2.
The number of pathogen copies in infectious aerosols must be taken into account if the number of pathogen copies in poly-multiplicity aerosols is not negligible compared to the number of pathogen copies in mono-multiplicity aerosols. We have generalized the Wells-Riley formulation and two common dose-response models (exponential and beta-Poisson) for poly-multiplicity aerosols and shown how to generalize other dose-response models. The generalized Wells-Riley formulation tracks infectious aerosols for each multiplicity individually rather than quanta as is traditional, which then can be put into the generalized dose-response model of choice. The generalized Wells-Riley formulation results in a linear inhomogeneous coupled system of ODEs, one for each multiplicity, at each initial aerosol diameter at production d0 (or bin of d0). The general solution is presented; along with simplified versions for time independent sources, sinks, and humidity and splitting the diameter range into bins. The model is accompanied by an example case for for a poorly ventilated room with SARS-CoV-2, which is presented and solved. The example illustrates how the cutoff multiplicity Mc is determined, the effects of bin size on the solution, and the effects of mask usage on the infection risk. Additional takeaways are
Ignoring multiplicity causes the infection risk to be over-estimated, which is particularly signficant for high respiratory tract fluid pathogen concentrations and high single-pathogen infection probabilities (see Fig 5).
The people in the environment filter the air by breathing, which increases the loss rate for infectious aerosols and is included in the model.
Facemasks on everyone cause a stronger than quadratic reduction in the inhaled dose by susceptible individuals
In summary, we have developed a tractable generalization of the Wells-Riley model for the infection risk from any airborne disease in well mixed indoor environments applicable to both mono- and poly-multiplicity aerosols.
We would like to thank Oliver Schlenczek for important discussions early in the development of the model, Jan Moláček for comments and discussion during editing, and Hani Kaba and Simone Scheithauer for useful references and discussing those references.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45