The authors have declared that no competing interests exist.
Wheat (Triticum spp.) gluten consists mainly of intrinsincally disordered storage proteins (glutenins and gliadins) that can form megadalton-sized networks. These networks are responsible for the unique viscoelastic properties of wheat dough and affect the quality of bread. These properties have not yet been studied by molecular level simulations. Here, we use a newly developed α-C-based coarse-grained model to study ∼ 4000-residue systems. The corresponding time-dependent properties are studied through shear and axial deformations. We measure the response force to the deformation, the number of entanglements and cavities, the mobility of residues, the number of the inter-chain bonds, etc. Glutenins are shown to influence the mechanics of gluten much more than gliadins. Our simulations are consistent with the existing ideas about gluten elasticity and emphasize the role of entanglements and hydrogen bonding. We also demonstrate that the storage proteins in maize and rice lead to weaker elasticity which points to the unique properties of wheat gluten.
During the breadmaking process, expanding gas bubbles cause the dough to increase volume. Gluten proteins act as an elastic scaffold in that process, allowing the wheat dough to grow more than other kinds of dough. Thus, explaining the unique viscoelastic properties of gluten at the molecular level may be of great interest to the baking industry. Assessing the structural properties of gluten is difficult because its proteins are disordered. We provide the first molecular dynamics model of gluten elasticity, that is able to distinguish gluten and proteins from different plants, like maize and rice. Our model shows the structural changes the gluten proteins undergo during their deformation, which mimics the mixing of dough during kneading. It also allows for a determination of the force required to extend gluten proteins, as during baking. The data confirms existing theories about gluten, but it also provides molecular-level information about the extraordinary elasticity of gluten.
When a dough from the wheat flour is gently washed in water, the cohesive mass that remains is gluten. Seed storage proteins constitute over 80% of this mass [1]. They exhibit viscoelastic properties that are vital for the properties and quality of bread [2]. Here, we neglect other substances present in real gluten samples and conduct molecular simulations of gluten proteins only, trying to recreate and study the polymeric network they form in gluten.
Gluten proteins have sequences with repetitive low complexity fragments that are rich in glutamines and prolines [3]. As a result, they do not possess any preferred tertiary structure [4–6] and are thus (mostly) intrinsically disordered (IDPs).
They can be divided into two fractions: glutenins (which can form large complexes, bound covalently through disulfide bonds) and gliadins (monomeric proteins that constitute around 70% of the total gluten protein mass) [1, 3].
On average, glutenin chains are longer [3] (see also Table 1), and their complexes may be stabilized also by entanglements [7]. Glutenins are thought to be the key ingredients of gluten elasticity [8], which is crucial in the breadmaking process [2]. Gliadins can be considered as a viscous plasticizer of the polymer network made by glutenins [7].

| system | <nCys> | <N> | ΣN | z |
|---|---|---|---|---|
| maize | 6.1 | 221 | 3504 | 5.5 ± 0.1 |
| gliadins | 6 | 290 | 4322 | 6.88 ± 0.03 |
| glutenins | 7.2 | 445 | 3985 | 6.55 ± 0.08 |
| gluten | 6.5 | 384 | 4271 | 6.65 ± 0.07 |
| rice | 3.6 | 208 | 3503 | 5.62 ± 0.09 |
The observed viscoelastic behavior of gluten implies the existence of properties that are both solid-like (an elastic response to deformation) and liquid-like such as the viscous drag that arises from irreversible deformations. These properties can be characterized by the dynamic Young modulus G*. It is one of the few quantities that can be used to directly compare simulations to experiment. We propose to use a one-bead-per-residue coarse-grained Dynamic Structure-Based (DSB) model [9, 10] that is a a generalization of the structure-based (or Go-like) models to the case of the intrinsically disordered proteins. In the DSB model, the contacts between residues are determined from the instantaneous shape of the backbone and not from any reference structure. As a result, the contacts acquire a dynamic character—they can be born and then disappear, making their list fluent.
The DSB model captures the most important features of gluten and agrees qualitatively with available data. The main limitation of the model is the usage of time scales that are substantially shorter than those used in the experiments. Therefore, we do not expect to match the actual experimental values, especially if they are frequency dependent. Our protein model enables milisecond-scale simulations of biochemically large protein systems and incorporates the amino acid specificity. Our molecular model allows us to determine the relative importance of different factors that affect the properties of gluten. This work is especially important because gluten proteins are insoluble and disordered [11] so one cannot infer their properties from any structural features. The only simulations of gluten proteins we could find considered merely single chains instead of many protein complexes [12, 13]. However, the crowding conditions that are present in gluten are expected to affect the single-chain behavior. We study systems of ∼ 4000 residues consisting of about 14 chains. This allows us to check the validity of existing theories of gluten structure on a molecular level and illustrate them in simulations. These theories have identified three main factors that contribute to gluten’s elasticity:
The formation of hydrogen and disulfide bonds: in every gluten protein over 30% of amino acids is glutamine, whose side chain can be both a donor and an acceptor of hydrogen bonds [2]. Disulfide bonds can hold gluten proteins together and generate a sort of a polymer gel [8];
The loops-and-trains model [4, 14] explains gluten elasticity in terms of “loops” that are created by neighboring polymers: they stay together because of hydrogen bonding, but when some bonds are broken (e.g. by water), a free space emerges between the polymers. Under stretching, cavities disappear and hydrogen bonds are made between elongated and nearly parallel chain fragments, forming “trains” that are formed along the direction of stretching and cause resistance to further stretching;
The entangled nature of the polymer network itself leads to a strain resistance, regardless of its chemical nature [7].
We can study these phenomena in our molecular dynamics model by measuring the number of inter-residue contacts described by the Lennard-Jones (L-J) potential, the number of disulfide bonds (that can form and rupture dynamically), the number and size of cavities (as determined by using the Spaceball algorithm [15, 16]) and the number of entanglements (using the Z1 algorithm [17–20]). Structural properties, like the typical distance between the ends of a protein chain, can also be evaluated. We have also computed the mechanical work required to stretch gluten.
We consider gluten and the systems of its different fractions separately in order to elucidate the differences between these proteins. In addition, we also consider a control sample of proteins that do not exhibit such extraordinary viscoelastic properties [21]. Specifically, we use proteins derived from maize and rice for this task. Thus, altogether, we study 5 different systems. We show that many of the properties of gluten are substantially affected by “kneading” which we immitate by introducing periodic deformations.
Our molecular dynamics model is described in details in ref. [9]. The description, however, has not dealt with the formation of the disulfide bonds that are important for gluten.
Briefly, we use periodic boundary conditions in two directions (X and Y) and solid walls in the third direction, denoted as Z. The starting conformations of the proteins are constructed by generating self-avoiding random walks. They are then evolved by the methods of the coarse-grained molecular dynamics that were shown to correctly recreate properties of a set of intrinsically disordered proteins [9]. Rheological information is then obtained by deforming the box containing gluten proteins and recording the resulting force of response (see Subsection, “Simulation protocol”).
We simulate gluten as a set of protein chains in an implicit solvent. Each amino acid is represented by one pseudoatom and subsequent residues in a chain are connected by a harmonic potential. Chain stiffness is introduced through bond angle and dihedral angle potentials as obtained from a random coil library [9, 22]. These angle potentials have different forms based on the identity of residues constituting a given bond or dihedral angle: proline and glycine are treated as special cases. These particular amino acids are abundant in gluten [3].
Electrostatics are governed by the Debye-Hückel potential with relative permittivity that is dependent on the distance [23]: κ = 40 nm−1 r. Other nonlocal interactions are modeled by the 6-12 L-J potential, which can be attractive (in its full form) or repulsive (truncated at the minimum and shifted). Contacts between residues form and disappear dynamically based on different criteria. The criteria have been developed by studying atomic-level overlaps that occur in contacts arising in structured proteins. We distinguish between contacts arising from sidechain-sidechain (ss), sidechain-backbone (bs) and backbone-backbone (bb) interactions. However, the resulting depth of the potentials well, ϵ, is common, no matter what is its origin. Contacts involving sidechains are amino-acid specific. The specificity is introduced primarily through the locations of the potential minima. Some gluten proteins are thought to be partially structured, so a part of the contacts are always attractive.
The Dynamic Structure-Based model has been introduced in ref. [9] and here we just give a brief summary. We label the interacting pseudoatoms by indices i and j. Their positions are and

We solve the equations of motion by using the 5th order predictor-corrector algorithm [25]. Energy unit ϵ is of order 1.5 kcal/mol [26, 27]. The room temperature is then about 0.3–0.35 ϵ/kB, where kB is the Boltzmann constant. Here, the simulations are run at 0.3 ϵ/kB. The results are presented, approximately, in the metric units.
The excluded volume of the residues (whether involved in a contact or not) is ensured by the L-J potential that is cut at ro = 0.5 nm (so that Vr(ri,j ≥ ro) = 0):

The distinction between the bb, bs and ss contacts in our one-bead-per-residue model is accomplished in the following way. When two beads come sufficiently close to each other, we compute two vectors based on positions of three consecutive beads (i-1,i,i+1):

For modeling disulfide bonds, we considered using harmonic potential that is switched on and off, following Thirumalai et al [32]. However, we have found out that it is equivalent but much simpler to use the L-J potential with the strength of 4 ϵ and
The disulfide bond potential is turned on and off in the same manner as other ss contacts, i.e. adiabatically within approximately 10 τ. It should be noted that the real timescales of the disulfide bond formation and their duration are much longer [33]. The only modification besides the amplitude change is that after two cysteins form such a contact, they cannot form additional disulfide bridges (without a prior rapture of the existing bridge).
Some of the gluten proteins contain domains near their termini that tend to be more structured than the whole chain. For example, homology of the A domain of high molecular weight glutenins to a plant inhibitor of digestive enzymes [34] suggests that one or two pairs of cysteines in that domain form fixed intra-chain disulfide bridges. We incorporated available information about possible structured parts of gluten by using a structure-based Go model for those regions [26]. The identification of segments that are recognized as structured domains was based on the UniProt [35] partition of gluten proteins into domains (see Section 1 of the S1 Text for details). For the backbone stiffness of the structured segments we used the same potential as for the unstructured parts.
The actual structures in these segments were predicted by using the ITASSER server [36, 37]. The contact maps were generated based on those all-atom structures, using the overlap criterion (spheres corresponding to heavy atoms of the residues in contact must overlap [26]). The contacts present in the contact map in the structured regions are always attractive (they are “static”, in contrast with the “dynamic” contacts operating in the disordered parts) and the L-J potential describing them has a minimum that corresponds to the distance between the Cα atoms in the reference all-atom structure.
Residues listed in the “static” contact map can still make “disordered” contacts with the residues outside the contact map. The number of “static” contacts made by a residue is determined by the same r < rf criterion, but the potential does not switch off if r > rf.
We performed simulations of 5 different systems specified in Table 1. The exact protein composition of all simulated systems is presented and described in detail in Section 1 of the S1 Text.
While cereal storage proteins are still in the grain, they are usually located in the protein bodies or protein storage vacuoles [38]. After grain maturation, these deposits can fuse and form a continous network [1]. This is reflected in our simulation protocol: in the beginning, the proteins are in a low-density box and have time to attain their shapes separately [39]. Then, as the box dimensions get smaller, different chains connect together, forming one big cluster. Unfortunately we cannot reproduce drying and hydratation in an implicit-solvent model, but the final density of the system reflects the density of real gluten samples [40]. Mixing and kneading of the flour and the resulting dough is represented by the periodic deformations of the box, that can also be compared to the experimental deformations of (much larger) gluten samples in a rheometer [41].
The first step of the simulations is generating the chains as self-avoiding-random-walks positioned randomly in a box with density ρ = 0.1 res/nm3 (residues per cubic nanometer). If an initial random walk configuration happens to cross the boundary of the box, the box is enlarged to keep all chains inside, resulting in a slighly lower initial density. The density of 0.1 res/nm3 is more than 30 times lower than the density of gluten, which should ensure that proteins are in monomeric or dimeric form during the initial equilibration (see inset 1 in Fig 1). Boundary conditions along the X and Y axes are periodic, whereas the walls along the Z direction are repulsive. They are separated by the distance of s. The residues are repelled from the wall by the repulsive part of the L-J potential with the depth 4ϵ.


Time dependence of the distance s separating the two opposite box walls in the Z direction.
The blue curve shows the force exerted on the system by those two walls (summed over all the residues and averaged over 100 ns time interval). The data is for gluten (defined in Section 1 of the S1 Text), with the density of 3.5 nm−3. The period of oscillations is 40 ms. The six insets correspond to the snapshots of the system taken at different stages of the simulation (each bead represents one amino acid, the protein chains are shown in different colors). The same coordinate system (top left) is used for all 6 insets.
After equilibrating the system for 125 μs the box dimensions are reduced from the initial size with the speed of 2 mm/s, after the density ρ0 (usually corresponding to the real density of gluten [40] ρ0 ≈ 3.5 res/nm3) is reached (see Section 3 of the S1 Text for more information about the density). At this stage, the distance between the repulsive walls has a value that is denoted by s0. The system is then equilibrated for another 150 μs.
After the second equilibration stage (depicted in inset 2 in Fig 1) the walls along the Z direction start to attract the proteins. When the distance between any residue and the wall gets closer than dmin = 0.5 nm, the attractive part of the L-J potential with depth of 4ϵ is turned on adiabatically (in the same manner as for disulfide bonds). Then the interaction between the wall and a residue attached to it is described by Vwall = 4VLJ(ri,w), where ri,w is the distance between the ith pseudoatom and an ith interaction center on the wall. The X and Y coordinates of the ith interaction center are fixed at the values of the first bead attachment (when it came to the wall closer than dmin = 0.5 nm, which is the distance corresponding to the minimum of the Vwall potential). The Z coordinate of an interaction center is 0 on one rigid wall and s on the opposite wall. If a wall moves so that s ≠ s0, the interaction centers move together with the wall. If the bead departs away from the interaction center by more than 2 nm, it is considered detached and the interaction center disappears. The bead may reattach at another place later. For d < dmin the repulsive part of the L-J potential is always turned on to ensure that no residue can pass through the wall. The force acting on the walls is a sum of forces that individual residues exert on both walls. It is averaged over 100 ns interval to filter the thermal noise out. An alternative way to introduce interactions with the walls (involving the fcc lattice of psudoatoms) is discussed in Section 4 of the S1 Text.
After the wall attraction is enabled, we give another 150 μs for the equilibration of the system (inset 3 in Fig 1). After this third equilibration stage, when the attraction is already turned on, the walls in the Z direction are moved to the distance of s = s0 − A, where A is the amplitude of the oscillations in the normal direction (we set A = 1 nm). After being moved to the minimum (shown in inset 4 in Fig 1), the wall position changes periodically: s(t) = s0 − A cos(ωt). The wall at Z = 0 stays fixed. In the experiments, ω ∼ 1 Hz [2], which is not possible to achieve in our simulations. Our default period is 40 μs, corresponding to f ≈ 25 kHz. The maximum displacement is shown in inset 5 in Fig 1. For shearing simulations, the walls are also displaced with the same amplitude, A, but the oscillations take place in the X direction and are described by s′(t) = −A cos(ωt) (where s′ = 0 corresponds to one rigid wall being directly above the other). See S1 Fig for an illustration.
After oscillations in the normal direction (or after additional 100 μs of equilibration if no oscillations were made) the system is equilibrated again for 75 μs and then the walls in the Z direction extend gradually to 2s0 so that the total work required for such an extension can be measured, as well as the distance and force required to break the system apart (recreating conditions for measuring uniaxial extension [41]). Inset 6 in Fig 1 depicts this elongation. The speed of elongation is 0.5 mm/s, which is a typical pulling speed employed in AFM stretching experiments in coarse-grained simulations [26, 42]. The maximum speed during oscillations (vmax = Aω) is even lower.
Based on refs. [3, 8, 11, 43–49] we chose proteins that were representative for each of the five systems we study: gliadins, glutenins, gluten and storage proteins from maize and rice. We then used the UniProt database [35] to select the corresponding sequences. The resulting compositions are described in Section 1 of the S1 Text. Table 1 summarizes the key properties of each of the systems studied. A large number of cysteines per chain should increase the propensity to form inter-chain disulfide bonds. On the other hand, longer chains should make entanglements easier. Finally, the “stickiness” of the system is reflected in the coordination number z, which is defined as the mean number of contacts made by each residue (both “dynamic” and “static” combined).
We performed three types of simulations: a) without any oscillations; b) with oscillations with deformation in the normal (Z) direction; c) with oscillations with deformation in the shear (X) direction. All of the properties are calculated in the stage after oscillations (in the absence of oscillations—after the equivalent time is passed). The parameter z does not seem to be affected by the oscillations.
However, the values of 5 other quantities depend on the prior history of the system. Those quantities are: the number of entanglements lk, the maximum force Fmax, the mechanical work Wmax required to stretch the system, the number of the interchain contacts ninter and the RMSF (root mean square fluctuation) as averaged over all residues and determined as a deviation from a mean location of the residue.
It should be noted that each of the systems studied is simulated in a somewhat different box (to achieve the same density for different numbers of residues) and thus comparison of the absolute values of these quantities is not very meaningful. Instead, we analyze the ratios of the five quantities to their average values in the absence of oscillations (denoted by the tildas). Fig 2 shows these ratios for the case of the shearing oscillations whereas S3 Fig for the uniaxial oscillations.


The ratio of 5 different properties calculated after 5 shearing oscillations (in the nominator), and without any oscillations (in the denominator, marked by ~).
The properties are the average number of entanglements lk, the maximum force Fmax during the elongation in the last simulation stage and the maximum work Wmax required to elongate the system in that stage, the number of inter-chain contacts ninter and RMSF (root mean square fluctuation) averaged over all residues. The ratio of 1 is marked by a broken line. Each color corresponds to a different system, as displayed at the top, and defined in Section 1 of the S1 Text. The system density is 3.5 nm−3.
The first of the quantities studied is the total number of entanglements. We have determined lk by using the Z1 algorithm [17]. An entanglement occurs if a path connecting the first and last residue in a chain cannot be shortened to a straight line without crossing a path from another chain [18, 19]. After minimizing the lengths of all such paths with the constraint that they cannot cross one another, each kink, where two paths intertwine, counts as an entanglement [20]. An example of an entanglement is shown in inset 6 in Fig 1, where the end of the grey chain (a low molecular weight glutenin) crosses a loop made by a blue chain (a high molecular weight glutenin). Due to the low number of the chains, each snapshot of the simulation usually has only a few entanglements. We averaged lk over all snapshots made after oscillations.
We use a deterministic random number generator, which produces the same output when provided with same initial random number, called “random seed” [50]). The simulations for each system were repeated at least three times with a different random seed. The same set of random seeds was used for the shearing, pulling and equilibrium simulations. The result shown in Fig 2 uses the mean values from all simulations, but the error of the mean is quite high. Nevertheless, a clear increase is observed for gluten: the oscillating deformations cause more entanglements. We did not include self-entanglements (knots) in Fig 2, but they are also present in gluten: some long proteins can form very deep knots (see S5 Fig), that can last for over 100 μs.
The differences between the studied systems are most pronounced in the case of Fmax, the maximum force required to stretch the system in the last stage of the simulation (illustrated in inset 6 in Fig 1). The blue curve in Fig 1 shows how the force rises and then drops near the end due to the rapture. However, different random seeds may lead to different force curves for the same system. Fig 3 shows three stretching force curves for three different simulations of gluten. Each curve has a number of force peaks. In order to determine the characteristic largest force, Fmax, we take 5 largest force points and calculate their mean value. These mean Fmax are also indicated in Fig 3 as dotted lines surrounded by strips that represent their standard deviation. The Fmax used in Fig 2 is a weighted average of those values (from simulations with different random seeds), where the weights are calculated from the standard deviation of the mean. Large glutenins experience the largest, 5-fold increase in Fmax after oscillations, while for gliadins there is even a very slight decrease. The rise in Fmax for gluten is substantially bigger than for corn and rice proteins.


The force exerted on gluten with the density of 3.5 nm−3 (top panel) during elongation in the last stage of the simulation and the resulting work (bottom panel) as a function of the distance between the rigid walls.
The three trajectories are shown in different colors. The dotted lines correspond to Fmax and Wmax for a given trajectory. For Fmax the strips around the dotted lines represent the uncertainty resulting from averaging Fmax over 5 points with the biggest force. The oscillation period is 40 μs. For F, the thin lines represent raw simulation data and the thick lines are smoothed.
The last stage of the simulations includes moving the Z walls apart with a constant speed. The response force may be easily integrated over distance, giving the work required to stretch the system. The maximum work Wmax required to stretch the system is also bigger if oscillations were introduced before. Again, the difference is largest for glutenins, this time followed by maize proteins. Sometimes the force drops to near zero after reaching the maximum (meaning that the system was ruptured) and the work stops rising (as seen for the dark blue curves on Fig 3). There are other cases where a substantial mechanical work is still required for elongating the system after reaching Fmax (cyan curve), but the total mechanical work Wmax is correlated with the height of the Fmax peak. The proteins apply some effective pressure on the walls, however it is much smaller than the mechanical stress resulting from the stretching (the atmospheric pressure would exert 0.01 nN of force on the walls).
Even if entanglements cause sharp peaks in the force curves, the resistance against deformation is also controlled by the number of inter-chain contacts, ninter, that have to be ruptured during elongation. The biggest increase in ninter is observed for maize proteins. They have a lower number of contacts in total (as indicated by their coordination number z shown in Table 1), but their residue composition allows them to make more contacts due to the mixing effect of the oscillating deformations.
The oscillatory mixing effect is general and it operates in each of the systems studied. It results in a stronger network of proteins that are connected together more tightly, which results in the lowering of RMSF. Less mobile proteins may be more sturdy, but, as a trade-off, the maximum force may occur for a lower inter-wall distance s [51].
According to the “loops and trains” picture, the response of gluten to deformation consists of two stages: at the first stage the chains elongate and the cavities between them shrink, which requires smaller force than disentangling the chains and moving them alongside each other that takes place in the second stage [14, 52]. We tested these predictions by measuring how 6 different quantities change during the final extension of the simulation box (see Fig 4). The snapshots shown in this figure indicate that the “trains” do form, and that shearing between the chains drawn in red and blue may lead to a substantial force by disrupting and reforming the contacts between the chains. Indeed the number of inter-chain contacts decreases in the beginning of the elongation, but then it increases again as “trains” are formed. The chains become more rod-like and less globular, as indicated by the distortion parameter w [53] (averaged over all chains) that is equal to 0 for an ideal sphere, and 1 for an ideal rod. It is defined as


The time dependence of 6 different quantities that are measured for gluten (as defined in section 1 of the S1 Text) during the elongation in the last stage of the simulation.
The solid blue (orange) lines correspond to the quantities listed along the left (right) y-axis. The lines are smoothed, solid (hollow) dots represent raw data from simulation. The quantities analyzed are: the number of inter-chain contacts ninter and the average distortion parameter w, force F and work W, the maximum volume of a cavity
Formation of a “train” (two parallel chain fragments connected by hydrogen bonds) does not coincide with the maximum of the force, because Fmax occurs near the beginning of the elongation and is associated with the biggest decrease in the number of contacts. Nevertheless, the force does not drop to zero and the work required to stretch the system steadily increases, because the chains from the opposite ends of the box are still connected and cause further resistance against deformation.
In order to quantify the number and size of cavities that form in gluten, we used the Spaceball algorithm [15, 16], which uses a probe with a user-defined radius to determine a three-dimensional contour of the system. The volume of the cavities is computed by filling a grid with those probes and rotating the grid with respect to the system. Our system is quite large, so we used grid with 0.1 nm lattice constant, a probe with radius 0.38 nm and one rotation of the grid. The snapshots generated show that the biggest detected cavity is dispersed over the whole simulation box, but during the elongation it becomes associated with the set of proteins attached to the Z = 0 wall (for the trajectory shown). As they are more and more elongated, the size of the cavity decreases. Larger cavities become divided into many smaller cavities, which is reflected in the increase in nc.
Fig 4 illustrates just one simulation (the same properties for the same system, but with a different random seed, are shown in S6 Fig), but the volume of the biggest cavity indeed decreases after elongation in most cases (see S4 Fig). The chains become more rod-like not only during the final stretching, but also after the oscillations (see S2 Fig): deformations cause the proteins to be briefly stretched, but after the deformation ends, the proteins do not return to the exactly the same shape as before, but stay elongated in one direction (which may be called a memory effect).
One of the few fundamental (independent of the size and shape of the sample) rheological properties of gluten is the dynamic shear modulus G* = G′ + iG′′, measured by deforming the gluten sample with sinusoidal deformation (shear strain for G* and normal strain for E*) [41]. The real part (G′, elastic modulus) represents the in-phase part of stress response, the imaginary part (G′′, loss modulus) the out-of-phase response (phase difference δ is defined as

In Table 2 and Fig 5 we compare the results with experiment. Table 2 shows that gliadins are responsible for the viscous properties of gluten (big δ), and glutenins for the elastic part (small δ). Despite largely different values of the results, this trend is still visible in our simulations for the longest oscillation period (70 μs): tan δsim is biggest for gliadins and smallest for glutenins.


Dynamic shear modulus (top panel) and tan δ (bottom panel) as a function of oscillation frequency f.
Results are from our simulations (on the right) and from experiments (on the left). Purple points are from [57], red and light green from [56]. On the top panel, orange and dark green points are from [58]. All curves fitted to the data are of the form a ⋅ fb + c, where a, b and c are the fit parameters.

| System | G′ [kPa] | G′′ [kPa] | tan δ |
![]() |
![]() | tan δsim |
|---|---|---|---|---|---|---|
| Gluten | 2.5 | 1.3 | 0.5 | 6.0 ± 1.6 | 6.1 ± 1.7 | 1.0 ± 0.1 |
| Gliadins | 0.5 | 0.6 | 1.2 | 5.6 ± 1.7 | 8.0 ± 2.3 | 1.4 ± 0.1 |
| Glutenins | 40 | 9 | 0.2 | 8.3 ± 2.2 | 7.9 ± 2.1 | 0.95 ± 0.1 |
We use frequencies that are 4 orders of magnitude larger than in experiment, but some extrapolations can be deduced from the experimental frequency spectra of gluten’s G*. We used the results from two experimental papers [56, 57] that use hydrated gluten samples. Both predict that tan δ should be rising with frequency—see the bottom panel of Fig 5, where tan δsim is between the extrapolations predicted by [56] (green curve) and [57] (purple curve). Such large discrepancies are the effects of different gluten compositions, that greatly influence the results (we use the Aubaine dataset from [57]).
Growing δ means that G′ and G′′ converge, as seen in the top panel of Fig 5. Both G′ and G′′ are rising with f, however this rise is insufficient to explain why the values from simulation are 3 orders of magnitude larger. Either there is a significant non-linearity for higher frequencies and the extrapolation no longer holds, or this is a limitation of our model. We use implicit solvent which cannot account for differences in gluten’s hydration that may greatly affect the results (even “dry” gluten is slightly hydrated) [56].
On the other hand, the results of the simulation are in good agreement with the experiments done on gluten-based bioplastics (as shown on the top panel of Fig 5), where gluten proteins are mixed with glycerol and melted at over 380 K [58]. Even for bioplastics, the tensile strength (the stress needed to break the deformed system) is less than 1 MPa [58], while the Fmax divided by the wall area (which is around 100 nm2) can be more than 10 times higher (see Fig 3).
We have used our coarse-grained model with the implicit solvent to provide microscopic-level understanding of gluten and the related proteins. Kneading and mixing gluten samples is known to increase their tensile strength [59]. In our simulation, we mimick these actions by introducing periodic deformations. We have found that as a result of these deformations gluten and glutenins became significantly more resistant to stretching, as indicated by the rise in Fmax and Wmax. Our model is also able to correctly distinguish gluten from storage proteins from other plants, and to tell the difference between durable glutenins and viscous gliadins. This difference appears to come from the combined effect of entanglements and inter-chain contacts (that represent mainly hydrogen bonds between glutamines in the case of gluten). Maize proteins formed more inter-chain contacts than rice proteins (which may justify why maize flour is more useful for making gluten-free doughs [60]). Its chains were too short to gain additional mechanical resistance from being entangled.
The importance of entanglements and inter-chain hydrogen bonding is already well-known in the experimental literature [7, 8], but our simulations provide an additional confirmation of that. We also elucidated microscopic mechanisms of the loops-and-trains picture [14]. We have found that the decrease in the volume of the biggest cavity is accompanied by the rise in the number of the inter-chain contacts during stretching.
Due to the small system size only a few inter-chain disulfide bonds were simultaneously present during our simulations (most of them were observed for maize proteins) and they did not seem to influence the results significantly. It is possible that they may become more important at larger length scales, where covalent linkage of the chains affects properties like solubility [3]. However, for about 20 gluten chains, the entanglements and non-covalent contacts are sufficient to explain elasticity of the system.
We used the periodic deformation to calculate the dynamic Young modulus G*. We have observed that the theoretical frequency dependence shown in Fig 5 agrees with the experimental results for glycerol-plasticized gluten [58] better than with those for unrefined hydrated gluten [56, 57]. In our implicit solvent model glycerol may theoretically replace water as the solvent. However, our model was parameterized with the assumption of a watery solvent [9]. It should be noted, however, that the experimental range of the frequencies is orders of magnitude smaller than implemented here. Further simplifications are needed to access the longer time scales.
Another reason for the very high G* may be the coarse-graining itself: a coarse-grained pseudoatom has smaller volume than an all-atom amino acid. This may lead to diffrent elastic behavior, as was observed previously in the case of virus elasticity [61].
Despite the deciding role of gluten in determining the dough quality, its measured G* values are not strongly correlated with it [2]. During baking, the gas bubbles inside the dough extend it far beyond the limit of linear deformations. Therefore, measurements of extension to over 200% of the initial size are more informative [2]. Such extensions were generated in the last stage of our simulations and were discussed above. Unfortunately, properties like Wmax and Fmax depend on the system size and geometry, so they cannot be directly compared to experimental values.
Another hypothesis that can be checked in our simulation is that gluten proteins can form a β-spiral from consecutive β-turns made by repetitive fragments like QPGQ [13] (such fragments are common especially in glutenins [4]). Scanning tunneling microscopy discovered a groove with 1.5 nm periodicity in glutenins [4]. However, the sample preparation procedures used prevented the study from validating the existence of the β-spiral in bulk gluten. Models of single glutenin structures consisting of repetitive β [62] and γ [12] turns were reported in the literature, however we did not observe similar structures in our simulations.
We do not discuss here how gluten proteins interact with the human body, but this widely disputed subject [63, 64] can also be tackled by our model in the future. Simulations can also check what part of gluten proteins is solvent-accessible, and thus available for digestive enzymes in the stomach, which could shed new light on the gluten intolerance problem [65].
We thank Mariusz Raczkowski for his help in developing the potential for the disulfide bonds.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65