Machine learning paired with modeling takes guesswork out of nitrous oxide estimates
The Science
Nitrogen in the soil helps plants grow. Combined with oxygen as nitrous oxide, it can leak into the atmosphere, where it traps heat from the sun. The amount of nitrous oxide released depends on lots of factors, including weather, soil conditions, and crop management practices. That makes it hard to estimate how much of the gas is escaping. Mathematical models simulate complex interactions to predict how changes — such as tillage or when fertilizer is applied — will affect nitrous oxide emissions. But these process-based models aren’t very reliable. Machine learning is a type of artificial intelligence in which computers can be trained to identify complex relationships. But because they see patterns rather than understanding of the underlying causes, models often struggle to generalize or to predict “what-if” scenarios. As a result, many tools for estimating greenhouse gas emissions from farming use simplistic approaches based almost entirely on fertilizer inputs.
By combining process models with machine learning, scientists with the Great Lakes Bioenergy Research Center built a tool that correctly predicted daily emissions four times better than process-based models. Scientists trained the model on more than 12,000 nitrous oxide measurements collected by sensors at 17 sites across the Midwest and Great Plains. Spanning six cropping systems and 35 farm management practices, it is one of the most comprehensive datasets of its kind. The hybrid model predicted both the size and timing of emission peaks at the four test sites with 63% to 98% accuracy. It also identified the main drivers of nitrous oxide emissions, which researchers can now use to improve the process-based models.
The Impact
Nitrous oxide in the atmosphere traps energy from the sun, which contributes to changing weather patterns. Fertilized cropland is one of the main sources of nitrous oxide emissions. The challenge of predicting emissions makes it hard to come up with ways to predict or reduce them. This hybrid model provides more accurate predictions and can be used to improve existing models and guide efforts to use nitrogen fertilizer more efficiently.
Summary
Nitrous oxide (N2O) emissions are affected by weather, soil conditions — including the supply of mineral nitrogen, water content, temperature, acidity, and organic carbon — as well as management practices such as tillage and fertilizer timing. Process-based ecosystem or biogeochemical models can simulate daily N2O emissions mechanistically but often miss high-flux events and require site-specific calibration. Machine learning algorithms improve predictability but lack mechanistic understanding and cannot reliably evaluate hypothetical scenarios not included in the training datasets.
Here, scientists with the Great Lakes Bioenergy Research Center combined five process-based ecosystem models (APSIM, EPIC, SALUS, DSSAT, and STICS) to provide daily plant and soil properties that were used as input data for four blended ML models (Random Forest, Gradient Boosting, Support-Vector Regression, and XGBoost). The ensemble modeling system (EMS) was trained and validated with 12,181 N2O measurements at 17 sites across the Midwest and Great Plains spanning six crops and 35 management practices.
The EMS accurately predicted daily N2O fluxes at both training and held-out testing sites. Analyses identified six dominant N2O drivers: soil organic carbon, NH4+, NO3-, water-filled pore space, temperature, and aboveground biomass production. Wet, warm soils produced large N2O peaks only with sufficient SOC and mineral N. Incorporating these drivers into process-based models may significantly improve predictive capacity. The EMS demonstrates a strong potential to predict N2O fluxes at unseen sites, enabling more reliable regional inventories, improved gap-filling where measurements are sparse, and enhanced understanding of mechanisms to advance targeted mitigation strategies in food, feed, and bioenergy crops.