Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information
  1. Single-colony MALDI mass spectrometry imaging reveals spatial differences in metabolite abundance between natural and cultured Trichodesmium morphotypes

    Trichodesmium, a globally significant N2-fixing marine cyanobacterium, forms extensive surface blooms in nutrient-poor ocean regions. These blooms consist of a dynamic assemblage of Trichodesmium species that form distinct colony morphotypes and are inhabited by diverse microorganisms. Trichodesmium colony morphotypes vary in ecological niche, nutrient uptake, and organic molecule release, differentially impacting ocean carbon and nitrogen biogeochemical cycles. Here, we assessed the poorly studied spatial abundance of metabolites within and between three morphologically distinct Trichodesmium colonies collected from the Red Sea. We also compared these results with two morphotypes of the cultivable Trichodesmium strain IMS101. Using matrix-assisted laser desorption/ionization (MALDI) mass spectrometry imaging (MSI) coupled with liquid extraction surface analysis (LESA) tandem mass spectrometry (MS2), we identified and localized a wide range of small metabolites associated with single-colony Trichodesmium morphotypes. Our untargeted MALDI-MSI approach revealed 80 unique features (metabolites) shared between Trichodesmium morphotypes. Discrimination analysis showed spatial variations in 57 shared metabolites, accounting for 62% of the observed variation between morphotypes. The greatest variations in metabolite abundance were observed between the cultured morphotypes compared to the natural colony morphotypes, suggesting substantial differences in metabolite production between the cultivable strain IMS101 and the naturally occurring colony morphotypes that the cultivable strain is meant to represent. This study highlights the variations in metabolite abundance between natural and cultured Trichodesmium morphotypes and provides valuable insights into metabolites common to morphologically distinct Trichodesmium colonies, offering a foundation for future targeted metabolomic investigations.

  2. Longitudinal analysis of host protein serum signatures of treatment and recovery in pulmonary tuberculosis

    A better understanding of treatment progression and recovery in pulmonary tuberculosis (TB) infectious disease is crucial. This study analyzed longitudinal serum samples from pulmonary TB patients undergoing interventional treatment to identify surrogate markers for TB-related outcomes. Serum that was collected at baseline and 8, 17, 26, and 52 weeks from 30 TB patients experiencing durable cure were evaluated and compared using a sensitive LC-MS/MS proteomic platform for the detection and quantification of differential host protein signatures relative to timepoint. The global proteome signature was analyzed for statistical differences across the time course and between disease severity and treatment groups. A total of 676 proteins showed differential expression in the serum over these timepoints relative to baseline. Comparisons to understand serum protein dynamics at 8 weeks, treatment endpoints at 17 and 26 weeks, and post-treatment at 52 weeks were performed. The largest protein abundance changes were observed at 8 weeks as the initial effects of antibiotic treatment strongly impacted inflammatory and immune modulated responses. However, the largest number of proteome changes was observed at the end of treatment time points 17 and 26 weeks respectively. Post-treatment 52-week results showed an abatement of differential proteome signatures from end of treatment, though interestingly those proteins uniquely significant at post-treatment were almost exclusively downregulated. Patients were additionally stratified based upon disease severity and compared across all timepoints, identifying 461 discriminating proteome signatures. These proteome signatures collapsed into discrete expression profiles with distinct pathways across immune activation and signaling, hemostasis, and metabolism annotations. Insulin-like growth factor (IGF) and Integrin signaling maintained a severity signature through 52 weeks, implying an intrinsic disease severity signature well into the post-treatment timeframe. Previous proteome studies have primarily focused on the 8-week timepoint in relation to culture conversion status. While this study confirms previous observations, it also highlights some differences. The inclusion of additional end of treatment and post-treatment time points offers a more comprehensive assessment of treatment progression within the serum proteome. Examining the expression dynamics at these later time periods will help in the investigation of relapse patients and has provided indicative markers of response and recovery.

  3. Models, data, and scripts associated with “Prediction of Distributed River Sediment Respiration Rates using Community-Generated Data and Machine Learning”

    This data package is associated with the publication “Prediction of Distributed River Sediment Respiration Rates using Community-Generated Data and Machine Learning’’ submitted to the Journal of Geophysical Research: Machine Learning and Computation (Scheibe et al. 2024). River sediment respiration observations are expensive and labor intensive to obtain and there is no physical model for predicting this quantity. The Worldwide Hydrobiogeochemisty Observation Network for Dynamic River Systems (WHONDRS) observational data set (Goldman et al.; 2020) is used to train machine learning (ML) models to predict respiration rates at unsampled sites. This repository archives training data, ML models, predictions, and model evaluation results for the purposes of reproducibility of the results in the associated manuscript and community reuse of the ML models trained in this project. One of the key challenges in this work was to find an optimum configuration for machine learning models to work with this feature-rich (i.e. 100+ possible input variables) data set. Here, we used a two-tiered approach to managing the analysis of this complex data set: 1) a stacked ensemble of ML models that can automatically optimize hyperparameters to accelerate the process of model selection and tuning and 2) feature permutation importance to iteratively select the most important features (i.e. inputs) to the ML models. The major elements of this ML workflow are modular, portable, open, and cloud-based, thus making this implementation a potential template for other applications. This data package is associated with the GitHub repository found at https://github.com/parallelworks/sl-archive-whondrs. A static copy of the GitHub repository is included in this data package as an archived version at the time of publishing this data package (March 2023). However, we recommend accessing these files via GitHub for full functionality.Please see the file level metadata (flmd; “sl-archive-whondrs_flmd.csv”) for a list of all files contained in this data package and descriptions for each. Please see the data dictionary (dd; “sl-archive-whondrs_dd.csv”) for a list of all column headers contained within comma separated value (csv) files in this data package and descriptions for each. The GitHub repository is organized into five top-level directories: (1) “input_data” holds the training data for the ML models; (2) “ml_models” holds machine learning models trained on the data in “input_data”; (3) “scripts” contains data preprocessing and postprocessing scripts and intermediate results specific to this data set that bookend the ML workflow; (4) “examples” contains the visualization of the results in this repository including plotting scripts for the manuscript (e.g., model evaluation, FPI results) and scripts for running predictions with the ML models (i.e., reusing the trained ML models); (5) “output_data” holds the overall results of the ML model on that branch. Each trained ML model resides on its own branch in the repository; this means that inputs and outputs can be different branch-to-branch. Furthermore, depending on the number of features used to train the ML models, the preprocessing and postprocessing scripts, and their intermediate results, can also be different branch-to-branch. The “main-*” branches are meant to be starting points (i.e. trunks) for each model branch (i.e. sprouts). Please see the Branch Navigation section in the top-level README.md in the GitHub repository for more details. There is also one hidden directory “.github/workflows”. This hidden directory contains information for how to run the ML workflow as an end-to-end automated GitHub Action but it is not needed for reusing the ML models archived here. Please the top-level README.md in the GitHub repository for more details on the automation.

  4. Evaluation of riparian enhancement actions in the Columbia River Basin

    Riparian enhancement is a common restoration technique in the Columbia River Basin (CRB) and the Pacific Northwest. However, relatively few studies have evaluated its success and even fewer studies include long-term monitoring. Forty-one riparian planting projects located in the CRB, each with a paired treatment and control reach, were evaluated in the summer of 2018 and 2019 using an extensive post-treatment (EPT) design. At each reach, we quantified woody plant abundance, richness, diversity, and vegetation and canopy cover as riparian response variables, and measured terrace height and documented individuals with bud browse, deceased, and with predator protection as potential explanatory variables. Species richness, woody plant abundance (all height classes combined and within shrub height class), and the proportion of woody plants with bud browse or deceased were all higher in treatment than control reaches. However, no significant improvements were observed for any other riparian response or explanatory variables. Implementation techniques, including invasive species removal efforts, follow-up planting, watering, and the type of restoration project (floodplain restoration and planting, in-stream restoration and planting, or solely planting) were also considered as potential explanatory variables and were found to have significant impacts on restoration response for abundance and cover variables. Further, our results suggest that planting implementation methods, site level physical factors, and time since planting all influence the success of riparian planting projects and if not addressed in restoration design and implementation can contribute to a lack of detectable response.

  5. synDNA—a Synthetic DNA Spike-in Method for Absolute Quantification of Shotgun Metagenomic Sequencing

    Microbiome studies have the common goal of determining which microbial taxa are present, respond to specific conditions, or promote phenotypic changes in the host. Most of these studies rely on relative abundance measurements to drive conclusions. Inherent limitations of relative values are the inability to determine whether an individual taxon is more or less abundant and the magnitude of this change between the two samples. These limitations can be overcome by using absolute abundance quantifications, which can allow for a more complete understanding of community dynamics by measuring variations in total microbial loads. Obtaining absolute abundance measurements is still technically challenging. Here, we developed synthetic DNA (synDNA) spike-ins that enable precise and cost-effective absolute quantification of microbiome data by adding defined amounts of synDNAs to the samples. We designed 10 synDNAs with the following features: 2,000-bp length, variable GC content (26, 36, 46, 56, or 66% GC), and negligible identity to sequences found in the NCBI database. Dilution pools were generated by mixing the 10 synDNAs at different concentrations. Shotgun metagenomic sequencing showed that the pools of synDNAs with different percentages of GC efficiently reproduced the serial dilution, showing high correlation (r = 0.96; R2 ≥ 0.94) and significance (P < 0.01). Furthermore, we demonstrated that the synDNAs can be used as DNA spike-ins to generate linear models and predict with high accuracy the absolute number of bacterial cells in complex microbial communities.

  6. Disc dichotomy signature in the vertical distribution of [Mg/Fe] and the delayed gas infall scenario

    Context. Analysis of the Apache Point Observatory Galactic Evolution Experiment project (APOGEE) data suggests the existence of a clear distinction between two sequences of disc stars in the [α/Fe] versus [Fe/H] abundance ratio space, known as the high- and low-α sequence, respectively. This dichotomy also emerges from an analysis of the vertical distribution of the [α/Fe] abundance ratio. Aims. We aim to test whether the revised two-infall chemical evolution models designed to reproduce the low- and high-α sequences in the [α/Fe] versus [Fe/H] ratios in the solar neighbourhood are also capable of predicting the disc bimodality observed in the vertical distribution of [Mg/Fe] in APOGEE DR16 data. Methods. Along with the chemical composition of the simple stellar populations born at different Galactic times predicted by our reference chemical evolution models in the solar vicinity, we provide their maximum vertical height above the Galactic plane |zmax| computed assuming the relation between the vertical action and stellar age in APOGEE thin-disc stars. Result. The vertical distribution of the [Mg/Fe] abundance ratio predicted by the reference chemical evolution models is in agreement with that observed when combining the APOGEE DR16 data (chemical abundances) with the astroNN catalogue (stellar ages, orbital parameters) for stars younger than 8 Gyr (only low-α sequence stars). Including the high-α disc component, the dichotomy in the vertical [Mg/Fe] abundance distribution is reproduced considering the observational cut in the Galactic height of |z|< 2 kpc. However, our model predicts an overly flat (almost constant) growth of the maximum vertical height |zmax| quantity as a function of [Mg/Fe] for high-α objects in contrast with the median values from APOGEE data. Possible explanations for such a tension are that: (i) the APOGEE sample with |z|< 2 kpc is more likely than ours to be contaminated by halo stars, causing the median values to be kinematically hotter, and (ii) external perturbations – such as minor mergers – that the Milky Way experienced in the past could have heated up the disc, and the heating of the orbits cannot be modeled by only scattering processes. Assuming a disc dissection based on chemistry for APOGEE-DR16 stars (|z|< 2 kpc), the observed |zmax| distributions for high-α and low-α sequences are in good agreement with our model predictions if we consider the errors in the vertical action estimates in the calculation. Moreover, a better agreement between predicted and observed stellar distributions at different Galactic vertical heights is achieved if asteroseismic ages are included as a constraint in the best-fit model calculations. Conclusions. The signature of a delayed gas infall episode, which gives rise to a hiatus in the star formation history of the Galaxy, are imprinted both in the [Mg/Fe] versus [Fe/H] relation and in vertical distribution of [Mg/Fe] abundances in the solar vicinity.

  7. Milky Way Satellite Census. IV. Constraints on Decaying Dark Matter from Observations of Milky Way Satellite Galaxies

    We use a recent census of the Milky Way (MW) satellite galaxy population to constrain the lifetime of particle dark matter (DM). We consider two-body decaying dark matter (DDM) in which a heavy DM particle decays with lifetime $$\tau$$ comparable to the age of the Universe to a lighter DM particle (with mass splitting $$\epsilon$$) and to a dark radiation species. These decays impart a characteristic "kick velocity," $$V_{\mathrm{kick}}=\epsilon c$$, on the DM daughter particles, significantly depleting the DM content of low-mass subhalos and making them more susceptible to tidal disruption. We fit the suppression of the present-day DDM subhalo mass function (SHMF) as a function of $$\tau$$ and $$V_{\mathrm{kick}}$$ using a suite of high-resolution zoom-in simulations of MW-mass halos, and we validate this model on new DDM simulations of systems specifically chosen to resemble the MW. We implement our DDM SHMF predictions in a forward model that incorporates inhomogeneities in the spatial distribution and detectability of MW satellites and uncertainties in the mapping between galaxies and DM halos, the properties of the MW system, and the disruption of subhalos by the MW disk using an empirical model for the galaxy--halo connection. By comparing to the observed MW satellite population, we conservatively exclude DDM models with $$\tau < 18\ \mathrm{Gyr}$$ ($$29\ \mathrm{Gyr}$$) for $$V_{\mathrm{kick}}=20\ \mathrm{km}\, \mathrm{s}^{-1}$$ ($$40\ \mathrm{km}\, \mathrm{s}^{-1}$$) at $$95\%$$ confidence. These constraints are among the most stringent and robust small-scale structure limits on the DM particle lifetime and strongly disfavor DDM models that have been proposed to alleviate the Hubble and $$S_8$$ tensions.

  8. Self-consistent dispersal puts tight constraints on the spatiotemporal organization of species-rich metacommunities

    Dispersal can be critical to the maintenance of ecosystems as it allows local communities to be recolonized after extinction. However, it remains unclear whether the extinction-mitigating effect of dispersal persists when the number of competing species is large. Based on a spatially explicit mathematical description of metacommunities, we show that when many species coexist, each species operates near its extinction threshold, barely surviving due to dispersal. This has general consequences for spatiotemporal abundance patterns. For short-range dispersal, species organize into fractal spatiotemporal extinction patterns characteristic of a directed percolation phase transition. As species approach their extinction threshold, biodiversity is very sensitive to perturbation, suggesting that dispersal within a metacommunity puts tight constraints on the robustness and evolution of species-rich metacommunities. Biodiversity is often attributed to a dynamic equilibrium between the immigration and extinction of species. This equilibrium forms a common basis for studying ecosystem assembly from a static reservoir of migrants—the mainland. Yet, natural ecosystems often consist of many coupled communities (i.e., metacommunities), and migration occurs between these communities. The pool of migrants then depends on what is sustained in the ecosystem, which, in turn, depends on the dynamic migrant pool. This chicken-and-egg problem of survival and dispersal is poorly understood in communities of many competing species, except for the neutral case—the “unified neutral theory of biodiversity.” Employing spatiotemporal simulations and mean-field analyses, we show that self-consistent dispersal puts rather tight constraints on the dynamic migration–extinction equilibrium. When the number of species is large, species are pushed to the edge of their global extinction, even when competition is weak. As a consequence, the overall diversity is highly sensitive to perturbations in demographic parameters, including growth and dispersal rates. When dispersal is short range, the resulting spatiotemporal abundance patterns follow broad scale-free distributions that correspond to a directed percolation phase transition. The qualitative agreement of our results for short-range and long-range dispersal suggests that this self-organization process is a general property of species-rich metacommunities. Our study shows that self-sustaining metacommunities are highly sensitive to environmental change and provides insights into how biodiversity can be rescued and maintained.

  9. Influence of landscape attributes on Virginia opossum density

    The Virginia opossum (Didelphis virginiana), North America's only marsupial, has a range extending from southern Ontario, Canada, to the Yucatan Peninsula, Mexico, and from the Atlantic seaboard to the Pacific. Despite the Virginia opossum's taxonomic uniqueness in relation to other mammals in North America and rapidly expanding distribution, its ecology remains relatively understudied. Our poor understanding of the ecology of this important mesopredator is especially pronounced in the rural southeastern United States. Our goal was to estimate effects of habitat on opossum density within an extensive multi-year spatial capture-recapture study. Additionally, we compared the results of this spatial capture-recapture analysis with a simple relative abundance index. Opossum densities in the relatively underdeveloped regions of the southeastern United States were lower compared to the more human-dominated landscapes of the Northeast and Midwest. In the southeastern United States, Virginia opossums occurred at a higher density in bottomland swamp and riparian hardwood forest compared to upland pine (Pinus spp.) plantations and isolated wetlands. These results reinforce the notion that the Virginia opossum is commonly associated with land cover types adjacent to permanent water (bottomland swamps, riparian hardwood). The relatively low density of opossums at isolated wetland sites suggests that the large spatial scale of selection demonstrated by opossums gives the species access to preferable cover types within the same landscape.

  10. The Open Cluster Chemical Abundances and Mapping Survey. VII. APOGEE DR17 [C/N]–Age Calibration

    Large-scale surveys open the possibility to investigate Galactic evolution both chemically and kinematically; however, reliable stellar ages remain a major challenge. Detailed chemical information provided by high-resolution spectroscopic surveys of the stars in clusters can be used as a means to calibrate recently developed chemical tools for age-dating field stars. Using data from the Open Cluster Abundances and Mapping survey, based on the Sloan Digital Sky Survey/Apache Point Observatory Galactic Evolution Experiment 2 survey, we derive a new empirical relationship between open cluster stellar ages and the carbon-to-nitrogen ([C/N]) abundance ratios for evolved stars, primarily those on the red giant branch. With this calibration, [C/N] can be used as a chemical clock for evolved field stars to investigate the formation and evolution of different parts of our Galaxy. We explore how mixing effects at different stellar evolutionary phases, like the red clump, affect the derived calibration. We have established the [C/N]–age calibration for APOGEE Data Release 17 (DR17) giant star abundances to be $$\mathrm{log}{[\mathrm{Age}(\mathrm{yr})]}_{\mathrm{DR}17}=10.14\,(\pm 0.08)+2.23(\pm 0.19)\,[{\rm{C}}/{\rm{N}}]$$, usable for $$8.62\leqslant \mathrm{log}(\mathrm{Age}[\mathrm{yr}])\leqslant 9.82$$, derived from a uniform sample of 49 clusters observed as part of APOGEE DR17 applicable primarily to metal-rich, thin- and thick-disk giant stars. This measured [C/N]–age APOGEE DR17 calibration is also shown to be consistent with asteroseismic ages derived from Kepler photometry.


Search for:
All Records
Subject
ABUNDANCE

Refine by:
Resource Type
Availability
Publication Date
  • 1940: 3 results
  • 1941: 0 results
  • 1942: 0 results
  • 1943: 0 results
  • 1944: 0 results
  • 1945: 2 results
  • 1946: 2 results
  • 1947: 0 results
  • 1948: 3 results
  • 1949: 0 results
  • 1950: 3 results
  • 1951: 2 results
  • 1952: 3 results
  • 1953: 5 results
  • 1954: 0 results
  • 1955: 2 results
  • 1956: 2 results
  • 1957: 4 results
  • 1958: 2 results
  • 1959: 3 results
  • 1960: 7 results
  • 1961: 20 results
  • 1962: 25 results
  • 1963: 47 results
  • 1964: 79 results
  • 1965: 248 results
  • 1966: 1,112 results
  • 1967: 2,155 results
  • 1968: 2,211 results
  • 1969: 2,056 results
  • 1970: 1,576 results
  • 1971: 673 results
  • 1972: 608 results
  • 1973: 611 results
  • 1974: 368 results
  • 1975: 243 results
  • 1976: 164 results
  • 1977: 185 results
  • 1978: 222 results
  • 1979: 242 results
  • 1980: 272 results
  • 1981: 263 results
  • 1982: 251 results
  • 1983: 229 results
  • 1984: 248 results
  • 1985: 243 results
  • 1986: 201 results
  • 1987: 197 results
  • 1988: 222 results
  • 1989: 323 results
  • 1990: 244 results
  • 1991: 209 results
  • 1992: 72 results
  • 1993: 100 results
  • 1994: 60 results
  • 1995: 148 results
  • 1996: 153 results
  • 1997: 75 results
  • 1998: 54 results
  • 1999: 51 results
  • 2000: 38 results
  • 2001: 50 results
  • 2002: 53 results
  • 2003: 57 results
  • 2004: 70 results
  • 2005: 112 results
  • 2006: 172 results
  • 2007: 106 results
  • 2008: 177 results
  • 2009: 427 results
  • 2010: 506 results
  • 2011: 361 results
  • 2012: 415 results
  • 2013: 429 results
  • 2014: 541 results
  • 2015: 523 results
  • 2016: 527 results
  • 2017: 340 results
  • 2018: 70 results
  • 2019: 44 results
  • 2020: 48 results
  • 2021: 14 results
  • 2022: 11 results
  • 2023: 1 results
  • 2024: 3 results
1940
2024
Author / Contributor
Research Organization