Project description:Published estimates of methane emissions from atmospheric data (top-down approaches) exceed those from source-based inventories (bottom-up approaches), leading to conflicting claims about the climate implications of fuel switching from coal or petroleum to natural gas. Based on data from a coordinated campaign in the Barnett Shale oil and gas-producing region of Texas, we find that top-down and bottom-up estimates of both total and fossil methane emissions agree within statistical confidence intervals (relative differences are 10% for fossil methane and 0.1% for total methane). We reduced uncertainty in top-down estimates by using repeated mass balance measurements, as well as ethane as a fingerprint for source attribution. Similarly, our bottom-up estimate incorporates a more complete count of facilities than past inventories, which omitted a significant number of major sources, and more effectively accounts for the influence of large emission sources using a statistical estimator that integrates observations from multiple ground-based measurement datasets. Two percent of oil and gas facilities in the Barnett accounts for half of methane emissions at any given time, and high-emitting facilities appear to be spatiotemporally variable. Measured oil and gas methane emissions are 90% larger than estimates based on the US Environmental Protection Agency's Greenhouse Gas Inventory and correspond to 1.5% of natural gas production. This rate of methane loss increases the 20-y climate impacts of natural gas consumed in the region by roughly 50%.
Project description:Stable isotope labeling is the state of the art technique for in vivo quantification of lymphocyte kinetics in humans. It has been central to a number of seminal studies, particularly in the context of HIV-1 and leukemia. However, there is a significant discrepancy between lymphocyte proliferation rates estimated in different studies. Notably, deuterated (2)H2-glucose (D2-glucose) labeling studies consistently yield higher estimates of proliferation than deuterated water (D2O) labeling studies. This hampers our understanding of immune function and undermines our confidence in this important technique. Whether these differences are caused by fundamental biochemical differences between the two compounds and/or by methodological differences in the studies is unknown. D2-glucose and D2O labeling experiments have never been performed by the same group under the same experimental conditions; consequently a direct comparison of these two techniques has not been possible. We sought to address this problem. We performed both in vitro and murine in vivo labeling experiments using identical protocols with both D2-glucose and D2O. This showed that intrinsic differences between the two compounds do not cause differences in the proliferation rate estimates, but that estimates made using D2-glucose in vivo were susceptible to difficulties in normalization due to highly variable blood glucose enrichment. Analysis of three published human studies made using D2-glucose and D2O confirmed this problem, particularly in the case of short term D2-glucose labeling. Correcting for these inaccuracies in normalization decreased proliferation rate estimates made using D2-glucose and slightly increased estimates made using D2O; thus bringing the estimates from the two methods significantly closer and highlighting the importance of reliable normalization when using this technique.
Project description:Uncovering rates at which susceptible individuals become infected with a pathogen, i.e. the force of infection (FOI), is essential for assessing transmission risk and reconstructing distribution of immunity in a population. For dengue, reconstructing exposure and susceptibility statuses from the measured FOI is of particular significance as prior exposure is a strong risk factor for severe disease. FOI can be measured via many study designs. Longitudinal serology are considered gold standard measurements, as they directly track the transition of seronegative individuals to seropositive due to incident infections (seroincidence). Cross-sectional serology can provide estimates of FOI by contrasting seroprevalence across ages. Age of reported cases can also be used to infer FOI. Agreement of these measurements, however, have not been assessed. Using 26 years of data from cohort studies and hospital-attended cases from Kamphaeng Phet province, Thailand, we found FOI estimates from the three sources to be highly inconsistent. Annual FOI estimates from seroincidence was 2.46 to 4.33-times higher than case-derived FOI. Correlation between seroprevalence-derived and case-derived FOI was moderate (correlation coefficient=0.46) and no systematic bias. Through extensive simulations and theoretical analysis, we show that incongruences between methods can result from failing to account for dengue antibody kinetics, assay noise, and heterogeneity in FOI across ages. Extending standard inference models to include these processes reconciled the FOI and susceptibility estimates. Our results highlight the importance of comparing inferences across multiple data types to uncover additional insights not attainable through a single data type/analysis.
Project description:BackgroundDespite decades of interventions, 240 million people have schistosomiasis. Infections cannot be directly observed, and egg-based Kato-Katz thick smears lack sensitivity, affected treatment efficacy and reinfection rate estimates. The point-of-care circulating cathodic antigen (referred to from here as POC-CCA+) test is advocated as an improvement on the Kato-Katz method, but improved estimates are limited by ambiguities in the interpretation of trace results.MethodWe collected repeated Kato-Katz egg counts from 210 school-aged children and scored POC-CCA tests according to the manufacturer's guidelines (referred to from here as POC-CCA+) and the externally developed G score. We used hidden Markov models parameterized with Kato-Katz; Kato-Katz and POC-CCA+; and Kato-Katz and G-Scores, inferring latent clearance and reinfection probabilities at four timepoints over six-months through a more formal statistical reconciliation of these diagnostics than previously conducted. Our approach required minimal but robust assumptions regarding trace interpretations.ResultsAntigen-based models estimated higher infection prevalence across all timepoints compared with the Kato-Katz model, corresponding to lower clearance and higher reinfection estimates. Specifically, pre-treatment prevalence estimates were 85% (Kato-Katz; 95% CI: 79%-92%), 99% (POC-CCA+; 97%-100%) and 98% (G-Score; 95%-100%). Post-treatment, 93% (Kato-Katz; 88%-96%), 72% (POC-CCA+; 64%-79%) and 65% (G-Score; 57%-73%) of those infected were estimated to clear infection. Of those who cleared infection, 35% (Kato-Katz; 27%-42%), 51% (POC-CCA+; 41%-62%) and 44% (G-Score; 33%-55%) were estimated to have been reinfected by 9-weeks.ConclusionsTreatment impact was shorter-lived than Kato-Katz-based estimates alone suggested, with lower clearance and rapid reinfection. At 3 weeks after treatment, longer-term clearance dynamics are captured. At 9 weeks after treatment, reinfection was captured, but failed clearance could not be distinguished from rapid reinfection. Therefore, frequent sampling is required to understand these important epidemiological dynamics.
Project description:The gradient of Bicoid (Bcd) is key for the establishment of the anterior-posterior axis in Drosophila embryos. The gradient properties are compatible with the SDD model in which Bcd is synthesized at the anterior pole and then diffuses into the embryo and is degraded with a characteristic time. Within this model, the Bcd diffusion coefficient is critical to set the timescale of gradient formation. This coefficient has been measured using two optical techniques, Fluorescence Recovery After Photobleaching (FRAP) and Fluorescence Correlation Spectroscopy (FCS), obtaining estimates in which the FCS value is an order of magnitude larger than the FRAP one. This discrepancy raises the following questions: which estimate is "correct''; what is the reason for the disparity; and can the SDD model explain Bcd gradient formation within the experimentally observed times? In this paper, we use a simple biophysical model in which Bcd diffuses and interacts with binding sites to show that both the FRAP and the FCS estimates may be correct and compatible with the observed timescale of gradient formation. The discrepancy arises from the fact that FCS and FRAP report on different effective (concentration dependent) diffusion coefficients, one of which describes the spreading rate of the individual Bcd molecules (the messengers) and the other one that of their concentration (the message). The latter is the one that is more relevant for the gradient establishment and is compatible with its formation within the experimentally observed times.
Project description:BackgroundEstimates of community spread and infection fatality rate (IFR) of COVID-19 have varied across studies. Efforts to synthesize the evidence reach seemingly discrepant conclusions.MethodsSystematic evaluations of seroprevalence studies that had no restrictions based on country and which estimated either total number of people infected and/or aggregate IFRs were identified. Information was extracted and compared on eligibility criteria, searches, amount of evidence included, corrections/adjustments of seroprevalence and death counts, quantitative syntheses and handling of heterogeneity, main estimates and global representativeness.ResultsSix systematic evaluations were eligible. Each combined data from 10 to 338 studies (9-50 countries), because of different eligibility criteria. Two evaluations had some overt flaws in data, violations of stated eligibility criteria and biased eligibility criteria (eg excluding studies with few deaths) that consistently inflated IFR estimates. Perusal of quantitative synthesis methods also exhibited several challenges and biases. Global representativeness was low with 78%-100% of the evidence coming from Europe or the Americas; the two most problematic evaluations considered only one study from other continents. Allowing for these caveats, four evaluations largely agreed in their main final estimates for global spread of the pandemic and the other two evaluations would also agree after correcting overt flaws and biases.ConclusionsAll systematic evaluations of seroprevalence data converge that SARS-CoV-2 infection is widely spread globally. Acknowledging residual uncertainties, the available evidence suggests average global IFR of ~0.15% and ~1.5-2.0 billion infections by February 2021 with substantial differences in IFR and in infection spread across continents, countries and locations.
Project description:BackgroundThe application of different approaches calculating the anthropogenic carbon net flux from land, leads to estimates that vary considerably. One reason for these variations is the extent to which approaches consider forest land to be "managed" by humans, and thus contributing to the net anthropogenic flux. Global Earth Observation (EO) datasets characterising spatio-temporal changes in land cover and carbon stocks provide an independent and consistent approach to estimate forest carbon fluxes. These can be compared against results reported in National Greenhouse Gas Inventories (NGHGIs) to support accurate and timely measuring, reporting and verification (MRV). Using Brazil as a primary case study, with additional analysis in Indonesia and Malaysia, we compare a Global EO-based dataset of forest carbon fluxes to results reported in NGHGIs.ResultsBetween 2001 and 2020, the EO-derived estimates of all forest-related emissions and removals indicate that Brazil was a net sink of carbon (- 0.2 GtCO2yr-1), while Brazil's NGHGI reported a net carbon source (+ 0.8 GtCO2yr-1). After adjusting the EO estimate to use the Brazilian NGHGI definition of managed forest and other assumptions used in the inventory's methodology, the EO net flux became a source of + 0.6 GtCO2yr-1, comparable to the NGHGI. Remaining discrepancies are due largely to differing carbon removal factors and forest types applied in the two datasets. In Indonesia, the EO and NGHGI net flux estimates were similar (+ 0.6 GtCO2 yr-1), but in Malaysia, they differed in both magnitude and sign (NGHGI: -0.2 GtCO2 yr-1; Global EO: + 0.2 GtCO2 yr-1). Spatially explicit datasets on forest types were not publicly available for analysis from either NGHGI, limiting the possibility of detailed adjustments.ConclusionsBy adjusting the EO dataset to improve comparability with carbon fluxes estimated for managed forests in the Brazilian NGHGI, initially diverging estimates were largely reconciled and remaining differences can be explained. Despite limited spatial data available for Indonesia and Malaysia, our comparison indicated specific aspects where differing approaches may explain divergence, including uncertainties and inaccuracies. Our study highlights the importance of enhanced transparency, as set out by the Paris Agreement, to enable alignment between different approaches for independent measuring and verification.
Project description:Susceptible-Exposed-Infected-Removed (SEIR)-type epidemiologic models, modeling unascertained infections latently, can predict unreported cases and deaths assuming perfect testing. We apply a method we developed to account for the high false negative rates of diagnostic RT-PCR tests for detecting an active SARS-CoV-2 infection in a classic SEIR model. The number of unascertained cases and false negatives being unobservable in a real study, population-based serosurveys can help validate model projections. Applying our method to training data from Delhi, India, during March 15-June 30, 2020, we estimate the underreporting factor for cases at 34-53 (deaths: 8-13) on July 10, 2020, largely consistent with the findings of the first round of serosurveys for Delhi (done during June 27-July 10, 2020) with an estimated 22.86% IgG antibody prevalence, yielding estimated underreporting factors of 30-42 for cases. Together, these imply approximately 96-98% cases in Delhi remained unreported (July 10, 2020). Updated calculations using training data during March 15-December 31, 2020 yield estimated underreporting factor for cases at 13-22 (deaths: 3-7) on January 23, 2021, which are again consistent with the latest (fifth) round of serosurveys for Delhi (done during January 15-23, 2021) with an estimated 56.13% IgG antibody prevalence, yielding an estimated range for the underreporting factor for cases at 17-21. Together, these updated estimates imply approximately 92-96% cases in Delhi remained unreported (January 23, 2021). Such model-based estimates, updated with latest data, provide a viable alternative to repeated resource-intensive serosurveys for tracking unreported cases and deaths and gauging the true extent of the pandemic.
Project description:BACKGROUND:Many methods have been developed to infer and reason about molecular interaction networks. These approaches often yield networks with hundreds or thousands of nodes and up to an order of magnitude more edges. It is often desirable to summarize the biological information in such networks. A very common approach is to use gene function enrichment analysis for this task. A major drawback of this method is that it ignores information about the edges in the network being analyzed, i.e., it treats the network simply as a set of genes. In this paper, we introduce a novel method for functional enrichment that explicitly takes network interactions into account. RESULTS:Our approach naturally generalizes Fisher's exact test, a gene set-based technique. Given a function of interest, we compute the subgraph of the network induced by genes annotated to this function. We use the sequence of sizes of the connected components of this sub-network to estimate its connectivity. We estimate the statistical significance of the connectivity empirically by a permutation test. We present three applications of our method: i) determine which functions are enriched in a given network, ii) given a network and an interesting subnetwork of genes within that network, determine which functions are enriched in the sub-network, and iii) given two networks, determine the functions for which the connectivity improves when we merge the second network into the first. Through these applications, we show that our approach is a natural alternative to network clustering algorithms. CONCLUSIONS:We presented a novel approach to functional enrichment that takes into account the pairwise relationships among genes annotated by a particular function. Each of the three applications discovers highly relevant functions. We used our methods to study biological data from three different organisms. Our results demonstrate the wide applicability of our methods. Our algorithms are implemented in C++ and are freely available under the GNU General Public License at our supplementary website. Additionally, all our input data andresults are available at http://bioinformatics.cs.vt.edu/~murali/supplements/2011-incob-nbe/.