Sharp Reese (desertorder9)
We consider both EGT events resulting in maintaining (EGTcopy) or removing (EGTcut) the gene copy in the source genome. We present a linear-time algorithm for computing the DLE (Duplication, Loss and EGT) distance, as well as an optimal reconciled tree, for the unitary cost, and a dynamic programming algorithm allowing to output all optimal reconciliations for an arbitrary cost of operations. We illustrate the application of our EndoRex software and analyze different costs settings parameters on a plant dataset and discuss the resulting reconciled trees. EndoRex implementation and supporting data are available on the GitHub repository via https//github.com/AEVO-lab/EndoRex. EndoRex implementation and supporting data are available on the GitHub repository via https//github.com/AEVO-lab/EndoRex. Protein domain duplications are a major contributor to the functional diversification of protein families. These duplications can occur one at a time through single domain duplications, or as tandem duplications where several consecutive domains are duplicated together as part of a single evolutionary event. Existing methods for inferring domain-level evolutionary events are based on reconciling domain trees with gene trees. While some formulations consider multiple domain duplications, they do not explicitly model tandem duplications; this leads to inaccurate inference of which domains duplicated together over the course of evolution. Here, we introduce a reconciliation-based framework that considers the relative positions of domains within extant sequences. We use this information to uncover tandem domain duplications within the evolutionary history of these genes. We devise an integer linear programming approach that solves our problem exactly, and a heuristic approach that works well in practice. We perform extensive simulation studies to demonstrate that our approaches can accurately uncover single and tandem domain duplications, and additionally test our approach on a well-studied orthogroup where lineage-specific domain expansions exhibit varying and complex domain duplication patterns. Code is available on github at https//github.com/Singh-Lab/TandemDuplications. Supplementary data are available at Bioinformatics online. Supplementary data are available at Bioinformatics online.The emergency use authorization of two mRNA vaccines in less than a year from the emergence of SARS-CoV-2 represents a landmark in vaccinology1,2. Yet, how mRNA vaccines stimulate the immune system to elicit protective immune responses is unknown. Here we used a systems vaccinology approach to comprehensively profile the innate and adaptive immune responses of 56 healthy volunteers who were vaccinated with the Pfizer-BioNTech mRNA vaccine (BNT162b2). Vaccination resulted in the robust production of neutralizing antibodies against the wild-type SARS-CoV-2 (derived from 2019-nCOV/USA_WA1/2020) and, to a lesser extent, the B.1.351 strain, as well as significant increases in antigen-specific polyfunctional CD4 and CD8 T cells after the second dose. Booster vaccination stimulated a notably enhanced innate immune response as compared to primary vaccination, evidenced by (1) a greater frequency of CD14+CD16+ inflammatory monocytes; (2) a higher concentration of plasma IFNγ; and (3) a transcriptional signature of innate antiviral immunity. Consistent with these observations, our single-cell transcriptomics analysis demonstrated an approximately 100-fold increase in the frequency of a myeloid cell cluster enriched in interferon-response transcription factors and reduced in AP-1 transcription factors, after secondary immunization. Finally, we identified distinct innate pathways associated with CD8 T cell and neutralizing antibody responses, and show that a monocyte-related signature correlates with the neutralizing antibody response against the B.1.351 variant. Collectively, these data provide insight