Vedel Oneal (movedetail96)
The Tibetan Schizothoracinae fish Gymnocypris przewalskii has the ability to adapt to the extreme plateau environment, making it an ideal biological material for evolutionary biology research. However, the lack of well-annotated reference genomes has limited the study of the molecular genetics of G. przewalskii. To characterize its transcriptome features, we first used long-read sequencing technology in combination with RNA-seq for transcriptomic analysis. A total of 159,053 full-length (FL) transcripts were captured by Iso-Seq, having a mean length of 3,445 bp with N50 value of 4,348. Of all FL transcripts, 145,169 were well-annotated in the public database and 134,537 contained complete open reading frames. There were 4,149 pairs of alternative splicing events, of which three randomly selected were defined by RT-PCR and sequencing, and 13,293 long non-coding RNAs detected, based on all- vs.-all BLAST. A total of 118,185 perfect simple sequence repeats were identified from FL transcripts. The FL transcriptome might provide basis for further research of G. przewalskii.Knock-in of large transgenes by Cas9-mediated homology-directed repair (HDR) is an extremely inefficient process. Although the use of single-stranded oligonucleotides (ssODN) as an HDR donor has improved the integration of smaller transgenes, they do not support efficient insertion of large DNA sequences. In an effort to gain insights into the mechanism(s) governing the HDR-mediated integration of larger transgenes and to improve the technology, we conducted knock-in experiments targeting the human EMX1 locus and applied rigorous genomic PCR analyses in the human HEK293 cell line. This exercise revealed an unexpected molecular complication arising from the transgene HDR being initiated at the single homology arm and the subsequent genomic integration of plasmid backbone sequences. To pivot around this problem, we devised a novel PCR-constructed template containing Blocked Long 3' Single-Stranded Overhangs (BL3SSO) that greatly improved the efficiency of bona fide Cas9-stimulated HDR at the EMX1 locus. We further refined BL3SSO technology and successfully used it to insert GFP transgenes into two important interferon-stimulated gene (ISG) loci, Viperin/RSAD2 and ISG15. This study demonstrates the utility of the BL3SSO platform for inserting long DNA sequences into both constitutive and inducible endogenous loci to generate novel human cell lines for the study of important biological processes. The sparse allele vectors (SAV) file format is an efficient storage format for large-scale DNA variation data and is designed for high throughput association analysis by leveraging techniques for fast deserialization of data into computer memory. A command line interface has been developed to complement the storage format and supports basic features like importing, exporting and subsetting. Additionally, a C ++ programming API is available allowing for easy integration into analysis software. https//github.com/statgen/savvy. Supplementary data are available at Bioinformatics online. Supplementary data are available at Bioinformatics online. Ligand-receptor (LR) network analysis allows the characterization of cellular crosstalk based on single cell RNA-seq data. However, current methods typically provide a list of inferred LR interactions and do not allow the researcher to focus on specific cell types, ligands or receptors. In addition, most of these methods cannot quantify changes in crosstalk between two biological phenotypes. CrossTalkeR is a framework for network analysis and visualisation of LR interactions. CrossTalkeR identifies relevant ligands, receptors and cell types contributing to changes in cell communication when contrasting two biological phenotypes, i.e. disease vs. homeostasis. A case study on scRNA-seq of human myeloproliferative neoplasms reinforces the strengths of CrossTalkeR for characterisation of