1. Penguin: A tool for predicting pseudouridine sites in direct RNA nanopore sequencing data
Doaa Hassan, Daniel Acevedo, Swapna Vidhur Daulatabad, Quoseena Mir, Sarath Chandra Janga Methods. 2022 Jul;203:478-487. doi: 10.1016/j.ymeth.2022.02.005. Epub 2022 Feb 16.
Pseudouridine is one of the most abundant RNA modifications, occurring when uridines are catalyzed by Pseudouridine synthase proteins. It plays an important role in many biological processes and has been reported to have application in drug development. Recently, the single-molecule sequencing techniques such as the direct RNA sequencing platform offered by Oxford Nanopore technologies have enabled direct detection of RNA modifications on the molecule being sequenced. In this study, we introduce a tool called Penguin that integrates several machine learning (ML) models to identify RNA Pseudouridine sites on Nanopore direct RNA sequencing reads. Pseudouridine sites were identified on single molecule sequencing data collected from direct RNA sequencing resulting in 723 K reads in Hek293 and 500 K reads in Hela cell lines. Penguin extracts a set of features from the raw signal measured by the Oxford Nanopore and the corresponding basecalled k-mer. Those features are used to train the predictors included in Penguin, which in turn, can predict whether the signal is modified by the presence of Pseudouridine sites in the testing phase. We have included various predictors in Penguin, including Support vector machines (SVM), Random Forest (RF), and Neural network (NN). The results on the two benchmark data sets for Hek293 and Hela cell lines show outstanding performance of Penguin either in random split testing or in independent validation testing. In random split testing, Penguin has been able to identify Pseudouridine sites with a high accuracy of 93.38% by applying SVM to Hek293 benchmark dataset. In independent validation testing, Penguin achieves an accuracy of 92.61% by training SVM with Hek293 benchmark dataset and testing it for identifying Pseudouridine sites on Hela benchmark dataset. Thus, Penguin outperforms the existing Pseudouridine predictors in the literature by 16 % higher accuracy than those predictors using independent validation testing. Employing penguin to predict Pseudouridine sites revealed a significant enrichment of "regulation of mRNA 3'-end processing" in Hek293 cell line and 'positive regulation of transcription from RNA polymerase II promoter involved in cellular response to chemical stimulus' in Hela cell line. Penguin software and models are available on GitHub at https://github.com/Janga-Lab/Penguin and can be readily employed for predicting Ψ sites from Nanopore direct RNA-sequencing datasets.
2. Selected ocular dimensions of three penguin species
Peter W Hadden, Misha Vorobyev, Stephanie B Cassidy, Akilesh Gokul, Samantha K Simkin, Henry Tran, Charles N J McGhee, Jie Zhang Vision Res. 2022 Dec;201:108122. doi: 10.1016/j.visres.2022.108122. Epub 2022 Sep 21.
Penguins (Spheniscidae) are a diverse clade of flightless, marine birds. Their eyes, likely a primary driver of behaviour, have been noted to have anatomic adaptations to their amphibious lifestyle. In particular, they have a relatively flat cornea, which would make the transition from a subaerial to a submarine environment require less accommodative effort. However, the ocular dimensions are not known for many penguin species, despite the diversity within the family, and their accommodative abilities have been the source of some dispute. In this study we undertook to establish the basic dimensions of the eye of the smallest, a mid-sized penguin and the second largest penguin. The power of the front surface of the cornea was inversely related to the size of both the eye and penguin, being 41.3 D in the little penguin (Eudyptula minor), a power greater than previously measured in any other penguin species, 26.3 D in the gentoo (Pygoscelis papua) and 19.1 D in the king penguin (Aptenodytes patagonicus). All other dimensions increased or decreased in line with the size of the eye. All penguins were able to achieve emmetropia in air. The gentoo appeared to be emmetropic underwater. A finding of central corneal thickening in some penguins may be artefactual. Calculations using the ocular dimensions demonstrated that the mean retinal illumination of an extended source of light in the little penguin eye is less than that of its larger, deeper-diving relatives.
3. Adaptation and Cryptic Pseudogenization in Penguin Toll-Like Receptors
Steven R Fiddaman, et al. Mol Biol Evol. 2022 Jan 7;39(1):msab354. doi: 10.1093/molbev/msab354.
Penguins (Sphenisciformes) are an iconic order of flightless, diving seabirds distributed across a large latitudinal range in the Southern Hemisphere. The extensive area over which penguins are endemic is likely to have fostered variation in pathogen pressure, which in turn will have imposed differential selective pressures on the penguin immune system. At the front line of pathogen detection and response, the Toll-like receptors (TLRs) provide insight into host evolution in the face of microbial challenge. TLRs respond to conserved pathogen-associated molecular patterns and are frequently found to be under positive selection, despite retaining specificity for defined agonist classes. We undertook a comparative immunogenetics analysis of TLRs for all penguin species and found evidence of adaptive evolution that was largely restricted to the cell surface-expressed TLRs, with evidence of positive selection at, or near, key agonist-binding sites in TLR1B, TLR4, and TLR5. Intriguingly, TLR15, which is activated by fungal products, appeared to have been pseudogenized multiple times in the Eudyptes spp., but a full-length form was present as a rare haplotype at the population level. However, in vitro analysis revealed that even the full-length form of Eudyptes TLR15 was nonfunctional, indicating an ancestral cryptic pseudogenization prior to its eventual disruption multiple times in the Eudyptes lineage. This unusual pseudogenization event could provide an insight into immune adaptation to fungal pathogens such as Aspergillus, which is responsible for significant mortality in wild and captive bird populations.