01 June, 2022
When our genes are used by cells, molecular machinery reads the base pairs and prepares a transcript called messenger RNA that can be translated into proteins. Most of the genes in our body can generate multiple distinctive messenger RNAs through a process called alternative splicing. In fact, recent studies predict that more than 95% of human genes are alternatively spliced. The research conducted here in the Turner lab is focused on deciphering the role of this complex mechanism mediated by RNA binding proteins on the differentiation of lymphocytes. We use a number of tools and techniques which are always improving, and have recently developed our own.
What are lymphocytes and why do they need alternative splicing?
Lymphocytes are immune cells that are vital for the adaptive immune response. There are multiple types of lymphocytes including B and T cells which remember past infections. During an infection, lymphocytes undergo changes in their gene expression and transcriptome, the mRNA produced in the cell, to fulfil their function in the immune response. To illustrate, B cells differentiate into plasma cells producing antibodies to neutralise the immediate threat, but they also form memory cells that persist and rapidly become plasma cells upon reinfection. For one cell population to be able to mount these types of responses, gene expression is dynamically regulated by a variety of mechanisms before and after the messenger RNA is generated.
Alternative splicing is one of the post-transcriptional events which regulates the diversity of the RNA and in turn protein isoforms. Protein isoforms are similar but not identical and so have different functions. Therefore, the ability to splice genes is crucial for a successful immune response.
How are different messenger RNAs produced?
Most of our genes generate multiple distinctive messenger RNAs. Different messenger RNA molecules may have different levels of stability or even change the structure of the protein it makes. Our genes are split in exons, which act like building blocks for the messenger RNA. Alternative splicing regulates use of different exons and so although the gene has not changed, the messenger RNA at the end can be different depending on the building blocks that get selected.
The selection of alternative splice sites found in the messenger RNA precursors results in the formation of different messenger RNA isoforms.
Alternative splicing is mediated by diverse mechanisms. This can include inclusion or exclusion of introns and exons as well as alteration of transcription start and end sites.
Many molecules are involved in the coordination of the alternative splicing. RNA binding proteins (RBPs) influence the decision and execution of the splicing depending on the particular RBPs present, and how many there are.
The differentiation of lymphocytes is a good example of this mechanism. Before an infection is detected, B cells express one isoform of a gene which might be important for the maintenance of naïve B cell. Upon an infection, the same cell expresses a new isoform which is essential for the differentiation of B cells into antibody producing plasma cells or memory cells.
One example for this is the production of the antibody IgM, which comes in two forms, one secreted and one membrane spanning. The isoforms encoding secreted IgM increase and those encoding the membrane spanning isoforms of IgM decrease upon the differentiation of B cells into antibody producing plasma cells [1]. The secreted IgM then neutralises the antigens of the immune threat.
How are the alternatively spliced isoforms measured by researchers?
As researchers, we are keen to understand more about the different spliced transcripts. So called ‘high-throughput’ methods such as RNA-sequencing are used for the detection of alternative transcript isoforms. In the majority of studies, the identification of alternative isoforms relies on short-read sequencing, which requires computational tools for transcriptome assembly from small fragments of RNA.
These methods are time consuming and don’t provide accurate transcript reconstruction. Sometimes the short-read method leaves gaps, and it can be difficult to get a good idea of the transcription start and end sites or when genes have many exons with complex splicing patterns. This limits our understanding of lymphocyte genome splicing.
Fragmented short reads are challenging to assemble and generate reference genomes. Hence, the shortcomings of this technique make it impossible to detect the complex transcript variants reliably Therefore, the need for longer reads led to the generation of new sequencing platforms.
The next generation
The next generation sequencing method using nanopores allows full-length transcript sequencing, which overcomes the limitations of short-read sequencing. A pioneer in this field is Oxford Nanopore Technology, which produces portable and small sequencing devices. Nanopores are embedded in a membrane for direct sequencing of single stranded DNA or RNA. As single strand of DNA slides through the pore, a reader in the pore detects single nucleotides by alterations in ionic current signals.
Long read sequencing using the standard method requires plenty of starting RNA material, something that can be difficult to collect a lot of if the cells under investigation only exist in small numbers. When the amount of starting RNA is small, there are enough read outs to draw conclusions.
To overcome this barrier, we adapted this ‘long read’ technology to identify the different transcript isoforms that only needed a small input. Though several rounds of optimisation, trying different methods allowed us to decrease the input material and to increase the number of reads generated by nanopore sequencing.
With our method, we improved the coverage of the gene by 10 times, allowing us to analyse differential gene expression and isoform expression more accurately than before.
Using this method, we could measure the number of alternative spliced variants of genes which are essential for lymphocyte activation and differentiation. We were able to identify and quantitate Opa1 alternatively spliced variants combining two alternatively spliced events that could not be resolved by other sequencing methods. These findings contributed to one of our research studies where the alternatively spliced genes in mouse CD8 T cell activation were investigated [3].
"Nexons” the future of isoform detection from nanopore data
One of the challenges of nanopore sequencing is the bioinformatics analysis of the data which is not easy. We investigated three different bioinformatic tools for the detection of transcript isoform from long-reads. The splice variants detected by those tools were not consistent with our findings from nanopore raw data and previous findings with other data sets generated in our lab with short-read sequencing. All of the bioinformatic tools lacked the ability to detect the splicing variants of complex isoforms accurately.
Our team, Ozge Gizlenci (me), Simon Andrews, Laura Biggins and Louise Matheson, decided to improve the splice variant detection by developing a new bioinformatic tool. We named this tool “Nexons” a new method of analysis for the detection of transcript isoforms from nanopore data. Nexons can identify alternative isoforms in an accurate and complete way.
Testing Nexons
Having developed a better method for isoform detection, we wanted to see how it would work with real human data. In collaboration with Daniel Hodson and Jingyu Chen at Cambridge Stem Cell Institute, we used our Nexons method to characterise the B cells in the human germinal centre, where B cell differentiation happens. Our results showed the alternative isoform use in different compartments of human germinal centre.
What’s next for Nexons?
We are excited to see what we can discover using Nexons. Working with our colleagues and after establishing of the local and international collaborations, we aim to contribute to the understanding of RNA biology in the germinal centre reactions and to find answers to the question arising from abnormal functions of the immune system and related diseases.
As the majority of the multi-exon human genes are alternatively spliced, scientists are increasingly turning their attention to the association between alternative splicing and human diseases. Our method will contribute to relating these alternative splicing changes in genes to diseases in other system besides immune related diseases.
1. Lamson G, Koshland ME. Changes in J chain and mu chain RNA expression as a function of B cell differentiation. J Exp Med. 1984;160(3):877-892. doi:10.1084/JEM.160.3.877
2. Picelli S, Björklund ÅK, Faridani OR, Sagasser S, Winberg G, Sandberg R. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nature Methods. Published online 2013. doi:10.1038/nmeth.2639
3. D’Angeli V, Monzón-Casanova E, Matheson LS, et al. Polypyrimidine tract binding protein 1 regulates the activation of mouse CD8 T cells. European Journal of Immunology. 2022;0:1-11. doi:10.1002/EJI.202149781
01 June 2022