Nucleic Acids Research
Home TREND-DB—a transcriptome-wide atlas of the dynamic landscape of alternative polyadenylation
TREND-DB—a transcriptome-wide atlas of the dynamic landscape of alternative polyadenylation
TREND-DB—a transcriptome-wide atlas of the dynamic landscape of alternative polyadenylation

Article Type: research-article Article History
Abstract

Alternative polyadenylation (APA) profoundly expands the transcriptome complexity. Perturbations of APA can disrupt biological processes, ultimately resulting in devastating disorders. A major challenge in identifying mechanisms and consequences of APA (and its perturbations) lies in the complexity of RNA 3′ end processing, involving poorly conserved RNA motifs and multi-component complexes consisting of far more than 50 proteins. This is further complicated in that RNA 3′ end maturation is closely linked to transcription, RNA processing and even epigenetic (histone/DNA/RNA) modifications. Here, we present TREND-DB (http://shiny.imbei.uni-mainz.de:3838/trend-db), a resource cataloging the dynamic landscape of APA after depletion of >170 proteins involved in various facets of transcriptional, co- and post-transcriptional gene regulation, epigenetic modifications and further processes. TREND-DB visualizes the dynamics of transcriptome 3′ end diversification (TREND) in a highly interactive manner; it provides a global APA network map and allows interrogating genes affected by specific APA-regulators and vice versa. It also permits condition-specific functional enrichment analyses of APA-affected genes, which suggest wide biological and clinical relevance across all RNAi conditions. The implementation of the UCSC Genome Browser provides additional customizable layers of gene regulation accounting for individual transcript isoforms (e.g. epigenetics, miRNA-binding sites and RNA-binding proteins). TREND-DB thereby fosters disentangling the role of APA for various biological programs, including potential disease mechanisms, and helps identify their diagnostic and therapeutic potential.

Marini,Scherzinger,and Danckwardt: TREND-DB—a transcriptome-wide atlas of the dynamic landscape of alternative polyadenylation

INTRODUCTION

Next-generation RNA sequencing (RNA-seq) has led to the discovery of a perplexingly complex metazoan transcriptome architecture that results from the alternative use of transcription start sites, exons and introns, and polyadenylation sites (1–3). The combinatorial use and incorporation of such elements into mature transcript isoforms significantly expands genomic information and is subject to dynamic spatial and temporal modulation during development and adaptation. Recently, diversification of the transcriptome at the 3′ end by alternative polyadenylation (APA) evolved as an important and evolutionarily conserved layer of gene regulation (4,5). It results in transcript isoforms that vary at the RNA 3′ end, which can have profoundly different physiological effects, by encoding proteins with distinct functions or regulatory properties, or by affecting the mRNA fate through inclusion or exclusion of regulatory elements, such as sites recognized by miRNAs or RNA-binding proteins (6).

Constitutive RNA 3′ end processing relies on a complex macromolecular machinery that catalyzes endonucleolytic cleavage and polyadenylation (CPA) of pre-mRNA molecules (7). This involves the assembly of four multi-component protein complexes (CPSF, CSTF, CFI and CFII) (8) on the pre-mRNA at dedicated, but largely poorly conserved, processing sites (9). Differential expression of individual complex components (10–12) or regulation of their binding properties (e.g. by signaling (13)) can direct the dynamic modulation of APA resulting in transcript isoforms with alternative 3′ ends. In addition, other mechanisms including RNA polymerase II kinetics (14,15) or epigenetic events (16–22) regulate APA, illustrating a previously unanticipated complex cross-talk between various cellular processes in the control of transcriptome diversity. Although difficult to detect by standard high-throughput profiling techniques (23–25), dynamic changes at the transcriptome 3′ end are widespread (26–29). They affect >70% of all genes.

Global APA changes are commonly associated with differentiation and de-differentiation processes (30). This requires a well-coordinated temporal and spatial interaction between RNA motifs and the multi-component processing complex to ensure that CPA occurs timely and at the right position (31,32). Hence, 3′ end processing is tightly coupled to transcription and other co- and post-transcriptional processes (7,33), and controlled by delicate (auto-) regulatory mechanisms (34,35). Transcripts that show dynamic regulation at the 3′ end are typically encoded by phylogenetically old genes, which corresponds to the phylogenetic age of most executing APA regulators (12). Finally, individual components of the CPA machinery are epistatic during differentiation and de-differentiation; they display functional dominance over ‘neighboring’ factors within and across the multi-component CPA complexes in the control of APA (12).

While the underlying mechanisms of APA regulation are still being elucidated, disruption of this process clearly proved to be clinically relevant (36). For example, APA perturbations are associated with various disorders (11,37–41) (and references therein), but they can also possess direct disease eliciting activities, act as oncogenic driver and thereby mimic genetic alterations (12,42). Importantly, such changes are generally missed in genome profiling endeavors (43). But often they also remain undetected by standard RNA-seq technologies (44). However, even when resulting in primarily subtle changes of non-coding RNA sequence elements in the 3′UTR, APA perturbations can be functionally significant (12) and represent unexpectedly potent novel biomarkers (12,24). Aberrant post-transcriptional expansion of the genome complexity can thus result in most devastating consequences; at the same time, it also opens novel diagnostic (12,24) and therapeutic avenues (45,46).

In an attempt to systematically identify drivers of APA in tumorigenic differentiation/dedifferentiation processes (12), we recently applied a comprehensive RNAi screening coupled to a high-throughput sequencing approach suited to capture polyadenylated transcript 3′ ends of numerous experimental conditions (TRENDseq) (44). We depleted >170 proteins, including all known factors involved in pre-mRNA 3′ end CPA in eukaryotes (8) and selected key factors regulating transcriptional activities, splicing, RNA turnover and other functions (47–59) that could directly or indirectly affecting APA (5,60). This resulted in the identification of numerous drivers directing APA including the transcription termination factor PCF11 with a critical role in neurodevelopment and tumor formation (12).

Here, we present TREND-DB, the full roll-out of a user-friendly database, which in complementation to existing static polyA DBs (61–64) (and references therein), catalogs the dynamic nature of the APA landscape genome-wide upon targeting >170 components involved in the definition of RNA 3′ ends. We provide a highly interactive APA network map displaying central hubs in APA regulation. TREND-DB allows querying effects of selected APA-regulators including condition-specific Gene Ontology (GO) (65) enrichment analyses of APA-affected genes. Including an instance of the UCSC Track Hub framework also allows the investigation of other gene regulatory mechanisms with reference to the transcriptome-wide APA data (e.g. miRNA and RNA-binding proteins, epigenetics), or alternatively the direct comparison with own data, uploaded to the genome browser. TREND-DB thereby helps to illuminate the role of APA for various biological programs and potential disease mechanisms, and allows to identify their diagnostic and therapeutic potential. The functional enrichment across all knockdown conditions already suggests wide biological and clinical relevance.

DATA COLLECTION AND PROCESSING

TRENDseq data collection and processing

Applying TRENDseq combined with an RNAi screening targeting >170 potential APA regulators (12), we previously identified >9000 APA-events (out of approximately 20 000 expressed transcripts), corresponding to >3600 genes significantly affected by APA. Briefly, to this end we first generated a TREND annotation assembly based on 3′ end reads to identify bona fide polyadenylation events (23). This annotation was used to exclude TRENDseq reads originating from internal priming on genome encoded adenosine-rich regions. TRENDseq data originating from the RNAi screening were demultiplexed, trimmed, mapped to the human hg38 genome and filtered from internal priming events using the TREND annotation (for further details, see (12) and Supplementary Data).

Quantification and global characteristics of APA events

The number of reads aligned to each polyadenylation site is used as a proxy for the expression of an individual 3′ end transcript isoform. The expression level of each transcript isoform was examined by Fisher’s exact test in comparison to the respective other 3′ end isoform(s) expressed by the same gene. A contingency table included the number of reads of the tested isoform and total amount of reads of all the other isoforms of the gene (for the knockdown and control samples, respectively). Obtained P-values were adjusted using the Benjaminiñ–Hochberg method, and a threshold on the adjusted P-value ≤ 0.05 filter was applied (for further details on normalization, fold change of regulation and calculation of the shortening index see (12)).

Interestingly, almost all depleted factors showed APA-regulation on average affecting 130 genes (Figure 1) with a mean distance between APA-regulated sites of approximately 1020 nucleotides (Supplementary Table S1). While the vast majority of APA events (65%) occur in the 3′UTR (‘tandem polyadenylation’), 20% of the events affect internal polyadenylation (i.e. within introns or the coding region), thereby modulating the generation of truncated protein isoforms. Directly comparing the effects quantitatively and qualitatively, we observed that APA is predominantly driven by components of the CPA machinery (38% of all APA events, with key factors affecting up to 1400 genes (e.g. NUDT21, CPSF6 and PCF11), average effect 319 genes) and that their function is mainly non-redundant (12). However, a significant proportion of APA (62%) is controlled by components involved in transcription (14%, on average affecting APA of 133 genes) and other co- and post-transcriptional events (e.g. splicing and RNA-turnover (16% and 9%; on average affecting APA of 128 and 89 genes, respectively)) or epigenetic modification. This corresponds to a quantitatively comparable extent to which individual splicing factors influence alternative splicing (66). Notably, while the depletion of the vast majority of factors results in even amounts of shortened and lengthened transcript isoforms, the depletion of the CFI and CFII complex components (CPSF6, NUDT21 and PCF11) directs APA in an almost exclusive unidirectional manner resulting in uniformly shortened (CPSF6 and NUDT21) or lengthened (PCF11) transcript isoforms (Figure 1B). In particular, our screening also identified APA regulation to be associated with factors involved in genome surveillance or known to drive tumor-suppressive programs (e.g. TP53), as well as other processes involved in the coupling between oncogenic signals and 3′ end processing, such as BARD1 (67).

Data preview of TREND-DB cataloging the dynamic landscape of alternative polyadenylation. (A) 3′ End processing of polyadenylated RNAs relies on a complex macromolecular machinery that catalyzes endonucleolytic cleavage and polyadenylation (CPA) of the nascent pre-mRNA molecule. CPA is carried out after assembly of four multi-component protein complexes (CPSF, CSTF, CFI and CFII) interacting with multiple RNA motifs. This is tightly coupled to transcription, other steps of RNA processing and even epigenetic modifications. Differential expression of individual complex components or regulation of their binding properties can redirect CPA to alternative processing sites. In addition, further mechanisms including RNA polymerase II kinetics can regulate alternative polyadenylation. (B) Global overview of alternative polyadenylation (obtained by transcriptome-wide TRENDseq) after depletion >170 components involved in various facets of transcriptional, co- and post-transcriptional gene regulation, epigenetic modifications, and further processes with a potential role in the definition of RNA 3′ ends. APA shortening and lengthening refer to CPA at an upstream or downstream positioned poly(A) site (PAS). This can modulate CPA at alternative PASs located in the 3′UTR (‘tandem’ APA) or in the coding sequence (CDS) and/or intronic region (‘internal’ APA), resulting in transcript isoforms with distinct regulatory properties or encoding c-terminally modified proteins. TREND-DB is designed to provide interactive and accessible exploration of this data set.
Data preview of TREND-DB cataloging the dynamic landscape of alternative polyadenylation. (A) 3′ End processing of polyadenylated RNAs relies on a complex macromolecular machinery that catalyzes endonucleolytic cleavage and polyadenylation (CPA) of the nascent pre-mRNA molecule. CPA is carried out after assembly of four multi-component protein complexes (CPSF, CSTF, CFI and CFII) interacting with multiple RNA motifs. This is tightly coupled to transcription, other steps of RNA processing and even epigenetic modifications. Differential expression of individual complex components or regulation of their binding properties can redirect CPA to alternative processing sites. In addition, further mechanisms including RNA polymerase II kinetics can regulate alternative polyadenylation. (B) Global overview of alternative polyadenylation (obtained by transcriptome-wide TRENDseq) after depletion >170 components involved in various facets of transcriptional, co- and post-transcriptional gene regulation, epigenetic modifications, and further processes with a potential role in the definition of RNA 3′ ends. APA shortening and lengthening refer to CPA at an upstream or downstream positioned poly(A) site (PAS). This can modulate CPA at alternative PASs located in the 3′UTR (‘tandem’ APA) or in the coding sequence (CDS) and/or intronic region (‘internal’ APA), resulting in transcript isoforms with distinct regulatory properties or encoding c-terminally modified proteins. TREND-DB is designed to provide interactive and accessible exploration of this data set.
Figure 1.

Data preview of TREND-DB cataloging the dynamic landscape of alternative polyadenylation. (A) 3′ End processing of polyadenylated RNAs relies on a complex macromolecular machinery that catalyzes endonucleolytic cleavage and polyadenylation (CPA) of the nascent pre-mRNA molecule. CPA is carried out after assembly of four multi-component protein complexes (CPSF, CSTF, CFI and CFII) interacting with multiple RNA motifs. This is tightly coupled to transcription, other steps of RNA processing and even epigenetic modifications. Differential expression of individual complex components or regulation of their binding properties can redirect CPA to alternative processing sites. In addition, further mechanisms including RNA polymerase II kinetics can regulate alternative polyadenylation. (B) Global overview of alternative polyadenylation (obtained by transcriptome-wide TRENDseq) after depletion >170 components involved in various facets of transcriptional, co- and post-transcriptional gene regulation, epigenetic modifications, and further processes with a potential role in the definition of RNA 3′ ends. APA shortening and lengthening refer to CPA at an upstream or downstream positioned poly(A) site (PAS). This can modulate CPA at alternative PASs located in the 3′UTR (‘tandem’ APA) or in the coding sequence (CDS) and/or intronic region (‘internal’ APA), resulting in transcript isoforms with distinct regulatory properties or encoding c-terminally modified proteins. TREND-DB is designed to provide interactive and accessible exploration of this data set.

DATABASE CONTENT AND USAGE

To make the dynamics of APA accessible to a wider scientific community, we constructed a user friendly, highly interactive database. In the database, we store (i) summary files for each of the 175 knockdown experiments, where effect size (reported as fold changes) and significance (as adjusted P-values) of each isoform, together with the shortening index (SI), which describes the overall tendency of a given gene to express shortened (or lengthened, respectively) transcript isoforms (based on a proxy of the two most significantly affected APA isoforms), are reported; (ii) bigWig files with the coverage for the isoform peaks, used to compute the SI values across conditions; (iii) BED files with genomic positions for curated lists of poly(A) sites and known microRNA-binding sites for the hg38 assembly (Figure 2, upper section).

Overview of the exploration workflow with TREND-DB. TRENDseq data were aligned, quantified and processed (12) to obtain the expression values of APA transcript isoforms, and the corresponding shortening index (SI) across all siRNA-conditions (displayed in the ‘Data Preview’ tab). The ‘Main View’, where an interactive APA-network map is displayed, enables users to inspect both dimensions of the underlying resource. Upon selection of a gene (left, ‘Gene View’), all siRNA conditions affecting APA of this gene are displayed, accompanied by the TRENDseq coverage plot centered on the desired region (3′UTR or gene body); genes showing a similar APA phenotype (based on the SI profiles) can also be retrieved. When selecting a siRNA-condition (right, ‘Condition View’), all APA-affected genes are listed, and users can perform functional enrichment analysis on the fly. External databases can be reached via dynamically generated action buttons, reducing the time to retrieve additional information. A dedicated instance of the UCSC Browser is provided, containing all the genomic tracks of the TRENDseq data set, with the possibility to integrate custom tracks.
Figure 2.

Overview of the exploration workflow with TREND-DB. TRENDseq data were aligned, quantified and processed (12) to obtain the expression values of APA transcript isoforms, and the corresponding shortening index (SI) across all siRNA-conditions (displayed in the ‘Data Preview’ tab). The ‘Main View’, where an interactive APA-network map is displayed, enables users to inspect both dimensions of the underlying resource. Upon selection of a gene (left, ‘Gene View’), all siRNA conditions affecting APA of this gene are displayed, accompanied by the TRENDseq coverage plot centered on the desired region (3′UTR or gene body); genes showing a similar APA phenotype (based on the SI profiles) can also be retrieved. When selecting a siRNA-condition (right, ‘Condition View’), all APA-affected genes are listed, and users can perform functional enrichment analysis on the fly. External databases can be reached via dynamically generated action buttons, reducing the time to retrieve additional information. A dedicated instance of the UCSC Browser is provided, containing all the genomic tracks of the TRENDseq data set, with the possibility to integrate custom tracks.

The definition of the software requirements has been established to cover the typical use cases, where the users can view, filter and visualize the underlying data. An overview is followed by dedicated gene- and condition-centered views, offering the possibility to store the generated visual and tabular outputs (Figure 2).

The TREND-DB application is developed in R (68) as a Shiny web application (69), leveraging the reactive programming paradigm offered by this framework. This framework has been chosen over existing other ecosystems such as BioPyramid (70) for its simplicity and efficiency of reactivity, together with the customizability of the analysis components enabled by the R programming language. Moreover, we leverage on a number of core packages and infrastructures, such as GenomicRanges (71), from the Bioconductor framework, which efficiently performs a variety of operations in omics data contexts (72). Tabular information is displayed in TREND-DB as interactive tables, generated with the DT package (73), as an R-based interface to the JavaScript DataTables library, which contains useful features, such as pagination, sorting and filtering. Plots of genomic regions (3′UTR and gene bodies) for selected features are implemented with the Gviz package (74).

The user interface

The user interface is based on a dashboard page structure from the shinydashboard package (75), where the main functionality is divided into a series of tab-based views (represented by the blocks in Figure 2).

The ‘Data Preview’ tab serves as a starting point for exploring the provided data set (see also the dedicated guided tours in the TREND-DB web browser). It offers an overview table that contains all genes and shows their SI values in all involved knockdown conditions, enabling to filter for defined SI thresholds applied on absolute values. Selecting a gene in the overview table prints an additional table, listing all the knockdown conditions in which it is affected.

The ‘Main View’ tab first displays an interactive view of the APA network, implemented with the visNetwork package (76). The APA network map (Figures 2 and 3A) essentially depicts cooperative (and antagonistic) interactions between APA regulators (affecting APA of identical genes). While the radius of the nodes reflects the number of APA affected genes (per condition), the links between a pair of nodes depict synergism (green, i.e. unidirectional lengthening or shortening) or antagonism (red, i.e. reciprocal lengthening or shortening) on APA of the common pool of genes, with the edge width encoding the significance of the over-representation of genes being synergistically or antagonistically regulated (BH-adjusted P-values) (12). Remarkably, the network layout reflects known protein complexes involved in polyadenylation (8) and the cross-talk to other RNA-processing events (7), recapitulating the underlying molecular architecture.

Selected screenshots to illustrate the exploration of the APA landscape with TREND-DB. From the ‘Main View’ tab, users can select any condition among those included in the RNAi screening, either from a dropdown menu or from the interactive APA network (A). For example, upon selection of CPSF6 (component of the CFIm protein complex, highlighted in the circle), ‘neighboring’ components/complexes that also regulate identical APA-affected target genes are highlighted. In this case, CPSF6 was found to regulate APA with a global shortening bias, as displayed by the scatterplot in (B) summarizing the PAS usage. Links to external databases (C) are automatically displayed for simplifying further exploration steps beyond TREND-DB. Moreover, a table containing the affected genes (sorted by the SI in (D)) is displayed, and can be used to further explore the APA phenotype of a particular gene, e.g. OAZ1, both in the ‘Main View’ (E), where general information is shown, and in the ‘Gene Plot’ tab, where users can inspect the coverage tracks for a locus of interest, such as the 3′UTR, where the effects of isoform shortening or lengthening are shown (F). Using the whole set of affected genes in a particular screening condition (i.e. knockdown), TREND-DB enables the direct calculation of functional enrichment of APA-affected target genes. The functional enrichment is displayed in an interactive table (G), or graphically as an enrichment map (H) for easier interpretation (for improving the readability, the single node labels have been replaced with a representative biological process for the clustered subset).
Figure 3.

Selected screenshots to illustrate the exploration of the APA landscape with TREND-DB. From the ‘Main View’ tab, users can select any condition among those included in the RNAi screening, either from a dropdown menu or from the interactive APA network (A). For example, upon selection of CPSF6 (component of the CFIm protein complex, highlighted in the circle), ‘neighboring’ components/complexes that also regulate identical APA-affected target genes are highlighted. In this case, CPSF6 was found to regulate APA with a global shortening bias, as displayed by the scatterplot in (B) summarizing the PAS usage. Links to external databases (C) are automatically displayed for simplifying further exploration steps beyond TREND-DB. Moreover, a table containing the affected genes (sorted by the SI in (D)) is displayed, and can be used to further explore the APA phenotype of a particular gene, e.g. OAZ1, both in the ‘Main View’ (E), where general information is shown, and in the ‘Gene Plot’ tab, where users can inspect the coverage tracks for a locus of interest, such as the 3′UTR, where the effects of isoform shortening or lengthening are shown (F). Using the whole set of affected genes in a particular screening condition (i.e. knockdown), TREND-DB enables the direct calculation of functional enrichment of APA-affected target genes. The functional enrichment is displayed in an interactive table (G), or graphically as an enrichment map (H) for easier interpretation (for improving the readability, the single node labels have been replaced with a representative biological process for the clustered subset).

The remainder of the tab is separated into two sections, corresponding to the 2D of the database: (i) a ‘Gene View’ and (ii) a ‘Condition View’ (Figure 2, central section). Any gene or condition selection in the ‘Data Preview’ tab is automatically carried over to this tab, while a (different) gene or condition can also be selected from the respective dropdown menu. Upon selecting a gene, the ‘Gene View’ section displays a brief summary of the gene, including name, coordinates, strand, chromosome and a description (Figure 3E). Hyperlinks to additional resources, such as the NCBI RefSeq database, the GTEx Portal, the Human Protein Atlas or the dbPTM database (77–80), are provided as well to facilitate further exploration. On the other hand, once an experimental condition has been selected, the ‘Condition View’ provides a list with all genes affected, their SIs and P-values for the shorter and longer isoforms, according to the APA usage (Figure 3D). Users may view a scatterplot for each condition, depicting the distribution of proximal and distal poly(A) site usage among the affected genes (Figure 3B).

Additionally, a built-in GO term enrichment analysis tool (Figure 3G) performs functional analysis on the set of genes affected in the selected condition against a background of all genes, as a potential starting point for more in-depth analysis—the interactive table created in TREND-DB automatically links to the corresponding entries in the AmiGO database (81), and can be represented visually as an enrichment map for better overview of the impacted biological themes (82). For example, genes affected in the AGO2 knockdown display among the top enriched results three retina-related GO terms, a tissue where 3′UTR shortening is often observed (83). However, there are also other interesting examples of GO enrichments underlining the wide biological and clinical relevance across knockdown conditions, for example suggesting not only a potential bidirectional mechanistic link between alternative polyadenylation and alternative splicing, but also mechanisms in UTR biology towards immunobiology or developmental processes (Figure 3H).

We and others earlier demonstrated that the relative abundance of transcript isoforms generated by APA has strong diagnostic (prognostic) potential, with superior performance in predicting death or high risk in cancer patients compared to commonly used clinical stratification markers (12). In addition, disentangling the functional role of (select) transcript isoforms opens ample therapeutic opportunities to interfere with polyadenylation in a selective or non-selective fashion (45,46). This requires further in-depth information on where APA occurs. We thus implemented functions that allow to visualize where APA affected genes are differentially polyadenylated, whether this affects the coding region of the gene (‘internal polyadenylation’) and/or non-coding UTR elements (‘tandem polyadenylation’, see above) and whether there are other conditions that result in a similar APA ‘pattern’ of the same gene.

The ‘Gene Plot’ tab creates overview plots of a gene and its associated data (Figure 3F). These plots contain a gene region reference track, coverage tracks for the selected condition (otherwise defaulting to the condition where the maximum SI is observed) as well as the control, and a genomic track for known APA sites. By default, overview plots are centered around the 3′UTR region of the gene; however, one can display the whole gene body region in the plot. Generated plots can be downloaded using a download button. Similarly affected genes, suggested in the interface component below the plot, can also be selected and displayed.

To enable a seamless integration with the wealth of genomic information provided by other sources, such as the UCSC Genome Browser (84), we set up an instance of the UCSC Track Hub (85), where the data are stored on a secure FTP server and the required configuration files (hub.txt, genomes.txt and trackDb.txt) are specified in the TREND-DB GitHub repository, specifying the properties of the hub data tracks (genome assemblies, name, labels, descriptions and display parameters). This content is integrated in the ‘Genome Browser’ tab of TREND-DB via an iframe HTML element (for displaying web pages within other web pages). A custom URL is generated, accounting for all current selections made by the user, and passes it to the iframe, where the Genome Browser mirrors the behavior of the TREND-DB application.

While the ‘Gene Plot’ view has a more specific aim on visualizing TRENDseq data (e.g. whether APA primarily affects the 3′UTR or the coding region), the ‘Genome Browser’ view constitutes a bridge to a more flexible environment, where users can upload additional resources, such as existing tracks for their own genomic data, and facilitate the integration with the data set of TREND-DB. As a proof-of-concept we implemented miRNA tracks; however, this can also be expanded to other resources e.g. CLIPseq (86), global interactome data (87,88) or DNA/RNA modifications (19,21,22).

Altogether these functions enable accessible and extendable exploration for the TREND-DB data sets, for unbiased and targeted analysis of downstream functions, for assessment of the diagnostic potential of APA signatures and for the development and design of potential therapeutic strategies (89). TREND-DB thereby facilitates access and reuse of such complex data by a broader scientific community complying with the FAIR data principles (90).

Simplifying the exploration for users

A common issue when exploring a series of data with a new application is how to efficiently guide the users across the different pieces of functionality offered. While text-based tutorials and walk-through documents (also in video format) can work efficiently for a specific snapshot of the application, it is not a convenient approach for maintenance once new features are added. Moreover, an approach that encourages the users to try out specific actions tightly linked to defined elements in the user interface has proven useful for other applications such as ideal (91) and iSEE (92), offering a learning-by-doing paradigm for showcasing the application features in a more cohesive, workflow-like way.

Each tab of TREND-DB has a dedicated question mark action button, which triggers bespoke step-by-step tours of the functionality offered in each view, via the rintrojs package (93). We guide the audience in the original documentation by displaying use cases that refer to a few selected findings reported earlier (12).

Moreover, as TRENDseq data are not as common as e.g. transcriptome data sets, we included a Glossary section in the main dropdown menu, precisely defining the terms commonly used throughout the app, and specifically referring to the data type under inspection.

SUMMARY AND FUTURE DIRECTIONS

TREND-DB has been conceived as an application to enable easy access to the main data sets from a recently published massive RNAi screening approach on the mechanisms and consequences underlying APA regulation (12). The deployment of such a portal can provide immediate usability of the resources showcased in publications, and thus increase their impact by releasing a more user-friendly version of the data, thus reaching a wider number of interested researchers.

The R/Shiny framework is highly flexible in regard to data manipulation and analysis, therefore we expect straightforward extensions and adaptations to the scenario we proposed in this work, also to other experimental settings. As computational reproducibility gains more attention (94–96) in the scientific community, it will be desirable to support the creation of a report-like document, e.g. implemented with knitr/R Markdown, tracking the performed analyses, similar to the approach followed by the pcaExplorer package (97).

Finally, we anticipate TREND-DB to become a rich resource to explore and validate the impact of genetic and epigenetic variants on APA, and thereby also help to expand our understanding of the genetic origins of disease (98).

AVAILABILITY OF DATABASE, CODE AND DATA

The TREND-DB application has been deployed on the Shiny Server at the Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), and can be accessed at http://shiny.imbei.uni-mainz.de:3838/trend-db.

The data (sequencing raw and processed) displayed in TREND-DB are also available on GEO, under the accession GSE95057 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE95057).

TREND-DB is freely available as a Shiny web application, with source code and processing information available under the MIT license at https://github.com/federicomarini/trend-db.

ACKNOWLEDGEMENTS

The authors would like to express their gratitude to current and former members of the Danckwardt lab.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

DFG [DA 1189/2-1, GRK 1591 and SPP 1935 to S.D.]; Federal Ministry of Education and Research [BMBF 01EO1003 to S.D. and F.M.]; Dr. Hella Bühler Stiftung to S.D.; German Society of Clinical and Laboratory Medicine (DGKL) to S.D.

Conflict of interest statement. None declared.

REFERENCES

1. 

Carninci P., Klein S.A., Seidel D.J., Pearson B.D., Singer S.F., Michaels P.J., Cox D.I., Seidel D.J., Eskridge R.E., Peterson T.C.et al. The transcriptional landscape of the mammalian genome. Science. 2005; 309:15591563.

2. 

Wang E.T., Sandberg R., Luo S., Khrebtukova I., Zhang L., Mayr C., Kingsmore S.F., Schroth G.P., Burge C.B. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008; 456:470476.

3. 

Reyes A., Huber W. Alternative start and termination sites of transcription drive most transcript isoform differences across human tissues. Nucleic Acids Res.2018; 46:582592.

4. 

Elkon R., Ugalde A.P., Agami R. Alternative cleavage and polyadenylation: extent, regulation and function. Nat. Rev. Genet.2013; 14:496506.

5. 

Tian B., Manley J.L. Alternative polyadenylation of mRNA precursors. Nat. Rev. Mol. Cell Biol.2017; 18:1830.

6. 

Mayr C. Regulation by 3′-Untranslated regions. Annu. Rev. Genet.2017; 51:171194.

7. 

Danckwardt S., Hentze M.W., Kulozik A.E. 3′ end mRNA processing: Molecular mechanisms and implications for health and disease. EMBO J.2008; 27:482498.

8. 

Shi Y., Di Giammartino D.C., Taylor D., Sarkeshik A., Rice W.J., Yates J.R., Frank J., Manley J.L. Molecular architecture of the human Pre-mRNA 3′ processing complex. Mol. Cell. 2009; 33:365376.

9. 

Gruber A.J., Schmidt R., Gruber A.R., Martin G., Ghosh S., Belmadani M., Keller W., Zavolan M. A comprehensive analysis of 3′ end sequencing data sets reveals novel polyadenylation signals and the repressive role of heterogeneous ribonucleoprotein C on cleavage and polyadenylation. Genome Res.2016; 26:11451159.

10. 

Jenal M., Elkon R., Loayza-Puch F., Van Haaften G., Kühn U., Menzies F.M., Vrielink J.A., Bos A.J., Drost J., Rooijers K.et al. The poly(A)-binding protein nuclear 1 suppresses alternative cleavage and polyadenylation sites. Cell. 2012; 149:538553.

11. 

Masamha C.P., Xia Z., Yang J., Albrecht T.R., Li M., Shyu A.B., Li W., Wagner E.J. CFIm25 links alternative polyadenylation to glioblastoma tumour suppression. Nature. 2014; 510:412416.

12. 

Ogorodnikov A., Levin M., Tattikota S., Tokalov S., Hoque M., Scherzinger D., Marini F., Poetsch A., Binder H., Macher-Göppinger S.et al. Transcriptome 3′end organization by PCF11 links alternative polyadenylation to formation and neuronal differentiation of neuroblastoma. Nat. Commun.2018; 9:5331.

13. 

Danckwardt S., Gantzert A.S., Macher-Goeppinger S., Probst H.C., Gentzel M., Wilm M., Gröne H.J., Schirmacher P., Hentze M.W., Kulozik A.E. P38 MAPK controls prothrombin expression by regulated RNA 3′ end processing. Mol. Cell. 2011; 41:298310.

14. 

Pinto P.A.B., Henriques T., Freitas M.O., Martins T., Domingues R.G., Wyrzykowska P.S., Coelho P.A., Carmo A.M., Sunkel C.E., Proudfoot N.J.et al. RNA polymerase II kinetics in polo polyadenylation signal selection. EMBO J.2011; 30:24312444.

15. 

Neve J., Burger K., Li W., Hoque M., Patel R., Tian B., Gullerova M., Furger A. Subcellular RNA profiling links splicing and nuclear DICER1 to alternative cleavage and polyadenylation. Genome Res.2016; 26:2435.

16. 

Wood A.J., Schulz R., Woodfine K., Koltowska K., Beechey C.V., Peters J., Bourc’his D., Oakey R.J. Regulation of alternative polyadenylation by genomic imprinting. Genes Develop.2008; 22:11411146.

17. 

Spies N., Nielsen C.B., Padgett R.A., Burge C.B. Biased chromatin signatures around polyadenylation sites and exons. Mol. Cell. 2009; 36:245254.

18. 

Li W., Park J.Y., Zheng D., Hoque M., Yehia G., Tian B. Alternative cleavage and polyadenylation in spermatogenesis connects chromatin regulation with post-transcriptional control. BMC Biol.2016; 14:6.

19. 

Ke S., Alemu E.A., Mertens C., Gantman E.C., Fak J.J., Mele A., Haripal B., Zucker-Scharff I., Moore M.J., Park C.Y.et al. A majority of m 6 A residues are in the last exons, allowing the potential for 3′ UTR regulation. Genes Develop.2015; 29:20372053.

20. 

Brumbaugh J., Di Stefano B., Wang X., Borkent M., Forouzmand E., Clowers K.J., Ji F., Schwarz B.A., Kalocsay M., Elledge S.J.et al. Nudt21 controls cell fate by connecting alternative polyadenylation to chromatin signaling. Cell. 2018; 172:106120.

21. 

Yue Y., Liu J., Cui X., Cao J., Luo G., Zhang Z., Cheng T., Gao M., Shu X., Ma H.et al. VIRMA mediates preferential m6A mRNA methylation in 3′UTR and near stop codon and associates with alternative polyadenylation. Cell Discov.2018; 4:10.

22. 

Parker M.T., Knop K., Sherwood A.V., Schurch N.J., Mackinnon K., Gould P.D., Hall A.J., Barton G.J., Simpson G.G. Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m6A modification. eLife. 2020; 9:e49658.

23. 

Hoque M., Ji Z., Zheng D., Luo W., Li W., You B., Park J.Y., Yehia G., Tian B. Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing. Nat. Methods. 2013; 10:133139.

24. 

Xia Z., Donehower L.A., Cooper T.A., Neilson J.R., Wheeler D.A., Wagner E.J., Li W. Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3′-UTR landscape across seven tumour types. Nat. Commun.2014; 5:5274.

25. 

Ha K. C.H., Blencowe B.J., Morris Q. QAPA: a new method for the systematic analysis of alternative polyadenylation from RNA-seq data. Genome Biol.2018; 19:45.

26. 

Sandberg R., Neilson J.R., Sarma A., Sharp P.A., Burge C.B. Proliferating cells express mRNAs with shortened 3′ untranslated regions and fewer MicroRNA target sites. Science. 2008; 320:16431647.

27. 

Mayr C., Bartel D.P. Widespread shortening of 3′UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell. 2009; 138:673684.

28. 

Ozsolak F., Kapranov P., Foissac S., Kim S.W., Fishilevich E., Monaghan A.P., John B., Milos P.M. Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation. Cell. 2010; 143:10181029.

29. 

Derti A., Garrett-Engele P., MacIsaac K.D., Stevens R.C., Sriram S., Chen R., Rohl C.A., Johnson J.M., Babak T. A quantitative atlas of polyadenylation in five mammals. Genome Res.2012; 22:11731183.

30. 

Turner R.E., Pattison A.D., Beilharz T.H. Alternative polyadenylation in the regulation and dysregulation of gene expression. Semin. Cell Dev. Biol.2018; 75:6169.

31. 

Lutz C.S., Moreira A. Alternative mRNA polyadenylation in eukaryotes: An effective regulator of gene expression. WIRES RNA. 2011; 2:2231.

32. 

Gruber A.J., Zavolan M. Alternative cleavage and polyadenylation in health and disease. Nat. Rev. Genet.2019; 20:599614.

33. 

Millevoi S., Vagner S. Molecular mechanisms of eukaryotic pre-mRNA 3′ end processing regulation. Nucleic Acids Res.2009; 38:27572774.

34. 

Kamieniarz-Gdula K., Gdula M.R., Panser K., Nojima T., Monks J., Wiśniewski J.R., Riepsaame J., Brockdorff N., Pauli A., Proudfoot N.J. Selective roles of vertebrate PCF11 in premature and Full-Length transcript termination. Mol. Cell. 2019; 74:158172.

35. 

Wang R., Zheng D., Wei L., Ding Q., Tian B. Regulation of intronic polyadenylation by PCF11 impacts mRNA expression of long genes. Cell Rep.2019; 26:27662778.

36. 

Nourse J., Spada S., Danckwardt S. Emerging roles of RNA 3′ end cleavage and polyadenylation in pathogenesis, diagnosis and therapy of human disorders. Biomolecules. 2020; 10:812.

37. 

Lin Y., Li Z., Ozsolak F., Kim S.W., Arango-Argoty G., Liu T.T., Tenenbaum S.A., Bailey T., Monaghan A.P., Milos P.M.et al. An in-depth map of polyadenylation sites in cancer. Nucleic Acids Res.2012; 40:84608471.

38. 

Weng T., Ko J., Masamha C.P., Xia Z., Xiang Y., Chen N.-Y., Molina J.G., Collum S., Mertens T.C., Luo F.et al. Cleavage factor 25 deregulation contributes to pulmonary fibrosis through alternative polyadenylation. J. Clin. Invest.2019; 129:19841999.

39. 

Patel R., Brophy C., Hickling M., Neve J., Furger A. Alternative cleavage and polyadenylation of genes associated with protein turnover and mitochondrial function are deregulated in Parkinson’s, Alzheimer’s and ALS disease. BMC Med. Genomics. 2019; 12:60.

40. 

Ye C., Zhou Q., Hong Y., Li Q.Q. Role of alternative polyadenylation dynamics in acute myeloid leukaemia at single-cell resolution. RNA Biol.2019; 16:785797.

41. 

Soetanto R., Hynes C., Patel H., Humphreys D., Evers M., Duan G., Parker B., Archer S., Clancy J., Graham R.et al. Role of miRNAs and alternative mRNA 3′ end cleavage and polyadenylation of their mRNA targets in cardiomyocyte hypertrophy. Biochim. Biophys. Acta - Gene Regul. Mech.2016; 1859:744756.

42. 

Lee S.H., Singh I., Tisdale S., Abdel-Wahab O., Leslie C.S., Mayr C. Widespread intronic polyadenylation inactivates tumour suppressor genes in leukaemia. Nature. 2018; 561:127131.

43. 

Stacey S.N., Sulem P., Jonasdottir A., Masson G., Gudmundsson J., Gudbjartsson D.F., Magnusson O.T., Gudjonsson S.A., Sigurgeirsson B., Thorisdottir K.et al. A germline variant in the TP53 polyadenylation signal confers cancer susceptibility. Nat. Genet.2011; 43:10981103.

44. 

Ogorodnikov A., Kargapolova Y., Danckwardt S. Processing and transcriptome expansion at the mRNA 3′ end in health and disease: finding the right end. Pflügers Archiv - Eur. J. Physiol.2016; 468:9931012.

45. 

Vorlová S., Rocco G., LeFave C.V., Jodelka F.M., Hess K., Hastings M.L., Henke E., Cartegni L. Induction of antagonistic soluble decoy receptor tyrosine kinases by intronic PolyA activation. Mol. Cell. 2011; 43:927939.

46. 

Araki S., Nakayama Y., Sano O., Nakao S., Shimizu-Ogasawara M., Toyoshiba H., Nakanishi A., Aparicio S. Decoding transcriptome dynamics of Genome-Encoded polyadenylation and autoregulation with Small-Molecule modulators of alternative polyadenylation. Cell Chem. Biol.2018; 25:14701484.

47. 

Bava F.-A., Eliscovich C., Ferreira P.G., Miñana B., Ben-Dov C., Guigó R., Valcárcel J., Méndez R. CPEB1 coordinates alternative 3′-UTR formation with translational regulation. Nature. 2013; 495:121125.

48. 

Fusby B., Kim S., Erickson B., Kim H., Peterson M.L., Bentley D.L. Coordination of RNA Polymerase II Pausing and 3′ end processing factor recruitment with alternative polyadenylation. Mol. Cell. Biol.2015; 36:MCB.00898-15.

49. 

Dermody J.L., Buratowski S. Leo1 subunit of the yeast Paf1 complex binds RNA and contributes to complex recruitment. J. Biol. Chem.2010; 285:3367133679.

50. 

Batra R., Charizanis K., Manchanda M., Mohan A., Li M., Finn D.J., Goodwin M., Zhang C., Sobczak K., Thornton C.A.et al. Loss of MBNL leads to disruption of developmentally regulated alternative polyadenylation in RNA-Mediated disease. Mol. Cell. 2014; 56:311322.

51. 

Neve J., Burger K., Li W., Hoque M., Patel R., Tian B., Gullerova M., Furger A. Subcellular RNA profiling links splicing and nuclear DICER1 to alternative cleavage and polyadenylation. Genome Res.2016; 26:2435.

52. 

Kaida D., Berg M.G., Younis I., Kasim M., Singh L.N., Wan L., Dreyfuss G. U1 snRNP protects pre-mRNAs from premature cleavage and polyadenylation. Nature. 2010; 468:664668.

53. 

Katahira J., Okuzaki D., Inoue H., Yoneda Y., Maehara K., Ohkawa Y. Human TREX component Thoc5 affects alternative polyadenylation site choice by recruiting mammalian cleavage factor I. Nucleic Acids Res.2013; 41:70607072.

54. 

Di Giammartino D.C., Li W., Ogami K., Yashinskie J.J., Hoque M., Tian B., Manley J.L. RBBP6 isoforms regulate the human polyadenylation machinery and modulate expression of mRNAs with AU-rich 39 UTRs. Genes Develop.2014; 28:22482260.

55. 

Dutertre M., Chakrama F.Z., Combe E., Desmet F.-O., Mortada H., Polay Espinoza M., Gratadou L., Auboeuf D. A recently evolved class of alternative 3′-terminal exons involved in cell cycle regulation by topoisomerase inhibitors. Nat. Commun.2014; 5:3395.

56. 

Huang Y., Li W., Yao X., jiang Lin Q., wen Yin J., Liang Y., Heiner M., Tian B., Hui J., Wang G. Mediator complex regulates alternative mRNA processing via the MED23 subunit. Mol. Cell. 2012; 45:459469.

57. 

Hallais M., Pontvianne F., Andersen P.R., Clerici M., Lener D., Benbahouche N.E.H., Gostan T., Vandermoere F., Robert M.C., Cusack S.et al. CBC-ARS2 stimulates 3′ end maturation of multiple RNA families and favors cap-proximal processing. Nat. Struct. Mol. Biol.2013; 20:13581366.

58. 

Müller-McNicoll M., Botti V., de Jesus Domingues A.M., Brandl H., Schwich O.D., Steiner M.C., Curk T., Poser I., Zarnack K.et al. SR proteins are NXF1 adaptors that link alternative RNA processing to mRNA export. Genes Develop.2016; 30:553566.

59. 

Licatalosi D.D., Mele A., Fak J.J., Ule J., Kayikci M., Chi S.W., Clark T.A., Schweitzer A.C., Blume J.E., Wang X.et al. HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature. 2008; 456:464469.

60. 

Proudfoot N.J. Transcriptional termination in mammals: Stopping the RNA polymerase II juggernaut. Science. 2016; 352:aad9926.

61. 

Muller S., Rycak L., Afonso-Grunz F., Winter P., Zawada A.M., Damrath E., Scheider J., Schmah J., Koch I., Kahl G.et al. APADB: a database for alternative polyadenylation and microRNA regulation events. Database. 2014; 2014:bau076.

62. 

Wang R., Nambiar R., Zheng D., Tian B. PolyA-DB 3 catalogs cleavage and polyadenylation sites identified by deep sequencing in multiple genomes. Nucleic Acids Res.2018; 46:D315D319.

63. 

Hong W., Ruan H., Zhang Z., Ye Y., Liu Y., Li S., Jing Y., Zhang H., Diao L., Liang H.et al. APAatlas: decoding alternative polyadenylation across human tissues. Nucleic Acids Res.2020; 48:D34D39.

64. 

Herrmann C.J., Schmidt R., Kanitz A., Artimo P., Gruber A.J., Zavolan M. PolyASite 2.0: a consolidated atlas of polyadenylation sites from 3′ end sequencing. Nucleic Acids Res.2019; 48:D174D179.

65. 

Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T.et al. Gene ontology: Tool for the unification of biology. Nat. Genet.2000; 25:2529.

66. 

Brooks A.N., Duff M.O., May G., Yang L., Bolisetty M., Landolin J., Wan K., Sandler J., Booth B.W., Celniker S.E.et al. Regulation of alternative splicing in Drosophila by 56 RNA binding proteins. Genome Res.2015; 25:17711780.

67. 

Kleiman F.E., Manley J.L. The BARD1-CstF-50 interaction links mRNA 3′ end formation to DNA damage and tumor suppression. Cell. 2001; 104:743753.

68. 

R Core Team R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing Vienna, Austria. 2019; 06 August 2020https://www.R-project.org/.

69. 

Chang W., Cheng J., Allaire J., Xie Y., McPherson J. shiny: Web Application Framework for R R package version 1.4.0.2. 2020; 06 August 2020https://CRAN.R-project.org/package=shiny.

70. 

Stephenson L., Wakeham Y., Seidenman N., Choi J. Building online genomics applications using BioPyramid. Bioinformatics. 2018; 34:30553057.

71. 

Lawrence M., Huber W., Pagès H., Aboyoun P., Carlson M., Gentleman R., Morgan M.T., Carey V.J. Software for computing and annotating genomic ranges. PLoS Comput. Biol.2013; 9:e1003118.

72. 

Huber W., Carey V.J., Gentleman R., Anders S., Carlson M., Carvalho B.S., Bravo H.C., Davis S., Gatto L., Girke T.et al. Orchestrating high-throughput genomic analysis with Bioconductor. Nat. Methods. 2015; 12:115121.

73. 

Xie Y., Cheng J., Tan X. DT: A Wrapper of the JavaScript Library ’DataTables’ R package version 0.13. 2020; 06 August 2020https://CRAN.R-project.org/package=DT.

74. 

Hahne F., Ivanek R. Mathé E, Davis S Visualizing Genomic Data Using Gviz and Bioconductor. Statistical Genomics, Vol. 1418 of Methods in Molecular Biology. 2016; NYSpringer New York335351.

75. 

Chang W., Borges Ribeiro B. shinydashboard: Create Dashboards with ‘Shiny’ R package version 0.7.1. 2018; 06 August 2020https://CRAN.R-project.org/package=shinydashboard.

76. 

Almende B.V., Thieurmel B., Robert T. visNetwork: Network Visualization using ‘vis.js’ Library R package version 2.0.9. 2019; 06 August 2020https://CRAN.R-project.org/package=visNetwork.

77. 

O’Leary N.A., Wright M.W., Brister J.R., Ciufo S., Haddad D., McVeigh R., Rajput B., Robbertse B., Smith-White B., Ako-Adjei D.et al. Reference sequence (RefSeq) database at NCBI: Current status, taxonomic expansion, and functional annotation. Nucleic Acids Res.2016; 44:D733D745.

78. 

Gamazon E.R., Segrè A.V., van de Bunt M., Wen X., Xi H.S., Hormozdiari F., Ongen H., Konkashbaev A., Derks E.M., Aguet F.et al. Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation. Nat. Genet.2018; 50:956967.

79. 

Uhlen M., Zhang C., Lee S., Sjöstedt E., Fagerberg L., Bidkhori G., Benfeitas R., Arif M., Liu Z., Edfors F.et al. A pathology atlas of the human cancer transcriptome. Science. 2017; 357:eaan2507.

80. 

Huang K.-Y., Lee T.-Y., Kao H.-J., Ma C.-T., Lee C.-C., Lin T.-H., Chang W.-C., Huang H.-D. dbPTM in 2019: exploring disease association and cross-talk of post-translational modifications. Nucleic Acids Res.2019; 47:D298D308.

81. 

Carbon S., Douglass E., Dunn N., Good B., Harris N.L., Lewis S.E., Mungall C.J., Basu S., Chisholm R.L., Dodson R.J.et al. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res.2019; 47:D330D338.

82. 

Merico D., Isserlin R., Stueker O., Emili A., Bader G.D. Enrichment Map: A Network-Based method for Gene-Set enrichment visualization and interpretation. PLoS ONE. 2010; 5:e13984.

83. 

Zhang H., Lee J.Y., Tian B. Biased alternative polyadenylation in human tissues. Genome Biol.2005; 6:R100.

84. 

Haeussler M., Zweig A.S., Tyner C., Speir M.L., Rosenbloom K.R., Raney B.J., Lee C.M., Lee B.T., Hinrichs A.S., Gonzalez J.N.et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res.2019; 47:D853D858.

85. 

Raney B.J., Dreszer T.R., Barber G.P., Clawson H., Fujita P.A., Wang T., Nguyen N., Paten B., Zweig A.S., Karolchik D.et al. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics. 2014; 30:10031005.

86. 

Kargapolova Y., Levin M., Lackner K., Danckwardt S. sCLIP—an integrated platform to study RNA–protein interactomes in biomedical research: identification of CSTF2tau in alternative processing of small nuclear RNAs. Nucleic Acids Res.2017; 45:60746086.

87. 

Castello A., Fischer B., Eichelbaum K., Horos R., Beckmann B.M., Strein C., Davey N.E., Humphreys D.T., Preiss T., Steinmetz L.M.et al. Insights into RNA biology from an atlas of mammalian mRNA-Binding proteins. Cell. 2012; 149:13931406.

88. 

Baltz A.G., Munschauer M., Schwanhäusser B., Vasile A., Murakawa Y., Schueler M., Youngs N., Penfold-Brown D., Drew K., Milek M.et al. The mRNA-Bound proteome and its global occupancy profile on Protein-Coding transcripts. Mol. Cell. 2012; 46:674690.

89. 

Yang F., Wang W., Cetinbas M., Sadreyev R.I., Blower M.D. Genome-wide analysis identifies cis -acting elements regulating mRNA polyadenylation and translation during vertebrate oocyte maturation. RNA. 2020; 26:324344.

90. 

Wilkinson M.D., Dumontier M., Aalbersberg I.J., Appleton G., Axton M., Baak A., Blomberg N., Boiten J.-W., da Silva Santos L.B., Bourne P.E.et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data. 2016; 3:160018.

91. 

Marini F., Linke J., Binder H. ideal: an R / Bioconductor package for Interactive Differential Expression Analysis. 2020; bioRxiv doi:11 January 2020, preprint: not peer reviewed10.1101/2020.01.10.901652.

92. 

Rue-Albrecht K., Marini F., Soneson C., Lun A.T. iSEE: Interactive SummarizedExperiment Explorer. F1000 Res.2018; 7:741.

93. 

Ganz C. rintrojs: A Wrapper for the Intro.js Library. J. Open Source Software. 2016; 1:2016.

94. 

Peng R.D. Reproducible research in computational science. Science. 2011; 334:12261227.

95. 

Nosek B.A., Alter G., Banks G.C., Borsboom D., Bowman S.D., Breckler S.J., Buck S., Chambers C.D., Chin G., Christensen G.et al. Promoting an open research culture. Science. 2015; 348:14221425.

96. 

Collins F.S., Tabak L.A. NIH plans to enhance reproducibility. Nature. 2014; 505:612613.

97. 

Marini F., Binder H. pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components. BMC Bioinform.2019; 20:331.

98. 

Bogard N., Linder J., Rosenberg A.B., Seelig G. A deep neural network for predicting and engineering alternative polyadenylation. Cell. 2019; 178:91106.