APA regulations of select genes in cancer

Citation

Xue Z, Warren RL, Gibb EA, MacMillan D, Wong J, Chiu R, et al. Recurrent tumor-specific regulation of alternative polyadenylation of cancer-related genes. BMC Genomics. 2018;19:536

Introduction

In addition to the 77 cases (arc plots) reported in the manuscript and its supplemental materials, here, we present a total of 1593 arc plots (114 (genes) x 14 (cancer types) - 3) for all gene-cancer type pairs considered in this study in an interactive webpage. Hence, these plots represent a superset of what is presented in the manuscript. We note that for three pairs, (NKX2-1, COAD), (TERT, KICH), and (TERT, KIRP), no cleavage site was detected in the normal samples, so they have no corresponding arc plots; NA is shown instead.

Such an interactive format also enables the readers to inspect their genes of interest in more detail.

Instruction

# Gene Disease
1 ABL1
2 AKT1
3 AKT2
4 ALK
5 AMER1
6 APC
7 ATM
8 AXIN1
9 BAP1
10 BCL6
11 BRAF
12 BRCA1
13 BRCA2
14 CARD11
15 CCND1
16 CCND2
17 CCNE1
18 CD47
19 CDH1
20 CDKN2A
21 CDKN2C
22 CEBPA
23 CHURC1
24 CREBBP
25 CTNNB1
26 CYLD
27 DICER1
28 DRAM1
29 EGFR
30 EP300
31 ERBB2
32 EZH2
33 FAS
34 FBXW7
35 FGF2
36 FGFR2
37 FGFR3
38 FLT3
39 GATA3
40 GNA11
41 GNAQ
42 GNAS
43 HNF1A
44 HRAS
45 IDH1
46 IGF2BP1
47 JUN
48 KDM6A
49 KDR
50 KIT
51 KMT2A
52 KRAS
53 MAP2K1
54 MAP2K2
55 MAP2K4
56 MAX
57 MDM2
58 MDM4
59 MEN1
60 MET
61 MITF
62 MLH1
63 MSH2
64 MSH6
65 MYC
66 MYCL
67 MYCN
68 MYD88
69 NF1
70 NF2
71 NFE2L2
72 NFE2L3
73 NFYA
74 NKX2-1 NA
75 NOTCH1
76 NOTCH2
77 NRAS
78 NSUN5
79 PAX5
80 PDGFRA
81 PIK3CA
82 PIK3R1
83 PRKAR1A
84 PTCH1
85 PTEN
86 RAB10
87 RB1
88 REL
89 RET
90 RNF43
91 RTCA
92 SEC24A
93 SETD2
94 SMAD4
95 SMARCA4
96 SMARCB1
97 SMO
98 SOCS1
99 SOX2
100 STAT3
101 STK11
102 STK17A
103 SUFU
104 TERT NA NA
105 TET2
106 TNFAIP3
107 TP53
108 TRAF7
109 TSC1
110 TSC2
111 TSHR
112 TXNL4A
113 VHL
114 WT1

Click on a gene-cancer type pair in the left table to see an arc plot

Legend: Inside each left-hand panel, each group of bars represents the frequency of a specific CS in normal (blue) and tumor (red) samples. Bar groups are ordered by corresponding CS genomic coordinates. The text box shows the number of normal (N) and tumor (T) samples that were used for frequency calculation. Vertical arrows indicate the positions of predicted CSs. Annotated and predicted CSs match well, but are not expected to overlap exactly. An arrow pointing upwards (downwards) represents an increase (decrease) in frequency from normal to tumor. Arrow height represents the difference (Δ). Bars and arrows of insignificant difference are colored gray. The X-axis represents the genomic coordinate offset by that of the first stop codon. The horizontal arrow indicates the gene strand.

When the 3' UTR length change is too complex to be resolved into a shortening or lengthening trend, the corresponding text box is left uncolored. NMD-related transcript elements are colored in cyan. An orange arrow indicates that a predicted CS with a significant frequency change is mapped to multiple stop codons, with its associated 3' UTR length being ambiguous.

Different from the figures in the main text, all predicted CSs are plotted without filtering by the low frequency (<5%) and insignificant change criteria. As a result, the number of CSs for the same gene in different cancer types may differ.

Besides, we also computed the distribution of expression in reads per kilobase of transcript per million sequenced reads (RPKMS).