
ALEXA_gg_42_2 (CHICKEN): Detailed statistics and comments


The following page provides detailed statistics, comments and figures describing the ALEXA_gg_42_2 pre-computed microarray design


Statistics


ALEXA_gg_42_2

Species: Gallus gallus
Common name: Chicken
Source EnsEMBL Database: gallus_gallus_core_42_2
Genome Build: WUSTL Gallus-gallus-2.1
Gene count: 17,358
Genes targeted: 17,262
Transcript count: 22,829
Exon count: 182,577
Known exon skipping events: 3,715
Repeat content: 8.96%
Target probe length: 38 bp (+/- 10 bp)
Target probe Tm: 67.1 C
Probe count (unfiltered): 10,077,586
Sequences available for specificity testing (mRNA / EST): 27,285 / 617,155
Probe count (filtered): 2,341,085 (54.0% Exon-Junction, 30.9% Exon-Boundary, 14.9% Exon, 0.2% Intron)
Percentage of genes with > 75% probe coverage: 59.0%
Size of Database download (compressed): 695 Mb


Download

Get this design from our FTP server: ALEXA_gg_42_2.tables.tar.gz

See the Platform section for instructions on how to import these designs and create your own array design submission file.


Design Summary and Custom UCSC Tracks

To view details on the probes for a specific gene of interest, download the annotation file for this design. This file contains links to custom UCSC tracks for every gene (the gene name is also a link). A link to an example gene is also provided below.


Design Annotation File: ALEXA_gg_42_2_summary.xls
Example gene: C19orf50
Housekeeping gene list: chicken_housekeeping_genes.xls


Figures describing the genome used to create the ALEXA_gg_42_2 design


Note: Additional statistics and figures pertaining to the input genome for this design can be downloaded separately.


Basic summary of EnsEMBL gene models used for this design

Note that most genes currently have only a single transcript annotated


EnsEMBL gene model summary


Distribution of transcript lengths

Transcripts are divided into two bins: those shorter than 10,000 bp and those longer than 10,000 bp.


Distribution of EnsEMBL transcript lengths


Distribution of exon lengths

Exons are divided into two bins: those shorter than 500 bp and those longer than 500 bp.


Distribution of EnsEMBL exon lengths


Figures describing the ALEXA_gg_42_2 design itself (Probe stats - After Filtering)


Note: The following statistics pertain to all probe types combined (Exon, Intron, Exon-Junction, Exon-Boundary and Random Negative-Control probes) but are limited to only those probes that pass the filtering step. Similar statistics corresponding to all probes before filtering were used to determine suitable thresholds for the filtering step.


Each of the plots shown below, as well as many others, is available for each probe type individually and can be downloaded as a complete package. This package also includes statistics for the complete set of unfiltered probes.


Probe Length

The length of probes is allowed to vary by +/- 10 bp to achieve the target Tm


Distribution of ALEXA microarray oligonucleotide probe lengths


Probe Melting Temperature

'Probe Tm' values are calculated by a Nearest Neighbor method and reported in degrees Celsius


Distribution of ALEXA microarray oligonucleotide probe Tm


Probe Folding Energy

'Folding energy' values are the minimum free energy values calculated by PairFold for each probe sequence and reported in kcal/mol


Distribution of ALEXA microarray oligonucleotide hairpin folding energy


Distribution of ALEXA microarray oligonucleotide self-self folding energy


Number of Probes Per Gene

The number of probes extracted for a particular gene depends on the number of known exons for that gene. As the number of exons increases, the number of exon and exon-boundary probes increases linearly, while the number of exon-junction probes increases combinatorially (n!/[(n-2)! 2!] = n(n-1)/2, where n is the number of exons), that is, roughly with the square of the exon count.
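As a rough illustration of how quickly the exon-junction count grows, the following Python sketch evaluates n!/[(n-2)! 2!] for a few exon counts. It assumes one junction per possible pair of exons in a gene; the function name is hypothetical and is not part of the ALEXA software.

```python
from math import comb


def junction_pair_count(n_exons: int) -> int:
    """Number of distinct exon pairs: n! / ((n - 2)! * 2!) = n * (n - 1) / 2."""
    if n_exons < 0:
        raise ValueError("n_exons must be non-negative")
    # math.comb returns 0 when fewer than 2 exons are available
    return comb(n_exons, 2)


# Compare the pair count against the exon count for a few gene sizes
for n in (2, 5, 10, 20):
    print(f"{n} exons -> {junction_pair_count(n)} possible exon-exon junctions")
```

A gene with 20 exons therefore has 190 possible junctions, which is why exon-junction probes make up the largest share of the design.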


Number of ALEXA microarray oligonucleotide probes per target gene


Probe Specificity - Length of Non-Specific Hits

'EnsEMBL Non-specific Hit Length' values are the largest BLAST hit observed between a probe sequence and an EnsEMBL transcript from a locus other than the one targeted by the probe (i.e. a closely related sequence from another gene)
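The definition above can be sketched in Python as follows. This is only an illustration under stated assumptions: the `BlastHit` record, its field names and the gene identifiers are hypothetical, and the BLAST results are assumed to have been parsed already into (gene, alignment length) pairs.

```python
from typing import Iterable, NamedTuple


class BlastHit(NamedTuple):
    hit_gene_id: str   # gene (locus) that the matched transcript belongs to
    align_length: int  # length of the aligned region, in bp


def non_specific_hit_length(target_gene_id: str, hits: Iterable[BlastHit]) -> int:
    """Longest hit to a transcript from any locus other than the target (0 if none)."""
    return max(
        (h.align_length for h in hits if h.hit_gene_id != target_gene_id),
        default=0,
    )


# Hypothetical hits for one probe designed against "geneA"
hits = [BlastHit("geneA", 38), BlastHit("geneB", 21), BlastHit("geneC", 17)]
print(non_specific_hit_length("geneA", hits))  # 21
```

The full-length hit to the targeted gene itself is ignored, so only cross-hybridization candidates contribute to the value.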


Distribution of specificity score for ALEXA oligonucleotide probes


Probe coverage - A measure of the success of probe design for each gene

Each gene has an ideal number of probes based on the number of exons in that gene. 'Probe coverage' values are calculated for each gene and represent the ratio of successful probes designed compared to the ideal number possible for a gene with n exons. In some rare cases, single exons are divided into multiple sections (occurs when the boundaries of an exon are ambiguous). This can lead to a probe coverage value of greater than 100%. Note that the success of probe design varies dramatically between genes. However, 59.0% of genes in this design have a probe coverage of 75% or greater. The stringency of filtering can be reduced to increase this percentage.
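A minimal Python sketch of the coverage calculation is shown below. The ideal probe count per gene is taken as an input, because the exact rule for deriving it from the exon count is not given here. The gene names and counts are made-up illustration values, not data from this design.

```python
def probe_coverage(successful: int, ideal: int) -> float:
    """Probe coverage as a percentage of the ideal probe count for a gene."""
    if ideal <= 0:
        raise ValueError("ideal must be positive")
    return 100.0 * successful / ideal


def percent_genes_covered(coverages, threshold: float = 75.0) -> float:
    """Percentage of genes whose coverage is at or above the threshold."""
    coverages = list(coverages)
    if not coverages:
        return 0.0
    return 100.0 * sum(c >= threshold for c in coverages) / len(coverages)


# Hypothetical (successful, ideal) probe counts for four genes
genes = {"gene1": (18, 20), "gene2": (6, 12), "gene3": (30, 40), "gene4": (22, 20)}
coverages = {name: probe_coverage(s, i) for name, (s, i) in genes.items()}
print(coverages)
print(percent_genes_covered(coverages.values()))
```

In this toy data, gene4 exceeds 100% coverage, mirroring the case where an exon with ambiguous boundaries is split into multiple sections.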


ALEXA probe coverage for each target gene

