annotate subpackage¶

Sub-package Documentation¶

Types of Output Files¶

expected name/suffix	file type/format	content
`annotations.tab`	text/tabbed	annotated events
`annotations.fusion-cdna.fa`	fasta	putative fusion unspliced cDNA sequences
`drawings/*.svg`	SVG	diagrams
`drawings/*.legend.json`	JSON	diagram legend/metadata

Algorithm Overview¶

see theory - annotating events

read in breakpoint pairs
generate strand-specific annotations (one annotation per strand, multiple if multiple genes/transcripts in the region)
try building fusion transcripts for bp-specific calls
generate SVG diagrams

Levels of Annotations¶

Overview of Class Relationships¶

Figure 23. The Annotation sub-package has objects for genetic annotations and related calculations. The basic layout of the package is shown above. IS-A relationships are given by the blue arrows. HAS-A relationships are shown in black. And reference_object/parent type relationships are shown in red. Gene is a gene. Start and end are genomic positions wrt to the template/chr. usTranscript is the unspliced transcript. Start and end are genomic positions wrt to the template/chr. Transcript: is the spliced transcript. Start and end coordinates are 1 to the length of the spliced product in base pairs. Translation: is the translation of the spliced transcript. Start and end are cdna positions wrt the 5’ end of the spliced transcript. The start and end here describe the start and end of the coding sequence

modules