Setup

library(ithi.utils)
load_base_libs()

library(methods)
library(ComplexHeatmap)
library(cluster)
library(entropy)
library(grid)
library(gridExtra)
library(gridBase)

library(ithi.meta)
library(ithi.figures)
library(ithi.utils)
library(ithi.supp)
library(ithi.clones)
library(ithi.xcr)
ihc_table_path <- snakemake@input$ihc_table
xcr_table_path <- snakemake@input$xcr_table
molecular_subtype_file <- snakemake@input$molsubtypes
tcr_diversity_file <- snakemake@input$tcr_diversity
bcr_diversity_file <- snakemake@input$bcr_diversity

tils_for_cluster <- snakemake@params$tils_for_cluster
all_tiltypes <- snakemake@params$all_tiltypes
db_path <- snakemake@params$db
annotation_colours <- ithi.figures::get_annotation_colours()
ihc_table <- fread(ihc_table_path)
molsubtypes <- fread(molecular_subtype_file)
xcr_table <- read_clonotypes(xcr_table_path, duplicates = FALSE, db_path = db_path)

Read 16.4% of 304822 rows
Read 55.8% of 304822 rows
Read 85.3% of 304822 rows
Read 304822 rows and 18 (of 18) columns from 0.070 GB file in 00:00:06

Analysis

This reviewer makes a great point here – it would make more sense for us to do this with our subtypes. My original intention was to emphasize novelty – i.e. in showing that C1 tumours have higher TCR diversity, which is not known AFAIK in the literature – but we could just as easily do this without own subtypes.

So let’s do so.

til_subtype_results <- ithi.supp::supp_molsubtype_immune_properties(ihc_table, 
    molsubtypes, xcr_table, tcr_diversity_file, bcr_diversity_file, db_path, 
    tiltypes = all_tiltypes, subtype_class = "til_clusters")
grid.newpage()
grid.draw(til_subtype_results$fig[[1]])

grid.newpage()
grid.draw(til_subtype_results$fig[[2]])

So it seems that, although S-TIL is more diverse than ES-TIL, the difference is not significant, except in the case of BCRs when using the D50 index. What that would mean is that BCR repertoires are dominated by rare clonotypes in S-TIL samples, to a greater degree than in ES-TIL and N-TIL samples.

Nevertheless, we can see a consistent trend towards higher TCR and BCR diversity values in S-TIL and ES-TIL subsets. This is a finding that may be validated by further studies/larger cohorts.

Next, we’ll just show the TIL densities in each of the TIL subtypes. This used to show TIL densities in each of the molecular subtypes.

grid.newpage()
grid.draw(til_subtype_results$combined_fig)

As expected, epithelial TIL densities are higher in ES-TIL, and stromal TIL densities are highest in S-TIL (but for CD8+’s and plasma comparable between S-TIL and ES-TIL).

Additionally, there’s another pretty interesting finding here – ES-TIL samples have the lowest proportion of private (site-specific) clonotypes. This could possibly be due to expanded clonotypes ‘crowding out’ the read space for less-expanded, site-specific i.e. ‘private’ clonotypes in the ES-TIL samples (reviewer 3’s hypothesis would probably be this, based on their comments).

---
title: "TIL clusters: immune properties"
---
                        ```{r, echo=FALSE, message=FALSE, warning=FALSE}

######## Snakemake header ########
library(methods)
Snakemake <- setClass(
    "Snakemake",
    slots = c(
        input = "list",
        output = "list",
        params = "list",
        wildcards = "list",
        threads = "numeric",
        log = "list",
        resources = "list",
        config = "list",
        rule = "character"
    )
)
snakemake <- Snakemake(
    input = list('/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/xcr_table.tsv', '/shahlab/alzhang/pipeline_outputs/ith_immune/mixcr/mixcr_runs/ith_1_2_3/mixcr5/postprocess/IGH/postfilter_diversity_stats/diversity.strict.resampled.txt', '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/molsubtypes.tsv', '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/ihc_table.tsv', 'notebooks/tilcluster_immuneproperties.Rmd', '/shahlab/alzhang/pipeline_outputs/ith_immune/mixcr/mixcr_runs/ith_1_2_3/mixcr5/postprocess/TRB/postfilter_diversity_stats/diversity.strict.resampled.txt', "xcr_table" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/xcr_table.tsv', "bcr_diversity" = '/shahlab/alzhang/pipeline_outputs/ith_immune/mixcr/mixcr_runs/ith_1_2_3/mixcr5/postprocess/IGH/postfilter_diversity_stats/diversity.strict.resampled.txt', "molsubtypes" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/molsubtypes.tsv', "ihc_table" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/ihc_table.tsv', "notebook" = 'notebooks/tilcluster_immuneproperties.Rmd', "tcr_diversity" = '/shahlab/alzhang/pipeline_outputs/ith_immune/mixcr/mixcr_runs/ith_1_2_3/mixcr5/postprocess/TRB/postfilter_diversity_stats/diversity.strict.resampled.txt'),
    output = list('/shahlab/alzhang/projects/ITH_Immune/paper/results/review/notebooks/run2/tilcluster_immuneproperties.nb.html'),
    params = list(c('T_CD8_density', 'T_CD4_density', 'T_CD20_density', 'T_Plasma_density', 'E_CD8_density', 'E_CD4_density', 'E_CD20_density', 'E_Plasma_density', 'S_CD8_density', 'S_CD4_density', 'S_CD20_density', 'S_Plasma_density'), '/shahlab/alzhang/projects/ITH_Immune/metadata/db/immune_project.sqlite3', c('E_CD8_density', 'E_CD4_density', 'E_CD20_density', 'E_Plasma_density', 'S_CD8_density', 'S_CD4_density', 'S_CD20_density', 'S_Plasma_density'), 'tilcluster_immuneproperties_analysis', "all_tiltypes" = c('T_CD8_density', 'T_CD4_density', 'T_CD20_density', 'T_Plasma_density', 'E_CD8_density', 'E_CD4_density', 'E_CD20_density', 'E_Plasma_density', 'S_CD8_density', 'S_CD4_density', 'S_CD20_density', 'S_Plasma_density'), "db" = '/shahlab/alzhang/projects/ITH_Immune/metadata/db/immune_project.sqlite3', "tils_for_cluster" = c('E_CD8_density', 'E_CD4_density', 'E_CD20_density', 'E_Plasma_density', 'S_CD8_density', 'S_CD4_density', 'S_CD20_density', 'S_Plasma_density'), "name" = 'tilcluster_immuneproperties_analysis'),
    wildcards = list(),
    threads = 1,
    log = list('/shahlab/alzhang/clusttmp/paperreview2/notebooks/tilcluster_immuneproperties_analysis.log'),
    resources = list(),
    config = list("all_tiltypes" = c('T_CD8_density', 'T_CD4_density', 'T_CD20_density', 'T_Plasma_density', 'E_CD8_density', 'E_CD4_density', 'E_CD20_density', 'E_Plasma_density', 'S_CD8_density', 'S_CD4_density', 'S_CD20_density', 'S_Plasma_density'), "nanostring_annotations" = '/shahlab/alzhang/projects/ITH_Immune/data/expression/nanostring/pancancer_annotations.tsv', "distance_method" = 'horn', "ihc_table" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/ihc_table.tsv', "array_expression_file" = '/shahlab/alzhang/projects/ITH_Immune/data/expression/array/gene_exprs_rma_batch_corrected.txt', "clone_branch_lengths" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/clones/branch_data.tsv', "ith_stats" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/ith_statistics.tsv', "molsubtypes" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/molsubtypes.tsv', "clone_prevalences" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/clones/clone_data.tsv', "refseq_gene_file" = '/shahlab/alzhang/data/genome/hg19/refseq_genes.bed', "benchmarkdir" = '/shahlab/alzhang/benchmarks/paperreview2', "neoediting_outdir" = '/shahlab/alzhang/pipeline_outputs/ith_immune/neoediting/run6', "somatic_coding_result_dir" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/somatic_coding_variants', "icgc_subtypes" = '/shahlab/alzhang/data/ICGC/icgc_primary_tumour_subtypes.tsv', "rooney_mutsigcv_file" = '/shahlab/alzhang/projects/ITH_Immune/external/other_papers/mmc6.xlsx', "image_summary" = '/shahlab/alzhang/data/ithi/yuan_hecr_image_results.csv', "db" = '/shahlab/alzhang/projects/ITH_Immune/metadata/db/immune_project.sqlite3', "copynumber_table" = '/shahlab/alzhang/data/ithi/master_copynumber_file.tsv', "breakpoint_table" = '/shahlab/amcpherson/projects/ith3/ith3/notebooks/bespoke/ith_breakpoints.tsv', "tils_for_variability" = c('T_CD8_density', 'T_CD4_density', 'T_CD20_density', 'T_Plasma_density'), "snv_cluster_dir" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/clones/snv_cluster', "icgc_specimen" = '/shahlab/alzhang/data/ICGC/specimen.tsv', "clone_trees" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/clones/tree_data.tsv', "clola_result_file" = '/shahlab/alzhang/pipeline_outputs/ith_immune/clola/run4/clola_condensed_results/beta/clola_results.tsv', "known_subtypes_array" = '/shahlab/alzhang/projects/ITH_Immune/data/expression/array/subtypes/known_subtypes.tsv', "total_tiltypes" = c('T_CD8_density', 'T_CD4_density', 'T_CD20_density', 'T_Plasma_density'), "patients_for_clonal" = c(1, 2, 3, 4, 7, 9, 10, 11, 12, 13, 14, 15, 16, 17), "til_clusters_output" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/intermediates/run2/til_clusters_output.txt', "nanostring_data" = '/shahlab/alzhang/projects/ITH_Immune/results/nanostring_results/ith_full/qc/limma_quantile/normalized_expression_voa_labels_filtered.tsv', "finnhe_pipeline_results_dir" = '/shahlab/alzhang/pipeline_outputs/ith_immune/finnhe/run1', "tilcluster_supervised_ipynb" = '/shahlab/alzhang/projects/ITH_Immune/paper/review/ipy/tilcluster_supervisedmulticlass.ipynb', "mmctm_final_patient_dir" = '/shahlab/alzhang/projects/ITH_Immune/results/mmctm_results/ith_by-patient_with-ov', "variability_type" = 'stabilize', "ith_icgc_bc" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/ith_icgc_merged_bc.tsv', "tils_for_cluster" = c('E_CD8_density', 'E_CD4_density', 'E_CD20_density', 'E_Plasma_density', 'S_CD8_density', 'S_CD4_density', 'S_CD20_density', 'S_Plasma_density'), "ihc_features_output" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/intermediates/run2/ihc_features_output.txt', "ith_stat_types" = c('entropy', 'postprocessed_divergence', 'combined_ith_normalized', 'proportion_subclonal'), "he_results_dir" = '/shahlab/alzhang/data/ithi/finn_results/he_output_Nov29', "image_summary2" = '/shahlab/alzhang/data/ithi/yuan_hecr_image_results_2.csv', "snv_table" = '/shahlab/amcpherson/projects/ith3/ith3/notebooks/bespoke/ith_snvs.tsv', "tumour_purity" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/tumour_purity.tsv', "bcr_diversity" = '/shahlab/alzhang/pipeline_outputs/ith_immune/mixcr/mixcr_runs/ith_1_2_3/mixcr5/postprocess/IGH/postfilter_diversity_stats/diversity.strict.resampled.txt', "igpartition_outdir" = '/shahlab/alzhang/pipeline_outputs/ith_immune/igpartition/run22', "notebook_dir" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/review/notebooks/run2', "remixt_cellularity_ploidy" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/remixt_cellularity_ploidy.tsv', "table_dir" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/review/tables/run2', "epitopes_unique_filtered" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/epitopes_unique_filtered.tsv', "prevalence_threshold" = 0.01, "xcr_table" = '/shahlab/alzhang/projects/ITH_Immune/paper/results/tables/run2/xcr_table.tsv', "logdir" = '/shahlab/alzhang/clusttmp/paperreview2', "tcr_diversity" = '/shahlab/alzhang/pipeline_outputs/ith_immune/mixcr/mixcr_runs/ith_1_2_3/mixcr5/postprocess/TRB/postfilter_diversity_stats/diversity.strict.resampled.txt'),
    rule = 'tilcluster_immuneproperties_analysis'
)
######## Original script #########

                        ```


## Setup

```{r global_chunk_options, include=FALSE}
knitr::opts_chunk$set(echo = TRUE, tidy=TRUE, warning=FALSE, message=FALSE, cache=TRUE) #cache=TRUE
```

```{r}
library(ithi.utils)
load_base_libs()

library(methods)
library(ComplexHeatmap)
library(cluster)
library(entropy)
library(grid)
library(gridExtra)
library(gridBase)

library(ithi.meta)
library(ithi.figures)
library(ithi.utils)
library(ithi.supp)
library(ithi.clones)
library(ithi.xcr)
```

```{r}
ihc_table_path <- snakemake@input$ihc_table
xcr_table_path <- snakemake@input$xcr_table
molecular_subtype_file <- snakemake@input$molsubtypes
tcr_diversity_file <- snakemake@input$tcr_diversity
bcr_diversity_file <- snakemake@input$bcr_diversity

tils_for_cluster <- snakemake@params$tils_for_cluster
all_tiltypes <- snakemake@params$all_tiltypes
db_path <- snakemake@params$db
```

```{r}
annotation_colours <- ithi.figures::get_annotation_colours()
ihc_table <- fread(ihc_table_path)
molsubtypes <- fread(molecular_subtype_file)
xcr_table <- read_clonotypes(xcr_table_path, duplicates = FALSE, db_path = db_path)
```

## Analysis

This reviewer makes a great point here -- it would make more sense for us to do this with our subtypes. My original intention was to emphasize novelty -- i.e. in showing that C1 tumours have higher TCR diversity, which is not known AFAIK in the literature -- but we could just as easily do this without own subtypes. 

So let's do so. 

```{r, results = 'hide'}
til_subtype_results <- ithi.supp::supp_molsubtype_immune_properties(ihc_table, molsubtypes, xcr_table, tcr_diversity_file, bcr_diversity_file, db_path, tiltypes = all_tiltypes, subtype_class = "til_clusters")
```

```{r}
grid.newpage()
grid.draw(til_subtype_results$fig[[1]])
```


```{r}
grid.newpage()
grid.draw(til_subtype_results$fig[[2]])
```

So it seems that, although S-TIL is more diverse than ES-TIL, the difference is not significant, except in the case of BCRs when using the D50 index. What that would mean is that BCR repertoires are dominated by rare clonotypes in S-TIL samples, to a greater degree than in ES-TIL and N-TIL samples. 

Nevertheless, we can see a consistent trend towards higher TCR and BCR diversity values in S-TIL and ES-TIL subsets. This is a finding that may be validated by further studies/larger cohorts. 


Next, we'll just show the TIL densities in each of the TIL subtypes. This used to show TIL densities in each of the molecular subtypes. 

```{r, fig.width = 10, fig.height = 19}
grid.newpage()
grid.draw(til_subtype_results$combined_fig)
```

As expected, epithelial TIL densities are higher in ES-TIL, and stromal TIL densities are highest in S-TIL (but for CD8+'s and plasma comparable between S-TIL and ES-TIL). 

Additionally, there's another pretty interesting finding here -- ES-TIL samples have the lowest proportion of private (site-specific) clonotypes. This could possibly be due to expanded clonotypes 'crowding out' the read space for less-expanded, site-specific i.e. 'private' clonotypes in the ES-TIL samples (reviewer 3's hypothesis would probably be this, based on their comments). 

