Ambiguous Colocalization from Trait-Specific Effects

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  dpi = 80
)

This vignette demonstrates an example of ambiguous colocalization from trait-specific effects using the colocboost. Specifically, we will use the Ambiguous_Colocalization, which is output from colocboost analyzing GTEx release v8 and UK Biobank summary statistics (see more details of the original data source in Acknowledgment section).

library(colocboost)
# Run colocboost with diagnostic details
data(Ambiguous_Colocalization)
names(Ambiguous_Colocalization)

1. The Ambiguous_Colocalization Dataset

The Ambiguous_Colocalization dataset contains results from a colocboost analysis of a real genomic region showing ambiguous trait-specific effects between eQTL (expression quantitative trait loci) and GWAS (genome-wide association study) signals. Ambiguous colocalization occurs when there appears to be shared causal variants between traits, but the evidence is complicated by the presence of trait-specific effects. This ambiguity typically arises when some trait-specific boosting learners are updating very similar, yet not the same sets of variants as these traits did not share coupled updates.

This dataset is structured as a list with two main components:

  1. ColocBoost_Results: Contains the output from running the ColocBoost algorithm.

  2. SuSiE_Results: Contains fine-mapping results from the SuSiE algorithm for both eQTL and GWAS data separately.

  3. COLOC_V5_Results: Contains colocalization results from COLOC, which is directly from two susie output objects.

2. ColocBoost results

In this example, there are two trait-specific effects for the eQTL and GWAS signals, respectively. But two uCoS have overlapping variants, which indicates that the two uCoS are not independent. ColocBoost identifies two uCoS:

# Trait-specific effects for both eQTL and GWAS
Ambiguous_Colocalization$ColocBoost_Results$ucos_details$ucos$ucos_index

# Intersection of eQTL and GWAS variants
Reduce(intersect, Ambiguous_Colocalization$ColocBoost_Results$ucos_details$ucos$ucos_index)

After checking the correlation of variants between the two uCoS, we can see the high correlation between the two uCoS.

# With-in and between purity
Ambiguous_Colocalization$ColocBoost_Results$ucos_details$ucos_purity

Based on the results, we can see that the two uCoS are not independent, but they are not fully overlapping.

n_variables <- Ambiguous_Colocalization$ColocBoost_Results$data_info$n_variables
colocboost_plot(
  Ambiguous_Colocalization$ColocBoost_Results, 
  plot_cols = 1,
  grange = c(2000:n_variables),
  plot_ucos = TRUE,
  show_cos_to_uncoloc = TRUE
)

2. Fine-mapping results from SuSiE and colocalization with COLOC

In this example, we also have fine-mapping results from SuSiE for both eQTL and GWAS data separately.

susie_eQTL <- Ambiguous_Colocalization$SuSiE_Results$eQTL
susie_GWAS <- Ambiguous_Colocalization$SuSiE_Results$GWAS

# Fine-mapped eQTL
susie_eQTL$sets$cs$L1

# Fine-mapped GWAS variants
susie_GWAS$sets$cs$L1

# Intersection of fine-mapped eQTL and GWAS variants
intersect(susie_eQTL$sets$cs$L1, susie_GWAS$sets$cs$L1)

To visualize the fine-mapping results,

susieR::susie_plot(susie_eQTL, y = "PIP", pos = 2000:n_variables)
susieR::susie_plot(susie_GWAS, y = "PIP", pos = 2000:n_variables)

We also show the colocalization results from COLOC method. For this ambiguous colocalization, COLOC shows

Note that SuSiE-based COLOC has a relatively high confidence of this as a colocalization event because each of SuSiE 95% CS as shown above cover substantially larger region (containing more variants) compared to the trait-specific effects identified by ColocBoost, although at a lower purity (SuSiE purity = 0.56 and 0.64, ColocBoost uCoS purity = 0.67 and 0.70). With larger overlap between the SuSiE 95% CS across traits, the high probability of colocalization is expected. But for this particular data application without knowing the ground truth, it is difficult to determine which method is more precise.

# To run COLOC, please use the following command:
# res <- coloc::coloc.susie(susie_eQTL, susie_GWAS)
res <- Ambiguous_Colocalization$COLOC_V5_Results
res$summary

3. Get the ambiguous colocalization results and summary

ColocBoost provides a function to get the ambiguous colocalization results and summary from trait-specific effects, by considering the correlation of variants between the two uCoS.

3.1. Get the ambiguous colocalization results

The get_ambiguous_colocalization function will return the ambiguous results in ambigous_ucos object, if the following conditions are met:

colocboost_results <- Ambiguous_Colocalization$ColocBoost_Results
res <- get_ambiguous_colocalization(
  colocboost_results, 
  min_abs_corr_between_ucos = 0.5, 
  median_abs_corr_between_ucos = 0.8
)
names(res)
names(res$ambiguous_cos)
names(res$ambiguous_cos[[1]])

Explanation of results For each ambiguous colocalization, the following information is provided:

3.2. Get the summary of ambiguous colocalization results

To get the summary of ambiguous colocalization results, we can use the get_colocboost_summary function.

# Get the full summary results from colocboost
full_summary <- get_colocboost_summary(colocboost_results, summary_level = 3)
names(full_summary)

# Get the summary of ambiguous colocalization results
summary_ambiguous <- full_summary$ambiguous_cos_summary
colnames(summary_ambiguous)

See details of function usage in the Functions.

4. Take home message

In this vignette, we have demonstrated how post-processing of ColocBoost results may be use to reconciliate ambiguous colocalization scenarios where trait-specific effects share highly correlated and overlapping variants.

Acknowledgment



Try the colocboost package in your browser

Any scripts or data that you put into this service are public.

colocboost documentation built on June 8, 2025, 11:07 a.m.