Welcome to Oncosplice

About Oncosplice

Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introduce Oncosplice, a tool that scores mutations based on models of proteomes generated using aberrant splicing predictions. Oncosplice leverages a highly accurate neural network that predicts splice sites within arbitrary mRNA sequences, a greedy transcript constructor that considers alternate arrangements of splicing blueprints, and an algorithm that grades the functional divergence between proteins based on evolutionary conservation. By applying this tool to 12M somatic mutations we identify 8K deleterious variants that are significantly depleted within the healthy population; we demonstrate the tool’s ability to identify clinically validated pathogenic variants with a positive predictive value of 94%; we show strong enrichment of predicted deleterious mutations across pan-cancer drivers. We also achieve improved patient survival estimation using a proposed set of novel cancer-involved genes. Ultimately, this pipeline enables accelerated insight-gathering of sequence-specific consequences for a class of understudied mutations and provides an efficient way of filtering through massive variant datasets – functionalities with immediate experimental and clinical applications.


PyPI Installation

You can install Oncosplice directly from PyPI:

pip install geney
python -m geney.data_setup -b /path/to/empty/directory

Python Execution

You can run Oncosplice with the following Python code:

from geney.oncosplice_pipeline import oncosplice, sample_mut_id

# Run Oncosplice
results = oncosplice(sample_mut_id)

# Print results
display(results)

License

Oncosplice is available for non-commercial usage.

Citation

If you use Oncosplice in your research, please cite:

Lynn, N., Tuller, T. Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing. npj Syst Biol Appl 10, 25 (2024). https://doi.org/10.1038/s41540-024-00351-7

Contact

For any inquiries or further assistance, please reach out to us at nicolasalynn@gmail.com.