Overview

This repository contains a single, end-to-end Python notebook that integrates The Cancer Genome Atlas (TCGA) Breast Cancer (BRCA) multi-omics profiles—gene expression (RNA-seq), DNA methylation (450K), and copy-number variation (CNV) — to discover unsupervised patient groups and interpret them biologically.

Why this project

Projects goals:

Highlights

Result

Figure
CNV key genes
Figure 1: CNV of key genes by cluster
Embedding clusters
Figure 2: Integrated multi-omics embedding (colored by cluster)
Marker expression
Figure 3: Marker gene expression by multi-omics cluster
Survival rate
Figure 4: Overall survival by multi-omics cluster (TCGA BRCA)

How to run

git clone https://github.com/Anwesha19-prog/TCGA-BRCA-Multi-Omics-Intergration
cd TCGA-BRCA-Multi-Omics-Intergration
python -m requirements.txt
jupyter notebook Project-3.ipynb