How do I remove duplicate samples from a KM plot?
If your plot has an '!' icon next to the p-value this means that some patients are in your plot twice. This can happen when A) a patient has both a tumor and normal sample or when a patient has a metastasis that is part of the dataset and/or B) a tumor sample was split into multiple aliquots and then run through the same analysis twice.
This page will guide you on how to remove duplicates due to A. If there are duplicates due to B you will need to download the data, decide how to resolve any inconsistencies between the multiple aliquots and load it up as your own data.

Removing duplicates

    Add the data column of 'sample type' from the Phenotype data
Note that different datasets may call this phenotype data something slightly different. We are just trying to add a column of data that indicates the sample type such as 'Primary Tumor', 'Normal', etc.
We can see here that some patients have both Primary and Recurrent Tumors.
2. Filter to only samples that are 'Primary tumor' by typing 'primary' into the filter search box. More help on filtering. Next, click the filter icon next to the filter search box and chose 'Filter'. This will filter out all samples that are not primary tumor.
To filter out the samples that are 'Recurrent Tumor', type 'primary' into the filter search box
3. Run your KM analysis by clicking the caret menu at the top of the column and choosing 'Kaplan-Meier plot' It will now only have primary tumor samples in it.
Last modified 1mo ago
Export as PDF
Copy link