Finding the optimal gene expression cut-off for survival analysis

Original · 生信技能树 · 2022-06-06

Part of the collection #Survival analysis (38 posts)

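Yesterday we noted that arbitrarily changing the expression cut-off used to group patients can give very different survival analysis results.

Consider this page: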

https://www.proteinatlas.org/ENSG00000111801-BTN3A3/pathology/tissue/breast+cancer

In text form, the method described there is:

Based on the FPKM value of each gene, we classified the patients into two groups and examined their prognoses.
In the analysis, we excluded genes with low expression, i.e., those with a median expression among samples less than FPKM 1.
The prognosis of each group of patients was examined by Kaplan-Meier survival estimators, and the survival outcomes of the two groups were compared by log-rank tests.
To choose the best FPKM cut-offs for grouping the patients most significantly, all FPKM values from the 20th to 80th percentiles were used to group the patients, significant differences in the survival outcomes of the groups were examined and the value yielding the lowest log-rank P value is selected.
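
The scan described in the quoted passage can be sketched roughly as follows. This is a minimal illustration, not the proteinatlas implementation: the helper name `best_cutoff`, its `lower`/`upper` arguments, and the simulated data are all assumptions made for the example. It tries every observed expression value between the 20th and 80th percentiles as a cut-off and keeps the one with the smallest log-rank p value.

```r
library(survival)

# Hypothetical helper (not from the original post): scan every observed
# expression value between the lower and upper percentiles as a cut-off
# and return the one with the smallest log-rank p value.
best_cutoff <- function(time, event, expr, lower = 0.2, upper = 0.8) {
  bounds <- quantile(expr, c(lower, upper), na.rm = TRUE)
  candidates <- sort(unique(expr[expr >= bounds[1] & expr <= bounds[2]]))
  p_values <- sapply(candidates, function(thr) {
    group <- ifelse(expr > thr, "high", "low")
    fit <- survdiff(Surv(time, event) ~ group)
    # Log-rank p value from the chi-square statistic
    1 - pchisq(fit$chisq, length(fit$n) - 1)
  })
  best <- which.min(p_values)
  list(cutoff = candidates[best],
       p = p_values[best],
       scan = data.frame(cutoff = candidates, p = p_values))
}

# Simulated data, purely for illustration
# (assumption: higher expression goes with shorter survival)
set.seed(1)
n <- 200
expr <- rexp(n, rate = 0.2)
time <- rexp(n, rate = 0.1 * exp(0.05 * expr))
event <- rbinom(n, 1, 0.7)

res <- best_cutoff(time, event, expr)
res$cutoff  # selected expression cut-off
res$p       # smallest log-rank p value found in the scan
```

Because the smallest of many p values is kept, the reported value is optimistic and is best read as descriptive rather than as a formal test result.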

The resulting Kaplan-Meier plot is shown below:

[Figure: Kaplan-Meier plot from proteinatlas]

With data from http://www.oncolnc.org/ instead, the plot is drawn as follows:

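# Load the survival table (file BRCA_10384_50_50.csv)
a <- read.csv('BRCA_10384_50_50.csv')
head(a)
# Event indicator: 0 = alive (censored), 1 = dead
a$event <- ifelse(a$Status == 'Alive', 0, 1)
library(survival)
library(survminer)
# Kaplan-Meier curves for the groups given in the file, with the log-rank p value
sfit <- survfit(Surv(Days, event) ~ Group, data = a)
ggsurvplot(sfit, conf.int = FALSE, pval = TRUE)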

phe <- a
phe$time <- phe$Days / 365  # convert days to years

## Batch survival analysis using the log-rank test
mySurv <- with(phe, Surv(time, event))
# Try the 20th, 30th, ..., 80th percentiles of expression as the cut-off
log_rank_p <- lapply(2:8, function(i){
  thr <- sort(phe$Expression)[round(nrow(phe) * i / 10)]
  phe$group <- ifelse(phe$Expression > thr, 'high', 'low')
  print(table(phe$group))
  data.survdiff <- survdiff(mySurv ~ group, data = phe)
  # Log-rank p value from the chi-square statistic
  p.val <- 1 - pchisq(data.survdiff$chisq, length(data.survdiff$n) - 1)
  return(p.val)
})
log_rank_p <- unlist(log_rank_p)
log_rank_p

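# Re-plot with one cut-off; i = 8 (the 80th percentile) is set by hand here.
# (2:8)[which.min(log_rank_p)] would pick the smallest p value automatically.
i <- 8
thr <- sort(phe$Expression)[round(nrow(phe) * i / 10)]
phe$group <- ifelse(phe$Expression > thr, 'high', 'low')
print(table(phe$group))
sfit <- survfit(Surv(time, event) ~ group, data = phe)
ggsurvplot(sfit, conf.int = FALSE, pval = TRUE)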

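Unfortunately, because the data sources differ, the oncolnc data are unlikely to reproduce the proteinatlas plot exactly.

See: "TCGA数据库生存分析的网页工具哪家强" (Which web tool is best for survival analysis of TCGA data)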
