42
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Dynamics of breast cancer relapse reveal late recurring ER-positive genomic subgroups

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Introduction The rates and routes of lethal systemic spread in breast cancer are poorly understood due to the lack of molecularly characterized cohorts with long-term, detailed follow-up. Long-term follow-up is especially essential for ER-positive (ER+) breast cancer, where tumors continue to recur up to two decades after initial diagnosis 1–6 and there is a critical need to identify high-risk patients prior to lethal recurrence 7–9 . Here we present a statistical framework to model distinct disease stages (loco-regional recurrence (LR), distant recurrence (DR) and breast cancer-related death) and competing risks of breast cancer mortality, while yielding individual risk of recurrence predictions. Application of this model to 3240 breast cancer patients, including 1980 with molecular data, delineates the spatio-temporal patterns of relapse across the immunohistochemical (IHC), intrinsic (PAM50) 10,11 , and integrative (IntClust) 12,13 subtypes. We identify four late-recurring integrative subtypes, comprising a quarter (26%) of ER+, human epidermal growth factor receptor 2-negative (HER2-) tumors, each with characteristic genomic copy number driver alterations and high (median 42–55%) risk of recurrence up to 20 years post-diagnosis. Additionally, we define a subgroup of triple-negative breast cancers (TNBC) that rarely recur after 5 years and a separate subgroup that remain at risk. The integrative subtypes improve prediction of late distant relapse beyond clinical covariates (nodal status, tumor size, grade and IHC subtype). These findings illuminate opportunities for improved patient stratification and biomarker-driven clinical trials. Main Breast cancer is a multistate disease with clinically relevant intermediate endpoints such as LR and DR 14 . Critically, a patient’s prognosis can differ dramatically depending on when and where a relapse occurs, time since surgery, and time since LR or DR 15,16 These events are associated, and individual survival analyses of disease-free survival (DFS) or overall survival (OS) alone cannot fully capture patterns of recurrence associated with differential prognosis. Additionally, most survival analyses employ disease-specific death (DSD) as the primary endpoint and censor natural deaths. However, when competing risks of mortality occur, this approach induces bias 17 . This is particularly problematic for breast cancer, where ER+ patients experience higher mortality from non-malignant causes due to their increased age at diagnosis relative to ER− patients. We evaluated the extent of such bias on breast cancer survival estimates by analysing 3240 patients diagnosed between 1977–2005 with median 14 years clinical follow-up (referred to as the Full Dataset FD; Extended Data Fig.1, Supplementary Table 1, Methods). We compared the naïve cumulative incidence for DSD (computed as 1 – the survival probability) stratified by ER status considering only cancer-related deaths (Extended Data Fig.2a) relative to the estimates with the proper cumulative incidence functions accounting for different causes of death (Extended Data Fig.2b). These comparisons indicate that the incidence of DSD is overestimated for ER+ tumors (0.46 vs 0.37 at 20 years) due to the increased age of diagnosis (median 63.9 vs 53.0 years; p-value <1E-6) (Extended Data Fig.2c) relative to ER− tumors. Moreover, because the baseline survival functions for these subgroups are distinct, their differences cannot be adequately summarized with a single parameter in a Cox proportional hazards model. To overcome these limitations, we developed a non-homogenous (semi) Markov chain model that accounts for different disease states (LR, DR) and timescales (time since surgery, LR or DR), as well as competing risks of mortality and distinct baseline hazards across molecular subgroups, thereby enabling individual risk of relapse predictions (Fig.1a, Methods). The model also incorporates clinical variables known to influence breast cancer survival 18,19 , including age, tumor grade, tumor size and number of positive lymph nodes (all measured at diagnosis). We refer to this as the base clinical model onto which molecular subtype information can be incorporated. We fit this multistate model to the FD and recorded the hazards of moving through distinct states and the number of transitions between each pair of states (Supplementary Table 2, Methods). As expected, the majority of cancer related deaths (83% in ER+ and 87% in ER− tumors) occurred after distant metastasis. The remainder of cases likely reflect undetected recurrences or death due to other malignancies. Age at diagnosis was associated with the transition to death by other causes (p-value < 1E-6). Examination of the log hazard ratios and 95% confidence intervals for all other variables indicated that their effect decreased with disease progression (Extended Data Fig.2d). That is, clinical variables related to the primary tumor were more prognostic for earlier transitions than for later transitions. However, several tumor characteristics informed the risk of progression from LR to DR and from DR to death. In ER+ disease, higher tumor grade, number of positive lymph nodes and tumor size all increased the risk of progression to a later state. A longer time between surgery and LR or DR decreased the risk of transition to a later state and was more pronounced in ER− disease. We confirmed that our models were well calibrated, concordant with the established tool PREDICT 18 and that they performed comparably in external datasets (Extended Data Fig.1, Extended Data Fig.3, Methods, Supplementary Information). A powerful feature of our multistate model is that hazard rates can be transformed into transition probabilities representing the probability of moving from one state into another after a given time. To evaluate the patterns of recurrence across the established breast cancer molecular subgroups, we turned to the METABRIC molecular dataset (MD) composed of 1980 patients (Extended Data Fig.1), which includes assignments to the IHC subtypes (ER+/HER2+, ER+/HER2−, ER−/HER2+, ER−/HER2−), PAM50 11 expression subtypes and the genomic driver based IntClust subtypes 12,13 (Supplementary Table 3). We computed the baseline transition probabilities from surgery, LR or DR at various time intervals (2, 5, 10, 15 and 20 years) and the corresponding standard errors (SE) for average individuals in each subgroup (using the FD for comparisons by ER status and the MD for all others, Supplementary Table 4). After surgery, state transitions differed substantially across the various subtypes (Fig.1b). For example, the transition probabilities post surgery reveal different change points for ER+ versus ER− disease where ER− patients had a higher risk of DR and cancer death (D/C) in the first five years, after which their risk decreased considerably. In contrast, ER+ patients had a smaller, but longer risk period during the first ten years and this increased at a lower rate. Among ER− patients, the PAM50 Basal-like subgroup was nearly indistinguishable from the ER−/HER2− subgroup with the majority of cancer deaths in the first 5 years, similar to HER2+ patients (prior to the widespread use of trastuzumab). In contrast, the three predominantly ER− IntClust subgroups (IntClust4ER−, IntClust5 and IntClust10) exhibited substantial differences in their recurrence trajectories. As expected, IntClust5 (HER2+ enriched) generally had poor prognosis at 5 years (0.48, SE=0.04) with risk increasing to 0.65 (SE=0.04) at 20 years. For IntClust10 (Basal-like enriched), the first 5 years from surgery largely defined patient outcomes: the probability of relapse at 5 years was 0.33 (SE=0.03) and after 20 years rose to only 0.37 (SE=0.04) for an average patient. This pattern was distinct from IntClust4ER− patients who exhibited a persistent and increasing risk of relapse with a probability of 0.30 (0.05) at 5 years and 0.49 (0.05) after 20 years. The distinction between IntClust4ER− and IntClust10 is further apparent when examining the average probabilities of relapse among all patients across the IntClust subtypes after surgery or after being disease-free for 5 and 10 years (Fig.2a). Indeed, through the course of the disease, the risk of relapse changed considerably across the integrative subtypes and to a lesser extent the IHC and PAM50 subtypes (Fig.2a, Extended Data Fig.4). Moreover, the probabilities of DR or cancer death amongst ER−/Her2− patients who were disease free at 5 years post diagnosis revealed low (IntClust10) and high (IntClust4ER-) risk of late relapse TNBC subgroups, whereas IHC (and PAM50) subtypes homogenized this risk (Extended Data Fig.5). Dramatic differences were also apparent amongst ER+ patients with IntClust3, IntClust7, IntClust8 and IntClust4ER+ exhibiting better prognosis while IntClust1, IntClust2, IntClust6 and IntClust9 corresponded to late-recurring poor prognosis patients (Fig.2a). These four subgroups had exceedingly high-risk of relapse with mean probabilities ranging from 0.42 to 0.56 up to 20 years post surgery. IntClust2 exhibited the worst prognosis with a probability of relapse (0.56, SE: 0.02) second only to IntClust5. Collectively, these subgroups comprise 26% of ER+ cases (Fig.2bc) and thus define the minority of patients who may benefit from extended monitoring and treatment given the chronic nature of their disease 5,6 . Importantly, the four high-risk of relapse subgroups were enriched for characteristic genomic copy number alterations, which represent the likely drivers of each subgroup (Fig.2b). For example, IntClust2 tumors were defined by amplification and concomitant over-expression of multiple oncogenes on chromosome 11q13, including CCND1, FGF3, EMSY, PAK1 and RSF1 20–22 . IntClust2 accounts for 4.5% of ER+ cases, 96% of which have RSF1 amplification, compared to 0–22% of other subgroups. IntClust6 (5.5% of ER+ tumors) are characterized by focal amplification of ZNF703 23 and FGFR1 24 on chromosome 8p12 (100% of IntClust6 cases vs. 2–21% of others). IntClust1 (8% of ER+ tumors) exhibited amplification of chromosome 17q23 spanning the mTOR effector, RPS6KB1 (S6K1) 25 , which was gained or amplified in 96% and 70% of cases, respectively (vs. amplification in 0–25% of others). IntClust9 accounted for another 8% of ER+ cases and was characterized by amplification of the MYC oncogene at 8q24 with amplification in 89% of IntClust9 tumors (vs 3–42% of other groups). Thus the late-recurring ER+ subgroups are defined by genomic drivers, several of which are viable therapeutic targets 25–27 . Similar differences in the probability of late distant relapse were seen in the subset of patients whose tumors were ER+/HER2− (Fig.3ab, Extended Data Fig.4a–f), a group in which late relapse and strategies to target this, such as extended endocrine therapy, represent critical clinical challenges. In particular, the probabilities of DR or cancer death amongst patients who were disease free 5 years post diagnosis reveals significant risk for IntClust 1,2,6,9 (relative to IntClust3) that varied over time. Moreover, the risk was not fully captured by a model that included IHC subtype with clinical variables (age, tumor size, grade, number of positive lymph nodes, time since surgery) that have been shown to dictate distant relapse outcomes even after a long disease-free interval 5 (Fig.3a). We therefore assessed whether the integrative subtypes provided information about a patient’s risk of late distant relapse above and beyond what could be inferred optimally from standard clinical information. We found that the model including clinical variables combined with IHC subtype provided substantial information about the probability of distant relapse in ER+/HER2− patients who were relapse-free at 5 years: C-index of 0.63 (CI 0.58–0.68) at 10 years, 0.62 (CI 0.58–0.67) at 15 years, and 0.61 (CI 0.57–0.66) at 20 years (Fig.3c). However, including the IntClust subtypes significantly improved its predictive value: C-index of 0.70 (CI 0.64–0.75; improvement over the clinical model P = 0.00011) at 10 years, 0.67 (CI 0.63–0.72, P = 0.0016) at 15 years, and 0.66 (CI 0.62–0.71, P = 0.0017) at 20 years. These trends were recapitulated in an external validation cohort despite the smaller sample size and shorter follow-up times (prohibiting analyses at 20 years). Thus, information about the dynamics of late relapse provided by integrative subtype could not be inferred from standard clinical variables, including IHC subtype. We subsequently turned to the subset of patients who experienced a LR. LR is commonly treated with curative intent and is thought to be a high-risk event associated with increased rates (45 to 80%) of DR 28 . The transition probabilities after LR varied substantially according to pathological features of the primary tumor at diagnosis and molecular subtype, highlighting opportunities for intervention (Extended Data Fig.6, Extended Data Fig.7, Supplementary Tables 2–3). In contrast, following the initial DR all subgroups exhibited a high probability of cancer death, although the median times differed (Extended Data Fig.8, Supplementary Tables 2–3). Unique to our cohort is a subset of 618 patients (out of 1079 from the FD who relapsed) with a complete description of all recurrences (recurrent event dataset, RD), thereby enabling the detailed analysis of the rates and routes of distant metastasis and their lethality. These data revealed the varied time course over which metastases occurred and indicated that no sites of metastasis are exclusive to ER+ or ER− disease (Extended Data Fig.9a). Moreover, multiple distant metastases were common, even among favorable prognosis subgroups (Extended Data Fig.9b). We next examined the cumulative incidence and number of metastases at different organ sites stratified by ER status (Fig.4a). ER− cases harbored significantly more visceral disease (e.g. brain/meningeal: 27% vs. 11%, pulmonary: 50% vs. 41%) relative to ER+ cases. As previously reported 29,30 , bone metastases were more common in ER+ versus ER− cases (71% vs. 43%), but the cumulative incidence was similar. Thus, the higher proportions observed in ER+ disease appear not to reflect site-specific tropism: rather, bone metastases take a long time to develop, and ER− patients tend to die of other metastases first. ER+ tumors also more commonly present with the first metastasis in the bone (76% vs 61%). Similar comparisons stratified by IHC, PAM50, and IntClust subtypes revealed additional variability (Extended Data Fig.10). Striking differences in the rates of distant metastasis were also evident: ER− disease was characterised by a rapid series of relapses early after diagnosis, while most ER+ patients suffered just one early relapse (commonly bone) and if a second relapse occurred, the probability of additional relapses increased (Fig.4b, Methods). Thus after distant recurrence, subtype continues to dictate the rate of subsequent metastases, underscoring the importance of tumor biology. Both the number and site of relapses influenced the risk of death after recurrence with brain metastasis being most predictive. Risk estimates (Fig.4c) were comparable between ER+ and ER− tumors, suggesting that the impact of the site of metastasis on progression to death is similar. In summary, by leveraging a cohort of 3240 patients, including 1980 from METABRIC with detailed molecular characterization, LR and DR information, we have delineated the spatio-temporal dynamics of breast cancer relapse at unprecedented resolution. Our analyses are based on a powerful multi-state statistical model that yields individual risk of relapse estimates based on tumor features, clinical, pathological and molecular covariates, as well as disease chronology, and is available via a web application (see URL below). Unlike existing models used to calculate the benefits of adjuvant therapy at diagnosis such as PREDICT 18 , this research tool can be used to assess how a patient’s risk of recurrence changes throughout follow-up. Learning whether specific treatments change the outcomes of different integrative subtypes is important and will require analysis of randomized clinical trial cohorts. By classifying breast tumors into the 11 integrative subtypes, important differences in recurrence rates that were obscured in the IHC and PAM50 subtypes became apparent. Amongst TNBC patients, IntClust10 largely remains relapse-free after 5 years, whereas IntClust4ER− patients continue to be at significant risk of recurrence. Amongst ER+/HER2-patients, IntClust 1, 2, 6, and 9 have markedly increased risk of DR up to 20 years post-diagnosis and together account for one quarter of all ER+ tumors and the vast majority of late recurrences. Moreover, the integrative subtypes significantly improved the prediction of distant recurrence after 5 years in ER+/HER2− patients. Our findings thus address one of the contemporary challenges in breast oncology, namely identification of the subset of ER+ patients with high-risk of recurrence and tumor biomarkers that are more predictive of recurrence than standard clinical covariates 7,8 . Integrative subtyping may help determine whether women who are relapse-free 5 years after diagnosis might benefit from extended endocrine therapy or other interventions to improve late outcomes. Critically, the four late-recurring ER+ subgroups are enriched for genomic copy number driver alterations that can be therapeutically targeted 24–27 , thus paving the way for new treatment strategies for these high-risk patient populations. Methods Clinical cohort We employed data from 3240 patients (with a median follow-up of 9.77 years overall, and 14 years amongst patients who remain alive) derived from five tumor banks in the UK and Canada diagnosed between 1977–2005. Primary breast tumors and linked pseudo-anonymized clinical data were obtained with ethical approval from the relevant institutional review boards. The METABRIC study protocol was approved by the ethics committees at the University of Cambridge and British Columbia Cancer Research Centre. Manual curation and basic quality control was performed on the data. Observations that had relapse times equal to zero or relapse times equal to the last observed time were shifted 0.1 days. Local relapses that occurred after distant relapses were omitted. In total, 11 cases with stage 4 were also removed from all analyses. Benign and phylloid tumors were also discarded. Last follow-up time or time of death was the final endpoint for all patients. Special care was taken to remove second primary tumors from the dataset. Clinical parameters, such as tumor grade, were not centrally reviewed, which can lead to variability in the estimation of their effects. Samples were allocated to three datasets depending on the information available. The Full Dataset (FD) Clinical and pathological variables are available for this cohort (15394 transitions from 3147 patients). For a subset of 1980 patients we previously described an integrated genomic analysis based on gene expression and copy number data 12 and refer to this as the molecular dataset or METABRIC MD (9512 transitions from 1962 patients). For this cohort, tumors were stratified based on the IHC subtypes (ER+/HER2+, ER+/HER2−, ER−/HER2+, ER−/HER2−), the intrinsic subtypes (PAM50) 10,11 and the integrative (IntClust) subtypes 12,13 . Finally, for a subset of patients who experienced distant metastasis (618 out of the 1079 who relapsed from the FD), the date of each recurrence is available, enabling analysis of their spatio-temporal dynamics. We refer to this as the recurrent events dataset RD. The three datasets are summarized in Extended Data Fig.1a with clinical details and basic parameters describing the intermediate endpoints of LR and DR across distinct subgroups in Supplementary Table 1. We also established an independent metacohort composed of 1380 breast cancer patients from eight cohorts enabling external validation of our findings, despite their shorter median follow-up (8 years) (Extended Data Fig.1b). We sought to use the maximum information available to fit the models, keeping all the transitions with complete observations needed to estimate the hazard of that specific transition. Therefore, the total number of cases used in each model differs due to different missing values in clinical variables, molecular classification, etc that can affect different transitions. Model description The general model we fitted to our datasets is a multistate model that reflects the different risks of loco-regional relapse, distant relapse or disease-specific death conditioned on the current status of the patient. Although multistate survival models for breast cancer were proposed more than 60 years ago 31 , there are few such analyses in the literature 14,32,33 . Specifically, we employed a non-homogenous semi-Markov Chain with two absorbent states (Death/Cancer and Death/Other) as shown schematically in Fig.1. The model was stratified by molecular subtype and used a clock-reset time scale, in which the clock stops (clock-reset) when the patient enters a new state. Although there were a small number of transitions from distant to local relapse (15 ER+ cases and 7 ER−), we omitted the local relapse in these instances as we considered it redundant and only allowed transitions from local to distant relapse in our model. We also included the possibility of cancer death without a recurrence to account for cases where metastasis was not detected. R packages survival 34 and mstate 35 were used to fit the data. Several covariates were included in the model: age at state entry (diagnosis or relapse), tumor grade, tumor size and the number of positive lymph nodes, all of them as continuous variables (although in the case of lymph nodes, all values larger than 10 lymph nodes were coded as 10, to avoid excessive influence in the slope from extreme cases). The time from diagnosis was also included as continuous. Note that these formulations are a simplification from the modelling in our previous work 12 , where age, size and lymph nodes were modelled non-linearly through splines. We have simplified these effects to reduce the number of parameters in the model, but also, in the case of age, because its non-linearity is only relevant when overall survival is the endpoint. For dataset FD, a Cox model was fitted stratified on ER status. The effect of age on death/other causes was modelled with a different coefficient for each transition into non-malignant death (in each ER status), to account for differences in the age at relapse or diagnosis. Grade, Size and Lymph Nodes were allowed to have different coefficients from the starting state to states of recurrence/cancer death for each ER status. Time since diagnosis had different coefficients from the starting state of relapse to states of recurrence/ cancer death for each ER status and time since LR had different coefficients from distant relapse state to cancer related death for each ER status. The time since LR was not predictive of the time to DR and therefore was not included in further analyses. For dataset MD, and because of the large number of molecular subtypes, we reduced the number of parameters constraining their values to be the same for the different molecular subtypes. Based on different fits and the results of likelihood ratio tests we observed some effects to be markedly different between transitions: age had a coefficient for transitions from surgery or loco-regional relapse into death/other causes for all molecular subtypes and another for transitions from distant relapse into death/other causes. Grade and lymph nodes had a value for transitions from diagnosis and another for transitions from relapse to states of recurrence/death, identical for each molecular subtype. Size had a value for transitions from diagnosis and another for transitions from loco-regional relapse to states of recurrence/death, identical for each molecular subtype. Time since diagnosis had the same coefficient from the starting state of relapse to states of recurrence/death, identical for all molecular subtype. This model was fit three times, one for each molecular classification, based on ER/HER2 status (FourGroupsM), PAM50 (Pam50M) and the Integrative Clusters (ICM); each of them stratified by the respective molecular subgroups. We used a robust variance estimate in all models (option cluster(id) in coxph() function) and performed likelihood ratio tests in order to reduce the number of parameters in each model. Since the number of samples in the MD is smaller than the FD, we retained only the most important covariates and assumed the same effect in each subgroup. Transition probabilities for each molecular subtype Using the model fit, we obtained the hazards for each transition for a given individual. We used these hazards to compute the corresponding transition probabilities as follows. We employ a clock-reset model and define all probabilities starting at the time of entry to the last state. All times s, t are also defined starting from the time of entry. Let the set of states be {S=disease-free/after surgery, L=loco-regional relapse D=distant relapse, C=cancer death, O=other cause of death}. We condition on the vector of clinical covariates x, which includes the time from surgery (in the case of relapse this variable has an effect on the hazards). Transitions from distant relapse Following 14,36 , we define the conditional probability of having no further event between times t and s for a patient with distant relapse at time t as S D ( s , t | x ) = exp ⁡ { − ∫ t s ( λ D , C ( u | x ) + λ D , O ( u | x ) ) d u } where λi,j (t|x) is the hazard of moving from state i to state j at time t with the vector of covariates x (including the time from surgery or age, that must be updated after a relapse). Then, the prediction probabilities for each path are: π D C ( u , t | x ) = ∫ t u λ D , C ( s | x ) S D ( s , t ) d s π D O ( u , t | x ) = ∫ t u λ D , O ( s | x ) S D ( s , t ) d s π D ( u , t | x ) = 1 − ( π D C ( u , t | x ) + π D O ( u , t | x ) ) Transitions from loco-regional relapse Similarly, we obtain: S L ( s , t | x ) = exp ⁡ { − ∫ t s ( λ L , D ( u | x ) + λ L , C ( u | x ) + λ L , O ( u | x ) ) d u } π L D , C ( u , t | x ) = ∫ t u λ L , D ( s | x ) π D C ( u − s , 0 | x ) S L ( s , t | x ) d s π L D , O ( u , t | x ) = ∫ t u λ L , D ( s | x ) π D O ( u − s , 0 | x ) S L ( s , t | x ) d s π L D ( u , t | x ) = ∫ t u λ L , D ( s | x ) π D ( u − s , 0 | x ) S L ( s , t | x ) d s π L C ( u , t | x ) = ∫ t u λ L , C ( s | x ) S L ( s , t | x ) d s π L O ( u , t | x ) = ∫ t u λ L , O ( s | x ) S L ( s , t | x ) d s π L ( u , t | x ) = 1 − ( π L D , C ( u , t | x ) + π L D , O ( u , t | x ) + π L D ( u , t | x ) + π L C ( u , t | x ) + π L O ( u , t | x ) ) Transitions after surgery S S ( s , t | x ) = e x p [ − ∫ t s ( λ S , L ( u | x ) + λ S , D ( u | x ) + λ S , C ( u | x ) + λ S , O ( u | x ) ) d u ] π S L , D , C ( u , t | x ) = ∫ t u λ S , L ( s | x ) π L D , C ( u − s , 0 ) S S ( s , t | x ) d s π S L , D , O ( u , t | x ) = ∫ t u λ S , L ( s | x ) π L D , O ( u − s , 0 ) S S ( s , t | x ) d s π S L , C ( u , t | x ) = ∫ t u λ S , L ( s | x ) π L C ( u − s , 0 ) S S ( s , t | x ) d s π S L , O ( u , t | x ) = ∫ t u λ S , L ( s | x ) π L O ( u − s , 0 ) S S ( s , t | x ) d s π S L , D ( u , t | x ) = ∫ t u λ S , L ( s | x ) π L D ( u − s , 0 ) S S ( s , t | x ) d s π S D , C ( u , t | x ) = ∫ t u λ S , D ( s | x ) π D C ( u − s , 0 ) S S ( s , t | x ) d s π S D , O ( u , t | x ) = ∫ t u λ S , D ( s | x ) π D O ( u − s , 0 ) S S ( s , t | x ) d s π S L ( u , t | x ) = ∫ t u λ S , L ( s | x ) π L ( u − s , 0 ) S S ( s , t | x ) d s π S D ( u , t | x ) = ∫ t u λ S , D ( s | x ) π D ( u − s , 0 ) S S ( s , t | x ) d s π S C ( u , t | x ) = ∫ t u λ S , C ( s | x ) S S ( s , t | x ) d s π S O ( u , t | x ) = ∫ t u λ S , O ( s | x ) S S ( s , t | x ) d s π S ( u , t | x ) can be computed as 1 minus the sum of the others. Prediction probabilities for being in a particular state at a certain time can also be computed summing the appropriate paths. Note that the main difficulty in computing these probabilities is updating the corresponding hazards every time a transition occurs, as they may depend on variables that change over time or after a transition to a different state. In our implementation we tried to follow the style in the mstate package 35 . Standard Errors for the transition probabilities in our model If our model was Markovian (as the clock-forward model), the transition probabilities could be easily computed through the product-integral representation 37 and it would also be straightforward to obtain estimates of their standard errors. However, for our clock-reset model the estimation of standard errors is complicated, so we used a semi-parametric bootstrap approach to obtain such estimates 38 . Briefly, for every bootstrap replicate (B=100), we sampled trajectories for each observation in our original dataset based on our fitted model. These trajectories were fitted to the original model and bootstrap hazards for the original average individuals were computed. Then, the formulas described earlier were used to obtain bootstrap transition probabilities. Because these bootstrap estimates are not likely to converge to the theoretical estimates in transitions with a small number of observed instances, we computed the standard deviation of the bootstrap estimates as an indication of the variability of these predictions for a given patient. Transition probabilities for specific events The transition probabilities obtained for each patient can be aggregated to obtain probabilities of visiting specific states (LR, DR) or specific endpoints. We used these probabilities in two ways: as an example of individual predictions for an average patient for each molecular subtype (based on typical or average values of each covariate), as in Supplementary Table 4B, Fig.1b, Extended Data Fig.6 and Extended Data Fig.8 together with a confidence interval computed using the obtained probabilities +/− 1.96 times the standard deviation of the bootstrap estimates described above, that represent variability around individual predictions. We also computed probabilities for all patients to show their distribution in each molecular subtype, as in Supplementary Table 4A and Fig.2a, Fig.3a, Extended Data Fig.4 and Extended Data Fig.5. Confidence intervals computed using the mean of the probabilities +/− 1.96 times the standard error of the mean represent variability around the mean in each subtype. Sites of relapse For the RD datasets, each patient can have several relapses. Instead of adding the site to our multistate models, we selected only patients who had distant relapse. First, in Fig.4a and Extended Data Fig.10, we tested if the proportions of relapses in each organ differed by molecular subtype. We fitted a logistic regression model with relapse as a binary variable and the sites of metastases as dependent variables. We computed simultaneous tests using the R package multcomp 39 using the Dunnet method 40 . Only those proportions with a p-value smaller than 0.05 were considered significant. In the same figures, cumulative incidence distributions for each organ were computed independently, that is, no competing risk model was fitted. We modelled recurrent distant metastases (Fig.4b) using the Prentice, Williams and Peterson 41 (PWP) conditional model. This model allows for different baseline hazards for each consecutive recurrence while keeping at risk for recurrence i only those individuals that have experimented the recurrence i-1. Finally, in Fig.4c we fitted a Cox model with time dependent variables to estimate the hazard of having metastasis in each organ. We also included in this model the clinical variables from the primary tumor (tumor grade, tumor size and number of positive lymph nodes). Goodness of fit testing Goodness of fit testing was performed for all models. Proportional hazards assumption was tested using the Schoenfeld Residuals vs. time using the survival function cox.zph() 34 . None of the models showed covariates that violated the assumption, except the model for sites of metastasis (ER+), where the number of metastases and “other metastasis” were significant and the model for sites of metastasis (ER-) where grade and the number of metastases were significant. Visual inspection of the plots showed that the trend was roughly flat and thus the violation was not critical. In the model that includes ER, as previously shown ER violates the proportional hazard assumption. However, this model was only used to test differences in the hazard ratios of the other covariates according to ER. Model Validation and Calibration We validated each of the models using several approaches, as outlined below. Internal validation We validated the global predictions of the model on all transitions using the bootstrap approach described in detail in 42 using the rms R package. We used the following measures of predictive ability: Somers’ Dxy rank correlation (Dxy). This is 2(c-0.5), where c is the c-index Nagelkerke’s R2, which is the square root of the proportion of log likelihood explained by the model from the log likelihood that could be explained by a “perfect” model, with a penalty for model complexity Slope shrinkage (slope), a measure of how much the estimates are affected by extreme observations Discrimination index D, derived from the log-likelihood at the shrunken linear predictor Unreliability index U, a measure of the difference between the model maximum log likelihood is from a model with frozen coefficients Overall quality index Q, a normalized and penalized for unreliability log likelihood g-index (g) on the log relative hazard (linear predictor) scale (Gini’s mean difference) Each measure was computed on the training set and on 200 bootstrap test sets, estimating the optimism and the corrected indexes for predictions at 5, 10 and 15 years (see Extended Data Fig.3a). Internal calibration We also employed the following procedure for model calibration as described in 42 : Interpolation of the hazard function using splines (hare method) among all the cases as a general function of the predictor variables and time Computation of the predicted values for a given time point (5, 10 or 15 years) Computation of the differences between observed and predicted Using 200 bootstrap datasets, computation of the optimism in those differences Extended Data Fig.3b shows a boxplot of the mean absolute error of all predictions. External calibration As an external comparison of the predicted probabilities of our models, we used predict v2.1 18 , a tool that has been validated extensively. PREDICT uses a model with several variables (including the effect of treatment) and produces estimates of the probability of cancer-specific death (C/D) and non-malignant death (O/D), as well as estimates of the effect of treatment. We compared the probabilities for these events with PREDICT using Pearson correlation (see Extended Data Fig.3cd). External validation We used two sets of external samples to validate the predictions of our models: A set of METABRIC samples that were not used in the original study including 121 patients with copy number data and 57 patients with expression data. We already had survival data from these patients (in fact they are part of the full dataset FD, but because they have not been used to fit the IntClust Model, they could be employed to test the validity of the c-index on an external dataset). We classified these tumours into IntClust groups using the iC10 13 package. An external dataset of 1380 patients from 8 different cohorts and different survival information. We validated predictions of disease-specific survival (DSS), overall survival (OS), relapse-free survival (RFS) and distant-relapse free survival (DRFS). We compiled a metacohort by merging early breast cancer cohorts where expression data (Affymetrix array), outcome and covariates are available, including GSE19615 (DFHCC 43 ), GSE42568 (Dublin 44 ), GSE9195 (Guyt2 45 ), GSE45255 (IRB/JNR/NUH 46 ), GSE11121 (Maintz 47 ), GSE6532 (TAM 45 ), GSE7390 (Transbig 48 ) and GSE3494 (Upp 49 ). Original data (raw CEL files) were downloaded and pre-processed using the rma function from the affy 50 package. The intensities were then quantile normalized and corrected for batch effects with the COMBAT function from the sva 51 package. PAM50 was called using the genefu 52 package. ER, PR and Her2 status were extracted from the expression using probes 205225_at, 208305_at and 216836_s_t using a Gaussian mixture model. IC10 subgroups was called using iC10 package. C-indices and summary c-indices were calculated using survcomp 53 package. For the combined metacohort scores, we calculated c-scores for each individual cohort and then combined them using the function combine.est from survcomp 53 package. Confidence intervals and p-values for comparing c-indexes were computed with the same package. Extended Data Fig.3e shows the c-indices and confidence intervals for these comparisons. General Statistical considerations All tests were performed two-sided (except where indicated). Adjustment for multiple comparisons was done as described in the sections “Comparison of probabilities of relapse in ER+ high risk Integrative Subtypes” (see Supplementary Methods) and the comparison of proportions of metastases in each organ from Fig.4a and Extended Data Fig.10. All analyses were conducted in R 3.5.1 54 Extended Data Extended Data Fig.1 | Description of the cohorts used in this study. a. Description of the METABRIC discovery cohort, clinical characteristics and flow chart of sample inclusion for analysis. b. Description of the validation cohort, clinical characteristics and flow chart of sample inclusion for analysis. Extended Data Fig.2 | Effect of censoring non-malignant deaths in the estimation of disease-specific survival and prognostic value of clinical covariates at different disease states. a. Cumulative incidence computed as 1-Kaplan-Meier estimator using only disease-specific death as endpoint and censoring other types of death. b. Cumulative incidence computed using a competing-risk model that takes into account different causes of death. The bias of the 1-Kaplan-Meier estimator is visible. c. Distribution of age at the time of diagnosis for ER− and ER+ patients. The number of patients in each group is indicated in all Panels. This analysis was done with the FD. d. Log Hazard Ratios (HR) calculated using the multistate model stratified by ER status (n=3147) for different covariates, namely grade, lymph node (LN) status, tumor size (size), time from local relapse, time from surgery. Log HR are shown from different states, including post surgery (PS; HR of progressing to relapse or DSD), loco-regional recurrence (LR; HR of progressing to DR or DSD) and distant recurrence (DR; HR of cancer-specific death). 95% confidence intervals are shown. This analysis was done with the FD. Extended Data Fig.3 | Model calibration and validation in an external dataset. a. Internal validation of the global predictions of the models on all transitions using bootstrap (n=200). Boxplots are computed using the median of the observations, the first and third quartiles as hinges and the +/−1.58 Interquartile range divided by the square root of the sample size as notches. The optimism (difference between the training predictive ability and the test predictive ability of several discriminant measures (see Methods). b. Internal calibration of the global predictions of the models on all transitions using bootstrap (n=200). The distribution of the mean absolute error between observed and predicted is plotted. Boxplot defined as above (see Methods). c. External calibration of disease-specific death (DSD) risk and non-malignant death risk using PREDICT 2.1 (n=1841). The distribution of the mean absolute error between the predictions of PREDICT and our model based on ER status only is plotted. Boxplots defined as above. d. Scatterplot of the predictions of DSD risk computed by PREDICT and our model based on the IntClust subtypes only at 10 years (n=1841) (see Methods). Pearson correlation is shown. e. Concordance index (c-index) of prediction of risk of distant relapse (distant relapse free survival, DRFS), disease-specific death (disease specific survival, DSS), death (overall survival, OS) and relapse (relapse free survival, RFS) in the 178 withheld METABRIC samples and in a metacohort composed of 8 published studies amongst ER−/HER2− patients in the high-risk IntClust subtypes, where results are shown for individual cohorts and the combined metacohort (see Methods, Supplementary Information). Error bars correspond to 95% confidence intervals for the c-index. The number of patients in each group is indicated. Extended Data Fig.4 | Different subtypes have distinct probabilities of recurrence. a. Average probability of experiencing a distant relapse (DR, defined as the probability of having a distant relapse at any point followed by any other transition) for the high risk ER+ IntClust (IC) subtypes (IC1; n=134, IC6; n=81, IC9; n=134, IC2; n=69) relative to IC3 (n=269), the best prognosis ER+ subgroup. This analysis was restricted to ER+/HER2− cases, which represent the vast majority for each of these subtypes. Error bars represent 95% confidence intervals for the mean. b. As in Panel (a), but showing the average probability of experiencing DR or cancer related death after a LR (IC1; n=21, IC6; n=10, IC9; n=21, IC2; n=13, IC3; n=30). c. Average probability of recurrence (distant relapse or cancer-specific death) after loco-regional relapse for all patients in each of the 11 IntClust subtypes. d. Median time until an additional relapse (DR or cancer specific death) after LR for all patients in each the 11 IntClust subtypes (n=270). This has been computed using a Kaplan-Meier approach with competing risks of progression and non-malignant death. Error bars represent 95% confidence intervals for the median time. Asterisks denote situations where the median time cannot be computed because less than 50% of the patients relapsed. This analysis was done with the MD. e. Average probability of cancer related death after DR for all patients by subtype. f. As in Panel (d), except that the median time until cancer specific death after DR is shown (n=596). g. Mean probabilities of having relapse after surgery and after being 5 and 10 years disease-free (see Methods and Supplementary Table 3) for the patients in each of the four clinical subtypes. Error bars represent 95% confidence intervals. The number of patients in each group is indicated. h, i, j, k. Same as Panels (b, c, d, e) for the IHC subtypes (same sample sizes). l. As in Panel (g) but for the PAM50 subtypes. The number of patients in each group is indicated. m, n, o, p. Same as Panels (b, c, d, e) for the PAM50 subtypes (same sample sizes except for Panel (p); n=593). Extended Data Fig.5 | The ER−/HER2− integrative subtypes exhibit distinct risks of relapse. Probabilities of distant relapse (DR) or cancer related death (C/D) amongst ER−/Her2− patients who were disease free at 5 years post diagnosis reveals dramatic differences in the risk of relapse for TNBC IntClust (IC) subtypes IC4ER− versus the IC10 (Basal-like enriched) subtype. Here the base clinical model with IHC subtypes is compared with the base clinical model plus IntClust subtype information. Error bars represent 95% confidence intervals. The number of patients in each group is indicated. Extended Data Fig.6 | Subtype specific risks of relapse after loco-regional relapse. Transition probabilities from LR to other states (LR=Loco-regional relapse, DR=Distant relapse, D/C=Cancer/disease specific death, D/O=Death by other causes) for individual average patients stratified based on ER status, IHC, PAM50, or IntClust subtypes. 95% confidence bands were computed using bootstrap. This analysis was done with the FD for ER+/ER− comparisons and the MD for the remainder. Extended Data Fig.7 | Associations between probabilities of distant relapse 10 years after loco-regional relapse with clinico-pathological and molecular features of the primary tumor. For each patient that had a loco-regional recurrence (LR), the 10-year probability of having distant relapse (DR) or cancer-related death (D/C) is plotted against different variables. A loess fit is overlaid in order to highlight the relationship between the probability and tumor size or time of relapse. Boxplots are computed using the median of the observations, the first and third quartiles as hinges and the +/−1.58 interquartile range divided by the square root of the sample size as notches. This analysis was done with the MD and the model was stratified by IntClust subtype (n=257). Extended Data Fig.8 | Subtype specific risks of relapse after a distant relapse. Transition probabilities from DR to other states (LR=Loco-regional relapse, DR=Distant relapse, D/C=Cancer related death, D/O=Death by other causes) for individual average patients stratified based on ER status, IHC, PAM50 or IntClust subtypes. 95% confidence bands were computed using bootstrap. This analysis was done with the FD for ER+/ER− comparisons and the MD for the remainder. Extended Data Fig.9 | Distribution of the number of relapses by molecular subtype. a. Times of distant recurrence (DR) for ER− and ER+ patients (n=605). Each dot represents a distant recurrence, coded by color for different sites. b. Distribution of the number of distant relapses for different subtypes (n=611), based on ER/HER2 status (ER+/HER2+ n=36, ER+/HER2− n=263, ER−/HER2+ n=41, ER−/HER2− n=82), PAM50 (Basal n=79, Her2 n=69, Luminal A n=101, Luminal B n=138, Normal n=33) and IntClust subtypes (IC1 n=40, IC2 n=25, IC3 n=32, IC4ER+ n=46, IC4ER− n=16, IC5 n=72, IC6 n=23, IC7 n=24, IC8 n=54, IC9 n=38, IC10 n=52). ER status was imputed based on expression in 6 samples. These analyses were done with RD cohort. Extended Data Fig.10 | Site specific patterns of relapse in the IHC, PAM50 and IntClust subtypes. a. Left Panel: Percentages of patients with a given site of metastasis in the IHC subtypes (barplots, total numbers also indicated). Upright triangles indicate significant positive differences in that group with respect to the overall mean and inverted triangles indicate significant positive differences in that group with respect to the overall mean using simultaneous testing of all sites (see Methods). Location of metastatic sites is not anatomically accurate. Right Panel: Cumulative incidence functions (as 1-Kaplan-Meier estimates) for each site of metastasis in the IHC subtypes. The same patient can have multiple sites of metastasis. b. Same as in Panel (a) but for the PAM50 subtypes. c. Same as in Panel (a) but for the IntClust subtypes. These analyses were done with RD cohort. Supplementary Material Reporting Summary Supplementary Information Guide Supplementary Methods Supp Table 1 Supp Table 2 Supp Table 3 Supp Table 4 Supp Table 5 Supp Table 6 Supp Table 7 Supp Table 8

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: not found

          20-Year Risks of Breast-Cancer Recurrence after Stopping Endocrine Therapy at 5 Years.

          The administration of endocrine therapy for 5 years substantially reduces recurrence rates during and after treatment in women with early-stage, estrogen-receptor (ER)-positive breast cancer. Extending such therapy beyond 5 years offers further protection but has additional side effects. Obtaining data on the absolute risk of subsequent distant recurrence if therapy stops at 5 years could help determine whether to extend treatment.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Subtyping of Breast Cancer by Immunohistochemistry to Investigate a Relationship between Subtype and Short and Long Term Survival: A Collaborative Analysis of Data for 10,159 Cases from 12 Studies

            Introduction Breast cancer is a heterogeneous disease that can be classified using a variety of clinical and pathological features. Classification may help in prognostication and targeting of treatment to those most likely to benefit. Currently, estrogen receptor (ER) status and human epidermal growth factor receptor-2 (HER2) status are routinely used as predictive markers to select specific adjuvant therapies. Prognostic markers may also be used to target adjuvant chemotherapy to those at highest risk of poor outcome—for example, the risk prediction tool Adjuvant!Online (www.adjuvant.org) uses prognostic markers to predict the likely absolute benefit of postoperative hormonal and/or chemotherapy and is widely used by oncologists to identify patients most likely to benefit from adjuvant treatment. Perou et al. identified four breast cancer subtypes on the basis of gene-expression profiling of 39 invasive breast tumours and three normal breast specimens [1]. There was one ER-positive (ER+/luminal-like) and three ER-negative subtypes (basal-like, ERBB2+, and normal-like). In addition to expressing the ER receptor, luminal-like tumours expressed other genes that were characteristic of luminal or glandular epithelial cells of origin. The basal-like tumours expressed basal or myoepithelial markers, and none of the basal tumours expressed ER. Similar to the basal-like tumours, overexpression of the ERBB2 oncogene was associated with low ER. The normal-like subgroup was typified by high gene expression for basal and low expression for luminal breast epithelium. A subsequent gene expression analysis by Sorlie et al. of patterns in 78 breast cancers, three fibroadenomas, and four normal breast tissues suggested that the luminal-like subtype could be further separated into two subgroups: luminal A and luminal B [2]. The molecular subtypes were reflected in differences in prognosis. Overall and relapse-free survivals were most favourable for luminal A tumours and least favourable for ERBB2+ and basal-like breast cancers. The investigators also suggested that there may be a third luminal subgroup, the luminal C tumours, but this has not been supported by the subsequent analysis of an expanded dataset [3]. The classification of breast cancers into subgroups on the basis of gene expression patterns in tumour tissue is often regarded as the gold standard, but widespread use of gene-expression profiling in either the clinical or the research setting remains limited. Lack of widespread use of expression profiles is primarily due to the expense and technical difficulty encountered when carrying out high-throughput gene-expression profiling using paraffin-embedded material. Moreover, the currently defined subtypes based on expression profiling were determined through the study of relatively small numbers of tumours and these subgroups may not be definitive. Consequently there is interest in using immunohistochemical (IHC) markers to classify tumours into subtypes that are surrogates for those based on gene-expression profiling [4]. Many investigators have used IHC to classify tumours but have used different naming conventions. Generally a hierarchical classification is used, with luminal and nonluminal tumours defined as those tumours that express either ER or progesterone receptor (PR) and those that do not. The luminal and nonluminal groups can then be further subdivided according to HER2-expression status to generate four subtypes, and these four subtypes can each be categorised according to whether or not they express a basal marker yielding a total of eight subtypes. The mapping of these eight IHC subtypes onto the five subtypes based on gene expression is not exact. Luminal A tumours as defined by gene expression have, in general, higher expression of ER-related genes and lower expression of proliferative genes than luminal B tumours [5]. However, there are no established IHC markers for subdividing the luminal subtypes into the same categories. Recently, it has been suggested that the luminal B subtype is equivalent to those that express either HER2 or the proliferation marker KI67 [6]. The nonluminal tumours are ER negative and PR negative and are generally subdivided into three groups. The nonluminal, HER2-positive tumours are the equivalent of the ERBB2-overexpressing tumours. Tumours that do not express ER, PR, or HER2—the triple negative phenotype (TNP) tumours—are often regarded as equivalent to the basal subtype as they can be easily identified with IHC markers that are currently used in routine clinical use. However, not all TNP tumours express basal cytokeratins (CKs), and within the TNP subtype, expression of basal markers may reflect important clinical differences. Expression of either CK5/6 or epidermal growth factor receptor (EGFR) has been shown to accurately identify basal-like tumours classified using gene expression [7],[8], and several published studies have used these markers to subclassify the TNP tumours into a core basal subgroup (CBP), which is equivalent to the basal-like from expression profiling and the five negative phenotype (5NP: ER−, PR−, HER2−, CK5/6−, and EGFR−). Although this hierarchical classification is commonly used, questions remain as to whether these groups are biologically distinct and clinically relevant. For example, it has been suggested that basal markers can be used to classify the basal tumours independent of other markers [9]. Cheang et al. reported a significantly poorer survival in CBP tumours compared to the 5NP tumours [10], an observation that supports the notion that the two are biologically distinct types of the TNP tumours. This finding was not confirmed by a smaller study with limited power to detect small differences [11]. A third study reported that the prognostic significance of CBP tumours was similar to that of the TNP tumours [12]. However, they did not explicitly compare the CBP and 5NP subtypes. Previously published studies have either compared the five subtypes by using the luminal HER2-negative tumours as a reference category to compare with the other four subtypes [8],[10],[12],[13], or they have compared the subtypes by restricting the analysis to either luminal or nonluminal tumours [6],[11]. Unanswered questions include whether the behaviour of luminal HER2-positive tumours and the nonluminal HER2-positive tumours are different, whether the behaviour of luminal basal-positive tumours is different from that of the nonluminal basal-positive tumours, and whether basal marker status is important in the luminal, HER2-negative tumours. The association between ER status and mortality is known to be time dependent, with hazard ratios for ER-positive versus ER-negative tumours being lower than one in the first years after diagnosis and becoming higher than one after 7–10 y. Mortality in women with ER-positive tumours remains fairly constant over time, whereas the mortality in women with ER-negative tumours is initially higher than that in women with ER-positive disease and then falls to a lower rate after 7–10 y [14]–[16]. In addition, Tischkowitz and colleagues reported that the prognostic effects of both TNP and CBP tumours compared to luminal tumours tended to diminish over time, whereas the effect of CK5 and other basal markers, when considered alone, might increase with time [12]. Another study reported that the effects of the CBP were attenuated over time [13]. Inspection of the Kaplan-Meier survival curves published by Cheang et al. also suggest that the prognostic effects of the CBP and 5NP subtypes are time dependent [10]. All the major subtypes apart from the luminal A tumours are relatively infrequent, and only very large studies with prolonged follow-up have the power to study meaningful differences in prognosis. The aim of this study was to pool individual data from multiple breast cancer case series, in order to definitively establish the relative survival of the major subtypes of breast cancer as classified using five IHC markers, and to characterise their prognostic effects over time. Materials and Methods Ethics Statement All studies were approved by the relevant research ethics committee or institutional review board. Participants in Amsterdam Breast Cancer Study (ABCS), Helsinki Breast Cancer Study (HEBCS), Jewish General Hospital (JGH), Mayo Clinic Breast Cancer Study (MCBCS), Melbourne Collaborative Cohort Study (MCCS), Polish Breast Cancer Study (PBCS), Sheffield Breast Cancer Study (SBCS), and Study of Epidemiology and Risk factors in Cancer Heredity (SEARCH) provided informed written consent. Samples for British Columbia Cancer Agency (BCCA), Nottingham Breast Cancer Case Series (NOBCS), University of British Columbia (UBC), and Vancouver General Hospital (VGH) were from legacy archival material and individual consent was not obtained. All data were anonymised before being sent to the coordinating centre for analysis. Study Populations The international breast cancer association consortium (BCAC) comprises a large number of studies investigating the role of common germline genetic variation in breast cancer susceptibility [17]. In addition to data on germline genotype, many BCAC studies have detailed pathological data on the breast cancer cases linked to follow-up data. All BCAC studies that had collected IHC data on ER, PR, HER2, and either EGFR or CK5/6 or both, in addition to survival time data and data on tumour grade, size, and nodal status were eligible for inclusion in this study. The investigators of the three previously published studies with equivalent data [10]–[12], were also invited to contribute their data, as were the investigators of a fourth large breast cancer case series that had taken part in a previous collaboration involving other BCAC studies [18]. All studies provided data on age at diagnosis, vital status, breast cancer-specific mortality, time between diagnosis and ascertainment, follow-up time, tumour grade (low, intermediate, and high), tumour size (<2 cm, 2–4.9 cm, ≥5 cm) and node status (positive or negative). In total, 12 studies from Europe, North America, and Australia contributed data on 10,159 cases with complete data [7],[9],[10],[12],[18]–[29]. Nine studies also provided data on whether or not the patient had been treated with adjuvant hormonal therapy or adjuvant chemotherapy. These data were available for a subset of 8,171 and 8,061 cases, respectively. The studies are described in Table 1. 10.1371/journal.pmed.1000279.t001 Table 1 Description of participating studies. Study Country Case Ascertainment Case Definition Age Range (y) References ABCS The Netherlands Hospital-based All cases of operable, invasive cancer diagnosed from 1974 to 1994 in four Dutch hospitals. Familial non-BRCA1/2 cases <50 from the Clinical Genetic Centre at The Netherlands Cancer Institute 23–50 [19] BCCA Canada Hospital-based Women diagnosed with invasive breast cancer between 1986 to 1992 and identified through the British Columbia Cancer Agency 23–89 [7],[10] HEBCS Finland Hospital-based (1) Consecutive cases (883) from the Department of Oncology, Helsinki University Central Hospital 1997–1998 and 2000; (2) Consecutive cases (986) from the Department of Surgery, Helsinki University Central Hospital 2001–2004; (3) Familial breast cancer patients (536) from the Helsinki University Central Hospital, Departments of Oncology and Clinical Genetics (1995–) 22–96 [20]–[22] JGH Canada Hospital-based Ashkenazi Jewish women diagnosed with nonmetastatic, invasive breast cancer at Jewish General Hospital, Montreal between 1980 and 1995 26–66 [12] MCBCS USA Hospital-based Incident cases residing in six states (Minnesota, Wisconsin, Iowa, Illinois, North Dakota, South Dakota) seen at the Mayo Clinic in Rochester, Minnesota from 2002–2005 22–89 [23] MCCS Australia Cohort Incident cases diagnosed within the Melbourne Collaborative Cohort Study during the follow-up from baseline (1990–1994) to 2004 of the 24,469 participating women 30–82 [24] NOBCS UK Hospital-based Primary operable breast carcinoma patients presenting from 1986 to 1998 and entered into the Nottingham Tenovus Primary Breast Carcinoma Series. 26–93 [9] PBCS Poland Population-based Incident cases from 2000–2003 identified through a rapid identification system in participating hospitals covering ∼90% of all eligible cases; periodic check against the cancer registries in Warsaw and Łódź to assure complete identification of cases 27–75 [25] SBCS UK Hospital-based Women with pathologically confirmed breast cancer recruited from surgical outpatient clinics at the Royal Hallamshire Hospital, Sheffield, 1998–2002; cases are a mixture of prevalent and incident disease 29–93 [26],[27] SEARCH UK Population-based Two groups of cases identified through East Anglian Cancer Registry: (1) prevalent cases diagnosed age <55 y from 1991–1996 and alive when study started in 1996; (2) incident cases diagnosed age <70 y diagnosed after 1996 23–69 [18] UBCBCT Canada Hospital-based Women with stage I to III breast cancer who participated in four different British Columbia Cancer Agency clinical trials between 1970 and 1990 and all received chemotherapy 22–90 [28],[29] VGH Canada Hospital-based Women with primary breast cancer who underwent surgery at Vancouver General Hospital 1975–1995 28–91 [12] ABCS, Amsterdam Breast Cancer Study; BCCA, British Columbia Cancer Agency; HEBCS, Helsinki Breast Cancer Study; JGH, Jewish General Hospital; MCBCS, Mayo Clinic Breast Cancer Study; MCCS, Melbourne Collaborative Cohort Study; NOBCS, Nottingham Breast Cancer Case Series; PBCS, Polish Breast Cancer Study; SBCS, Sheffield Breast Cancer Study; SEARCH, Study of Epidemiology and Risk factors in Cancer Heredity; UBCBCT, University of British Columbia Breast Cancer Trials; VGH, Vancouver General Hospital. Immunohistochemistry and Tumour Classification Data for these antibodies were either derived from IHC performed in a research setting or collated from patient records by the individual groups. The methods used by each study for each marker are shown in Table S1. The cases were grouped into subtypes on the basis of their protein expression profile (Figure 1). Luminal tumours were those with positive staining for ER or PR. Luminal tumours were subdivided according to HER2 status into luminal 1 (HER2-negative), which is broadly equivalent to the luminal A tumours defined by gene expression, and luminal 2 (HER2-positive) tumours. The luminal 2 tumours are a subset of the luminal B tumours because some of the tumours classified as luminal 1 would be expected to express proliferative markers and thus be misclassified luminal B tumours. The nonluminal tumours were those that were negative for both ER and PR. These were subdivided by HER2 expression status into the nonluminal HER2-positive tumours and the TNP tumours. The TNP tumours were further subdivided into the CBP tumours (either CK5/6 or EGFR positive) and the 5NP tumours (CK5/6-negative and EGFR-negative). Four studies did not provide data for EGFR, and for these studies the 5NP tumours were those that were negative for ER, PR, HER2, and CK5/6. A small number of 5NP tumours from these studies will thus be misclassified core basal tumours. The tumours classified as luminal 1 were also further subdivided according to expression of basal markers into luminal 1, basal marker negative and luminal 1, basal marker positive. 10.1371/journal.pmed.1000279.g001 Figure 1 Classification of breast cancer subtypes according to IHC marker profile. Statistical Analysis The association between each prognostic marker and subtype and all-cause mortality after diagnosis was investigated using Cox regression stratified by study and adjusted for age at diagnosis, grade, node status, and size of tumour. Ordinal categories of tumour grade and size were treated as continuous variables in all analyses. Age at diagnosis was treated as a categorical variable (<40, 40–49, 50–59, and ≥60 y). In several studies the cases were ascertained after diagnosis (prevalent cases), and this was allowed for in the analysis by setting “time at risk” from the date of diagnosis and “time under observation” on date of study entry. This step produces an unbiased estimate of the hazard ratio provided the proportional hazards assumption is correct [16]. Follow-up was censored on the date of death from any cause, or, if death did not occur, on date last known alive or at 15 y after diagnosis, whichever came first. The Cox proportional hazards model assumes that the hazard ratio is constant over time. This assumption is known to be violated for ER [14]–[16] and over prolonged follow-up is also likely to be violated for other predictors. We therefore carried out a conditional relative survival analysis by splitting follow-up time into five different periods—0–2, 2–4, 4–6, 6–10, and 10–15 y after diagnosis—and deriving Cox models separately for each period. The Cox proportional hazards assumption was checked for each study period by visual inspection of the standard log-log plots. A test for heterogeneity of the study-specific hazard ratios was carried out using the Mantel-Haenszel method. Kaplan-Meier cumulative survival plots were adjusted for study, age group, tumour grade, tumour size, and node status. In order to provide an overall test of association to compare survival time across all 15 y of follow-up we used multivariate Cox regression models in which the prognostic factors were treated as time-varying covariates. In these models the log hazard ratio varies as a function of the natural logarithm of follow-up time. Models with and without the covariates of interest were then compared using likelihood ratio tests. All analyses were performed in Intercooled Stata, version 10 (Stata Corp). Results Eight studies provided data on ER, PR, HER2, CK5/6, and EGFR with a further four studies providing data on ER, PR, HER2, and CK5/6, but not EGFR. Based on these data, there were 10,159 subjects that could be classified into one of the five major breast subtypes. There were 3,181 deaths in 85,799 person-years of follow-up, with 1,975 deaths from breast cancer. The multivariate, period-specific hazard ratios for age (in four categories), tumour grade, tumour size, node status, and the IHC markers are given in Table 2. These data show that the hazard ratios for all variables except age at diagnosis attenuate over time, and that for ER, PR, HER2, CK5/6, EGFR, and grade the effect changes direction with time. The time-dependent changes were most pronounced for ER and PR status. There was little difference in the hazard ratios for all-cause mortality and breast cancer-specific mortality, except for in the youngest and oldest age groups (Figures S1 and S2). Breast cancer-specific hazard ratios tended to be higher for women diagnosed under the age of 40 y (reference age at diagnosis 50–59 y). In contrast, for age at diagnosis ≥60 y, all-cause mortality hazard ratios were greater, as might be expected because of the impact of mortality from other causes. 10.1371/journal.pmed.1000279.t002 Table 2 Multivariate period-specific all-cause mortality hazard ratios (95% CI). Variable Time after Diagnosis 0–2 y 2–4 y 4–6 y 6–10 y 10–15 y Age at diagnosis (y) <40 0.69 (0.49–0.98) 1.09 (0.87–1.37) 1.14 (0.84–1.55) 0.83 (0.62–1.12) 0.68 (0.44–1.05) 40–49 0.63 (0.48–0.84) 0.77 (0.64–0.93) 0.83 (0.65–1.06) 0.66 (0.53–0.82) 0.51 (0.38–0.68) 50–59 1.00 (ref) 1.00 (ref) 1.00 (ref) 1.00 (ref) 1.00 (ref) ≥60 1.74 (1.36–2.22) 1.26 (1.04–1.52) 1.64 (1.31–2.06) 1.79 (1.49–2.14) 2.05 (1.63–2.58) Grade a 1.51 (1.24–1.84) 1.81 (1.59–2.08) 1.37 (1.18–1.60) 1.14 (1.01–1.29) 0.97 (0.83–1.13) Node positive 2.64 (2.12–3.27) 2.42 (2.09–2.82) 1.86 (1.55–2.23) 1.56 (1.35–1.82) 1.40 (1.15–1.70) Tumour size a 1.67 (1.42–1.97) 1.47 (1.31–1.66) 1.43 (1.23–1.66) 1.37 (1.20–1.56) 1.30 (1.09–1.55) ER positive 0.55 (0.42–0.71) 0.76 (0.63–0.91) 1.31 (1.02–1.68) 1.63 (1.29–2.07) 1.24 (0.91–1.69) PR positive 0.36 (0.27–0.47) 0.62 (0.52–0.74) 0.74 (0.6–0.91) 1.04 (0.87–1.23) 1.16 (0.92–1.46) HER2 positive 1.21 (0.95–1.52) 1.50 (1.27–1.78) 1.55 (1.23–1.96) 1.35 (1.07–1.69) 0.96 (0.67–1.37) Basal marker positive 1.33 (1.06–1.68) 1.21 (1.01–1.44) 1.38 (1.08–1.78) 1.06 (0.83–1.35) 0.83 (0.59–1.17) All analyses are stratified by study. a Grade and tumour size are ordinal variables treated as continuous, giving hazard ratios per unit increase in score. There were 7,882 luminal tumours (78% of total). Of these, 7,243 (92%) were luminal 1 and 639 (8%) were luminal 2. There were 632 tumours of the nonluminal HER2-positive subtype (6% of total), and 1,645 TNP tumours (16% of total). Of the TNP tumours, 962 were CBP (58%) and 683 basal-negative tumours (42%). The number of tumours by the five major subtypes for each study are shown in Table 3. In addition to the five main subtypes, we subdivided the luminal 1 tumours according to expression of basal markers, with 562 (8%) being basal marker positive and 6,119 (92%) being basal marker negative (Table S2 shows the luminal 1 subgroups by study). Table 4 shows the characteristics of the five major breast cancer subtypes by age at diagnosis, tumour grade, tumour size, and node status. 10.1371/journal.pmed.1000279.t003 Table 3 Number of tumours by subtype and study. Study Luminal 1 Luminal 2 Nonluminal HER2+ CBP 5NP Total n Percent n Percent n Percent n Percent n Percent ABCS 497 67 64 9 51 7 60 8 68 9 740 BCCA 2,378 71 206 6 238 7 317 9 209 6 3,348 HEBCS 169 72 25 11 8 3 21 9 13 6 236 JGH 160 77 18 9 5 2 21 10 3 1 207 MCBCS 219 86 24 9 4 2 8 3 1 <1 256 MCCS 276 72 22 6 30 8 37 10 17 4 382 NOBCS 1,051 71 44 3 71 5 196 13 108 7 1,470 PBCS 694 69 35 3 67 7 137 14 75 7 1,008 SBCS 206 77 16 6 10 4 14 5 21 8 267 SEARCH 1,247 76 121 7 71 4 112 7 83 5 1,634 UBC 154 42 53 15 62 17 15 4 81 22 365 VGH 192 78 11 4 15 6 24 10 4 2 246 Total 7,243 71 639 6 632 6 962 9 683 7 10,159 ABCS, Amsterdam Breast Cancer Study; BCCA, British Columbia Cancer Agency; HEBCS, Helsinki Breast Cancer Study; JGH, Jewish General Hospital; MCBCS, Mayo Clinic Breast Cancer Study; MCCS, Melbourne Collaborative Cohort Study; NOBCS, Nottingham Breast Cancer Case Series; PBCS, Polish Breast Cancer Study; SBCS, Sheffield Breast Cancer Study; SEARCH, Study of Epidemiology and Risk factors in Cancer Heredity; UBCBCT, University of British Columbia Breast Cancer Trials; VGH, Vancouver General Hospital. 10.1371/journal.pmed.1000279.t004 Table 4 Characteristics of breast cancer subtypes by age at diagnosis, tumour grade, tumour size, and node status. Breast Cancer Subtype Characteristics Luminal 1 Luminal 2 HER2-enriched CBP 5NP Total n Percent n Percent n Percent n Percent n Percent n Percent Vital status at censoring Alive 5,242 72 369 58 333 53 590 61 444 65 6,978 69 Dead 2,001 28 270 42 299 47 372 39 239 35 3,181 31 Age group (y) <40 457 6 74 12 80 13 165 17 90 13 866 9 40–49 1,960 27 215 34 190 30 286 30 237 35 2,888 28 50–59 3,142 43 233 36 268 42 377 39 238 35 4,258 42 ≥60 1,684 23 117 18 94 15 134 14 118 17 2,147 21 Tumour grade 1 1,493 21 41 6 20 2 15 3 40 6 1,609 16 2 3,645 50 239 37 146 23 129 13 174 25 4,333 42 3 2,105 29 359 56 466 73 818 85 469 69 4,217 42 Node status Negative 4,229 58 278 44 267 42 577 60 367 54 5,718 56 Positive 3,014 42 361 56 365 58 385 40 316 46 4,441 44 Tumour size <2 cm 4,441 61 300 47 272 43 442 46 296 43 5,751 56 2–4.9 cm 2,580 36 306 48 318 50 468 49 336 49 4,008 39 ≥5 cm 222 3 33 5 42 7 52 5 51 7 402 4 The hazard ratios over time for the five subtypes of breast cancer, stratified by study and adjusted for grade, tumour size, and node status, are shown in Figure 2. There was little evidence for heterogeneity of effects by study for these hazard ratios except for the 5NP tumours (Table S3). Figure 2 shows that, compared to the luminal 1 tumours, luminal 2 tumours are associated with a slightly poorer prognosis in the first few years after diagnosis, but that the difference reduces with time, and by 8 y after diagnosis there is no difference between the two. In contrast the mortality for women with the HER2-enriched and both types of TNP tumours (CBP and 5NP) is substantially greater than that for women with the luminal 1 tumours immediately after diagnosis, but the difference declines rapidly and reverses at 5–10 y after diagnosis. These patterns reflect the time-dependent changes in mortality rates in the different subgroups (Figure S3). Within the TNP subgroup, the women with CBP tumours have a slightly poorer prognosis than women with the 5NP tumours. This difference declines slightly over time and by 8 y after diagnosis, no difference is observed. A similar pattern is seen for the luminal 1, basal-positive tumours when compared to the luminal 1, basal-negative tumours. We repeated the analyses using breast cancer-specific mortality as the end point (Figure S4). The hazard ratio estimates tended to be greater (for hazard ratios greater than unity) than the all-cause mortality hazard ratios, but the confidence intervals were somewhat wider. 10.1371/journal.pmed.1000279.g002 Figure 2 Period-specific hazard ratios (all-cause mortality) for major breast cancer subtypes. All hazard ratios are stratified by study and adjusted for tumour grade, tumour size, and node status. The Kaplan-Meier cumulative survival for the three luminal subtypes adjusted for study, grade, tumour size, and node status is shown in Figure 3A. This result shows that the cumulative survival for the luminal 1 subtypes declines almost linearly over time, which is compatible with a constant mortality rate. In contrast, the mortality rate in women with the luminal 2 tumours tends to flatten out over time as the high mortality in the first few years after diagnosis declines. It also clearly shows the poorer prognosis for the luminal 1 tumours that are basal marker positive. The survival curves associated with nonluminal HER2-positive, CBP, and 5NP tumours all show a similar pattern to that of the luminal 2 tumours (Figure 3B). There were significant differences in prognosis between all pairs of subtypes apart from the nonluminal HER2-positive tumours compared with the CBP tumours (Table S4). Of particular note is the difference between the CBP and 5NP tumours (p = 0.0008). The luminal, HER2-positive tumours and the nonluminal, HER2-positive tumours are two distinct subgroups, with the nonluminal tumours having a poorer prognosis (p<0.0001), and the CBP tumours having a poorer prognosis than the luminal, basal-positive tumours (p<0.0001). These differences did not depend on whether or not the patient had been treated with either adjuvant hormone therapy or adjuvant chemotherapy (Figure S5). In contrast, the basal markers seem to have no prognostic significance within the HER2 positive subtypes of disease (p = 0.85). 10.1371/journal.pmed.1000279.g003 Figure 3 (A and B) Kaplan-Meier cumulative survival (all-cause mortality) in luminal and nonluminal tumours by subtype. All curves are adjusted for age at diagnosis, tumour grade, tumour size, node status, and study. The luminal, HER2-positive tumours and the nonluminal, HER2-positive tumours represent two distinct subgroups, as do the ER-positive/negative tumours that are basal positive. In both cases the ER-negative tumours have a poorer prognosis in the first few years after diagnosis, but after 5 to 10 y it is the ER-positive tumours that have the poorer outcome (Figure S6). In contrast, the basal markers seem to have no prognostic significance within the HER2-positive subtypes of disease (unpublished data). Data on the association between the major subtypes and prognosis have previously been published for three of the studies included in this analysis—BCCA, JGH, and VGH—and it is possible that the effect estimates that we report here are subject to publication bias. We therefore repeated all the analyses after excluding the data for these three studies but there was little difference in the results (see Figure S7). Discussion We evaluated the prognostic significance of five previously described major subtypes of breast cancer that were classified using five IHC markers. To our knowledge, this study represents one of the largest datasets analysed for prognosis research in breast cancer using IHC markers. Our data confirm the observations of others that the pattern of survival in ER-positive tumours is qualitatively different to that in ER-negative tumours. In ER-positive tumours, the mortality rate is approximately constant over time since diagnosis, whereas the mortality rate associated with ER-negative disease is initially high and then progressively declines over time. However, the pattern of mortality rates associated with the HER2-positive subgroup of ER-positive tumours (luminal 2) is similar to those of the nonluminal subtypes (Figure 3A). Berry et al. suggest [14] that the pattern of mortality after diagnosis associated with ER-positive tumours is mainly an effect of treatment with adjuvant hormone therapy and that the pattern of mortality in women not treated with adjuvant hormone therapy is similar to that in women with ER-negative disease. The pattern of mortality in women with luminal 1 tumours and treated with adjuvant hormone therapy was similar to those who did not receive hormone therapy (Figure S3). This result implies that the time-dependent effects we observed are not simply the result of adjuvant hormone therapy in a subset of the women with ER-positive tumours. Few of the participants with HER2-positive tumours in this study would have been treated with trastuzumab and so the prognosis in women with these tumours would not reflect the benefit of targeted therapy. Instead we propose that the survival patterns reflect the underlying molecular heterogeneity of breast cancer. We have hypothesized that this heterogeneous biology reflects the fact that breast cancers can initiate in different cell types, either breast epithelial stem cells or their progeny (transit amplifying cells or committed differentiated cells) [30]. Furthermore the recognition of the subtype-specific differences in short-term and long-term prognosis will inevitably lead to tailored follow-up programmes after completion of primary therapy. Our data confirm the view that the TNP is not a good proxy for the CBP because the CBP and 5NP tumours are biologically distinct and show different behaviours. The CBP tumours are clearly associated with a poorer prognosis than the 5NP tumours. Currently, chemotherapy remains the only systemic treatment option available for patients with triple negative (CBP and 5NP) tumours. A number of small studies have shown that basal-like cancers defined through gene-expression profiling or immunophenotyping are responsive to chemotherapy regimes [31]–[33]. In addition, the expression of core basal markers such as EGFR, may lead to the application of targeted therapies, with EGFR inhibitors currently under investigation for use in basal-like breast cancers. We have also shown that the expression of basal markers in ER-positive tumours is associated with a poorer prognosis, suggesting that the luminal 1 tumours represent two distinct subtypes, both of which differ in behaviour from the luminal 2 tumours. Overall the prognostic model based on the six subtypes defined by five IHC markers fits significantly better than a model based on three subtypes—ER-positive or PR-positive and HER2-negative, HER2-positive, and triple-negative tumours—defined by the three markers currently in standard clinical practice (likelihood ratio chisq = 54.4, 3 degrees of freedom [df], p<0.0001). One remaining question is whether the 5NP tumours represent a distinct subtype or are just other subtypes that have been misclassified because of assay failure. However, given the pattern of mortality rates over time since diagnosis (Figure S3), it seems unlikely that many of the 5NP tumours are misclassified luminal tumours. If the 5NP tumours were misclassified nonluminal HER2-positive or CBP tumours, we would expect the survival associated with them to be intermediate, whereas the 5NP tumours have a better prognosis than both the other nonluminal subtypes. Furthermore, the prognosis associated with the 5NP is different from each of the other five subtypes and is also different from all the other subtypes combined. Thus it seems likely that the majority of 5NP tumours represent a true distinct subtype, with a small, but unknown, proportion representing misclassification of the other subtypes, Until a marker to positively identify the genuine 5NP subtype has been identified, it will not be possible to separate these two sets of tumours. Our study has several limitations. IHC was carried out in different laboratories using different methods for both staining and scoring and, as a result, some misclassification of tumour subtypes is inevitable. However, it is likely that such error is random with respect to patient outcome. For the analyses of breast cancer-specific mortality, cause of death was obtained from the underlying cause of death as reported on death certificates and may thus be associated with some error. However, any error in ascertaining cause of death is likely to be random with respect to tumour characteristics. Thus, measurement error of either breast cancer subtype, as a result of interlaboratory variability or outcome, is, if anything, likely to result in an underestimate of any true differences between subtypes. The fact that we have found clear differences in subtypes classified by IHC analyses that were carried out in different laboratories, and would therefore be subject to interlaboratory assay result variability, suggests that the markers are robust to interlaboratory variation in their application and therefore suitable for use in routine clinical practice. There is also some nonrandom error as the luminal 1 tumours that express proliferation markers are likely to behave more like luminal 2 tumours [6]. As the luminal 1 tumours were used as the reference category, this misclassification is likely to lead to an underestimation in the true difference between luminal 1 and the other subtypes. Similarly, some of the 768 5NP tumours will be misclassified CBP tumours because data on EGFR were missing. Assuming these data were missing at random, approximately 25 of the 5NP tumours may represent misclassified CBP tumours. However, when the definition of 5NP tumours was restricted to those that were negative for both CK5/6 and EGFR, there was little difference in the hazard ratio estimates (unpublished data. Finally, the effects may also be underestimated because of the nonrandom use of adjuvant chemotherapy. The more aggressive subtypes are more likely to have been treated with chemotherapy, which would result in a reduction in the difference between these groups and the better prognosis subtypes. Data from 12 different studies were used in this analysis. These studies represent different ethnic groups from different regions of the world as well as differences in case ascertainment. Furthermore there were differences in the way that pathology samples were handled, stained, and scored, and the degree of misclassification will vary from study to study. This heterogeneity in study design may weaken the observed associations, and limit the specificity of the conclusions drawn. Nevertheless, the clear differences between the subtypes of breast cancer that we identified, despite the presence of heterogeneity, make the results robust and broaden their generalisability. In conclusion, we have confirmed that six breast cancer subtypes can be robustly classified using five IHC markers. These subtypes behave differently with specific patterns of mortality over time since diagnosis. These characteristics are independent of other clinico-pathological markers of prognosis and independent of systemic therapy received. The classification based on these markers is robust to multiple sources of heterogeneity between studies suggesting that they are suitable for use in routine clinical practice. The incorporation of these markers into prognostic tools such as Adjuvant!Online and the Nottingham Prognostic Index currently used in clinical practice or tools such as PREDICT [34], which was recently developed to enable the incorporation of novel prognostic biomarkers, may be warranted. It is plausible that these markers are predictive and that different subtypes respond differently to specific treatments, and the evaluation of subtype-specific responses in the context of clinical trials of specific treatments is urgently required. Given that these subtypes can easily be defined using robust IHC markers in archival material, this type of analysis should be possible with existing clinical trial data. Supporting Information Figure S1 Comparison of multivariate, period-specific hazard ratios for age group, tumour grade, and node status based on all-cause and breast-specific mortality. Left-hand panel are results for all-cause mortality and right-hand panels results for breast-specific mortality. Tumour size was treated as an ordinal variable in the Cox regression models and so the hazard ratios represent the hazard ratio for a unit change in the variable. (1.29 MB EPS) Click here for additional data file. Figure S2 Comparison of multivariate, period-specific hazard ratios for tumour size, ER, PR, HER2, and basal marker status based on all-cause and breast-specific mortality. Left-hand panel are results for all cause mortality and right-hand panels results for breast specific mortality. Tumour size was treated as ordinal variables in the Cox regression models and so the hazard ratios represent the hazard ratio for a unit change in the variable. (1.26 MB EPS) Click here for additional data file. Figure S3 Breast cancer-specific mortality by subtype and time since diagnosis. (0.65 MB EPS) Click here for additional data file. Figure S4 Period-specific hazard ratios (breast-specific mortality) for major breast cancer subtypes. All hazard ratios are stratified by study and adjusted for tumour grade, tumour size, and node status. (1.01 MB EPS) Click here for additional data file. Figure S5 Kaplan-Meier cumulative survival in luminal and nonluminal tumours by subtype and by treatment with adjuvant hormone therapy and adjuvant chemotherapy. All curves are adjusted for age at diagnosis, tumour grade, tumour size, node status, and study. (2.18 MB EPS) Click here for additional data file. Figure S6 Period-specific hazard ratios for ER-negative versus ER-positive disease stratified by HER2 status and basal marker status. All hazard ratios are adjusted for age at diagnosis, tumour grade, tumour size, and node status and stratified by study. (0.81 MB EPS) Click here for additional data file. Figure S7 Comparison of period- and subtype-specific hazard ratios (all-cause mortality) for all data and for subset of data after excluding published studies. Left-hand panels show results based on all data (as shown in Figure 1) and right-hand panels show equivalent hazard ratios after exclusion of data from BCCA, JGH, and VGH. (1.24 MB EPS) Click here for additional data file. Table S1 Methods used for IHC analysis by study. (0.10 MB DOC) Click here for additional data file. Table S2 Classification of luminal 1 tumours by basal marker expression. (0.04 MB DOC) Click here for additional data file. Table S3 p-Values for test for heterogeneity of period-specific hazard ratio estimates (compared to luminal 1 tumours) by study. (0.03 MB DOC) Click here for additional data file. Table S4 Likelihood ratio test statistic (2 degrees of freedom) and p-value for comparison of 15-y all-cause mortality between each subtype pair. (0.04 MB DOC) Click here for additional data file.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Strong Time Dependence of the 76-Gene Prognostic Signature for Node-Negative Breast Cancer Patients in the TRANSBIG Multicenter Independent Validation Series

              Recently, a 76-gene prognostic signature able to predict distant metastases in lymph node-negative (N(-)) breast cancer patients was reported. The aims of this study conducted by TRANSBIG were to independently validate these results and to compare the outcome with clinical risk assessment. Gene expression profiling of frozen samples from 198 N(-) systemically untreated patients was done at the Bordet Institute, blinded to clinical data and independent of Veridex. Genomic risk was defined by Veridex, blinded to clinical data. Survival analyses, done by an independent statistician, were done with the genomic risk and adjusted for the clinical risk, defined by Adjuvant! Online. The actual 5- and 10-year time to distant metastasis were 98% (88-100%) and 94% (83-98%), respectively, for the good profile group and 76% (68-82%) and 73% (65-79%), respectively, for the poor profile group. The actual 5- and 10-year overall survival were 98% (88-100%) and 87% (73-94%), respectively, for the good profile group and 84% (77-89%) and 72% (63-78%), respectively, for the poor profile group. We observed a strong time dependence of this signature, leading to an adjusted hazard ratio of 13.58 (1.85-99.63) and 8.20 (1.10-60.90) at 5 years and 5.11 (1.57-16.67) and 2.55 (1.07-6.10) at 10 years for time to distant metastasis and overall survival, respectively. This independent validation confirmed the performance of the 76-gene signature and adds to the growing evidence that gene expression signatures are of clinical relevance, especially for identifying patients at high risk of early distant metastases.
                Bookmark

                Author and article information

                Journal
                0410462
                6011
                Nature
                Nature
                Nature
                0028-0836
                1476-4687
                10 February 2019
                13 March 2019
                March 2019
                13 September 2019
                : 567
                : 7748
                : 399-404
                Affiliations
                [1 ] Cancer Research UK Cambridge Institute and Department of Oncology, Li Ka Shing Centre, University of Cambridge, Robinson Way, Cambridge CB2 0RE, UK
                [2 ] Cancer Research UK Cambridge Cancer Centre, Department of Oncology, Li Ka Shing Centre, University of Cambridge, Robinson Way, Cambridge CB2 0RE, UK
                [3 ] Department of Medicine, Division of Oncology, Stanford University School of Medicine, Stanford, California, USA
                [4 ] Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
                [5 ] Stanford Cancer Institute, Stanford University School of Medicine, Stanford, California, USA
                [6 ] Cambridge Breast Unit, Addenbrooke’s Hospital, Cambridge University Hospital NHS Foundation Trust. Cambridge CB2 2QQ, UK
                [7 ] NIHR Cambridge Biomedical Research Centre and Cambridge Experimental Cancer Medicine Centre, Cambridge University Hospitals NHS, Hills Road, Cambridge CB2 0QQ, UK
                [8 ] Research Institute in Oncology and Hematology, 675 McDermot Avenue, Winnipeg, Manitoba, Canada R3E 0V9
                [9 ] NIHR Comprehensive Biomedical Research Centre at Guy’s and St Thomas’ NHS Foundation Trust and Research Oncology, Cancer Division, King’s College London, London SE1 9RT, UK
                [10 ] Department of Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, British Columbia, Canada V5Z 1L3
                [11 ] Division of Cancer and Stem Cells, School of Medicine, University of Nottingham and Nottingham University Hospital NHS Trust, Nottingham NG5 1PB, UK.
                [12 ] Strangeways Research Laboratory, University of Cambridge, 2 Worts’ Causeway, Cambridge CB1 8RN, UK
                [13 ] Dpto. de Estadística e Investigación Operativa. Universidad de Valladolid. Facultad de Ciencias. Paseo de Belen 7, 47011, Valladolid, Spain
                Author notes
                [+]

                These authors contributed equally to this work.

                Author Contributions

                O.M.R. and C.C. conceived of the study. O.M.R. performed statistical analysis and implemented the model. J.A.S. compiled the validation cohort and performed statistical analyses. S.J.S. led the annotation of clinical samples with input from S.F.C., M.C., R.B., B.P., A.B., H.A., E.P., B.L., M.P., C.G., S.M., A.R.G., L.M., A.P., I.O.E., S.A.A. and Ca.C. A.R.G., L.M., A.P., I.O.E., S.A.A. and Ca.C provided data. P.D.P and C.R provided statistical advice. Ca.C and S.A.A. are METABRIC PIs. O.M.R., J.A.S., J.L.C., Ca.C. and C.C. interpreted the results. O.M.R., J.L.C. and C.C. wrote the manuscript, which was approved by all authors. Ca.C. and C.C. supervised the study.

                [* ]Corresponding authors: Carlos Caldas, Cancer Research UK Cambridge Institute and Department of Oncology, Li Ka Shing Centre, University of Cambridge, Robinson Way, Cambridge CB2 0RE, UK. Carlos.Caldas@ 123456cruk.cam.ac.uk , Christina Curtis, Stanford University School of Medicine, 265 Campus Drive, Lorry Lokey Building Suite G2120C, Stanford, CA 94305. USA. cncurtis@ 123456stanford.edu
                Article
                NIHMS1520488
                10.1038/s41586-019-1007-8
                6647838
                30867590
                c9f8ec95-297f-4119-bcf5-c9dcb8706c4f

                Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms

                History
                Categories
                Article

                Uncategorized
                Uncategorized

                Comments

                Comment on this article