17
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Deep learning‐based multi‐omics study reveals the polymolecular phenotypic of diabetic kidney disease

      letter

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Dear Editor, Approximately 30% to 40% of patients with type 2 diabetes mellitus (T2DM) develop diabetic kidney disease (DKD), and most will go on to develop end‐stage renal disease. 1 The presence of kidney disease complicates the management of patients with T2DM. 2 Therefore, identifying biomarkers for the early diagnosis of DKD based on circulating molecular factors associated with physiological alterations in patients with T2DM can effectively reduce and delay the incidence of DKD. We used deep learning (DL) to analyze and process multi‐omics data and establish key molecular characteristics (biomarker panels) that affect the incidence and development of DKD. Based on strict diagnostic inclusion and exclusion criteria, 405 subjects from two centers in China were included in the discovery (n = 105) and test (n = 300) sets and divided into healthy control (HC), T2DM, and DKD groups (Table 1 and Supplementary Materials). TABLE 1 Characteristics of the participants included in the discovery set. Trait HC T2DM DKD p‐Value Demographic data n 35 35 35 Age (years) 64.2 ± 9.4 63.5 ± 5.9 64.5 ± 8.9 >.05 Sex (male/female) 18/17 18/17 17/18 >.05 Blood pressure SBP (mm Hg) 133.9 ± 16.9 139.1 ± 13.7 133.2 ± 13.1 >.05 DBP (mm Hg) 75.7 ± 12.4 81.7 ± 7.0 79.7 ± 10.6 >.05 Blood lipid index TG (mmol/L) 1.2 ± .3 2.5 ± 2.5** 2.3 ± 3.2** <.01 TC (mmol/L) 4.7 ± .6 5.3 ± 1.8 4.8 ± 1.1 >.05 HDL (mmol/L) 1.3 ± .1 1.0 ± .3** 1.0 ± .3** <.01 LDL (mmol/L) 2.8 ± .5 3.9 ± 1.5 3.1 ± .89 >.05 AASI 2.1 ± .6 3.2 ± 1.7** 3.0 ± 1.0** <.01 Diabetes index HbA1c (%) 5.6 ± .3 6.9 ± 1.4** 8.0 ± 1.6** <.01 Glu (mmol/L) 5.4 ± .4 7.5 ± 2.7** 8.7 ± 2.3** <.01 Malb (mg/24 h) N/A N/A 50.8 ± 95.9 History Diabetes history (year) N/A 7.1 ± 5.6 14.6 ± 7.6## <.01 Kidney function Blood urea nitrogen (BUN) (mmol/L) 5.4 ± 1.3 N/A 14.3 ± 44.6* <.05 Serum creatinine (SCR) (μmol/L) 64.7 ± 12.1 N/A 79.3 ± 37.6* <.05 Note: Data are mean ± SD for continuous measures and n for categorical measure. *: Compared with HC (*p < .05; ** p < .01); #: Compared with T2DM (#p < .05; ##p < .01); AASI, Ambulatory Arterial Stiffness Index; N/A, not Applicable; HC, healthy control group; LDL, Low Density Lipoprotein; T2DM, type 2 diabetes mellitus group; TC, Total Cholesterol; TG, Triglyceride; DKD, diabetic kidney disease group. John Wiley & Sons, Ltd. In the discovery set, the combination of lipidomics and data‐independent acquisition quantitative proteomics enabled the discovery of additional potential biomarkers and pathological mechanisms related to the occurrence and development of DKD. Lipidomics revealed that the metabolic profile of the both disease group changed significantly compared to that of HC; however, the metabolic profiles of T2DM and DKD groups were relatively similar (Figure 1A). Using the criteria of variable importance in projection > 1 and p < .05, 70 differential serum metabolites (Table S2) were identified (Figure 1B and Figure S1A). These mainly involved metabolic pathways, such as sphingolipid metabolism, steroid hormone biosynthesis, glycerol phospholipid metabolism and arachidonic acid metabolism (Figure 1C). In addition, the distribution of lipid abundance and lipid classes among the all groups showed that the glycerolipid and glycerophospholipid proportions were the highest. FIGURE 1 Lipidomics and proteomics results of discovery set. (A) Score plots of principal component analysis of each comparison group based on lipidomics modes from independent cohort 1. (B) Venn diagram of 70 differential lipids from three comparison groups in independent cohort 1. (C) Heatmap of 70 differential lipids in independent cohort 1. (D) Score plots of principal component analysis of each comparison group based on proteomics modes from independent cohort 1. (E) Venn diagram of 219 differential proteins from three comparison groups in independent cohort 1. (F) Heatmap of 219 differential proteins in independent cohort 1. Proteomic data showed that protein content may vary depending on the physiological state of the individual (Figure 1D). With fold change (≥ 1.5 or ≤ .67) and p < .05 as screening criteria, 219 differential proteins were quantified (Figure S1B and Figure 1E–F), most of which were highly expressed in the both disease group (Table S3). In addition, the Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analyses of the 219 proteins showed that complement and coagulation cascades, focal adhesions and phagosomes were significantly enriched, revealing that the development of DKD was related to pro‐inflammatory signals (Figures S1C and D). Research is increasingly focusing on applying multi‐omics to identify ‘at‐risk’ profiles. 3 At present, biomarkers for the risk of diabetes progressing to DKD at the single‐molecule level have been identified; however, their diagnostic efficacy is poor. 4 , 5 DKD is a complex secondary disease, and studies on risk markers at multiple molecular levels would be helpful in reflecting disease risk. 6 We used support vector machine and convolutional neural network (CNN) models to evaluate the accuracy of single‐ or multi‐omics and found that the CNN model in multi‐omics showed significant advantages (Table S4), with the highest internal and prediction accuracies (100% and 90.48%, respectively). The neighborhood component analysis algorithm selected 58 fusion features (20%) from the 289 features, including 32 different proteins and 26 different lipids. To reveal the intrinsic association of the 58 fusion features with DKD, Pearson correlation coefficient analysis was performed (Figure 2A). Twelve lipid metabolites showed significant association (R > .5) with 26 differentially expressed proteins (Figure 2B). By plotting the relative abundance of these lipid metabolites, we observed that the vast majority of lipids were significantly enriched in patients with T2DM than those with DKD (Figure 2C) and showed a linear increase with disease progression. A strong positive correlation between trihydroxycoprostanoic acid, Cer (d18:1/16:0), and 3α, 7 α‐dihydroxycoprostanic acid was observed (Figure 2D, R > .85, p < .01). These results suggest that DKD‐related proteins are associated with changes in serum lipid metabolite levels. FIGURE 2 Determination and evaluation of diagnostic efficiency of biomarker panel. (A) Ranking of 58 fusion features in multiomics. (B) Pearson correlation of 58 fusion features. (C) Heatmap of 12 important differential lipid molecular features. (D) Pearson correlation analysis between 3a,7a‐dihydroxycoprostanic acid, cer(d18:1/16:0) and trihydroxycoprostanoic. (E) Mean decrease accuracy scores of eight key features. (F) Determination of biomarker panel by receiver operating characteristic (ROC) curve; (G) ROC curve analysis of independent cohort 2‐based biomarker panel (T2DM vs. DKD and HC vs. DKD); (H) ROC curve analysis of independent cohort 1‐based serum creatinine (SCR) and blood urea nitrogen (BUN). In the test set, four lipid metabolites and four proteins in the 58 fusion features showed similar trends and content changes as that in the discovery set (Tables S5 and 6). Recently, several clinical histological studies have focused on the concept of “biomarker panel”. 2 , 7 , 8 Based on the above results, we selected 3α, 7α‐dihydroxycoprostanic acid and Cer (d18:1/16:0) with an absolute high contribution to draw the receiver operating characteristic curve, with an area under the curve (AUC) of .800 (95% confidence interval [CI]: .698–.902), to establish the diagnostic distinction between T2DM and DKD (Figure 2E). Subsequently, the remaining six substances were added to obtain the best biomarker panel to predict the development of DKD, which was composed of 3α, 7α ‐dihydroxycoprostanic acid, Cer (d18:1/16:0), cyclase‐associated protein 1 (CAP1) and talin‐1 (TLN1) (AUC = .873; 95% CI: .794–.951) (Figure 2F and S2A–B). We applied the obtained biomarker panel to the discovery (AUC = .838, 95% CI: .726–.950) and test sets (AUC = .938, 95% CI: .8670–1.000) that showed a strong diagnostic ability far higher than serum creatinine (SCR) (AUC = .620, 95% CI: .485–.755), and blood urea nitrogen (BUN) (AUC = .638, 95% CI: .506–.770) (Figure 2G and H). We found that the two lipid metabolites, Cer (d18:1/16:0) and 3α, 7α‐dihydroxycoprostanic acid, had prominent and robust positive correlations with hemoglobin A1c and glucose levels (Figures S2C). In addition, the positive correlations between CAP1, TLN1, SCR and BUN were stronger than those between the two lipid metabolites (Figures S2D). Furthermore, all four markers were positively correlated with a history of diabetes to varying degrees, with the two lipid metabolites being particularly significant (Figures S2E). This emphasizes the complementary nature and importance of a biomarker panel. In conclusion, this study combined multiple bioinformatic tools and learning algorithms to synthetically identify the optimal diagnosis of a disease biomarker panel. Our findings provide insights for the integrated modelling of multi‐omics data and new research opportunities for T2DM complications. Furthermore, the combined use of two powerful histological techniques, lipidomics and proteomics, provided a comprehensive understanding of this disease. 9 , 10 The advent of DL will enable the handling of large amounts of high‐dimensional and complex‐structured data, further enabling the identification of key metabolic features. LIMITATIONS This study used training models from small populations to validate large cohorts because of complications such as sample collection and time constraints, which may have resulted in some features being neglected. Therefore, in future studies, attention should be paid to the cohort settings (usually 8:1 to 4:1). CONFLICT OF INTEREST STATEMENT The authors declare no conflict of interest. Supporting information Supporting Information Click here for additional data file. Supporting Information Click here for additional data file. Supporting Information Click here for additional data file.

          Related collections

          Most cited references10

          • Record: found
          • Abstract: found
          • Article: not found

          Diabetic Kidney Disease: Challenges, Progress, and Possibilities.

          Diabetic kidney disease develops in approximately 40% of patients who are diabetic and is the leading cause of CKD worldwide. Although ESRD may be the most recognizable consequence of diabetic kidney disease, the majority of patients actually die from cardiovascular diseases and infections before needing kidney replacement therapy. The natural history of diabetic kidney disease includes glomerular hyperfiltration, progressive albuminuria, declining GFR, and ultimately, ESRD. Metabolic changes associated with diabetes lead to glomerular hypertrophy, glomerulosclerosis, and tubulointerstitial inflammation and fibrosis. Despite current therapies, there is large residual risk of diabetic kidney disease onset and progression. Therefore, widespread innovation is urgently needed to improve health outcomes for patients with diabetic kidney disease. Achieving this goal will require characterization of new biomarkers, designing clinical trials that evaluate clinically pertinent end points, and development of therapeutic agents targeting kidney-specific disease mechanisms (e.g., glomerular hyperfiltration, inflammation, and fibrosis). Additionally, greater attention to dissemination and implementation of best practices is needed in both clinical and community settings.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A guide to deep learning in healthcare

            Here we present deep-learning techniques for healthcare, centering our discussion on deep learning in computer vision, natural language processing, reinforcement learning, and generalized methods. We describe how these computational techniques can impact a few key areas of medicine and explore how to build end-to-end systems. Our discussion of computer vision focuses largely on medical imaging, and we describe the application of natural language processing to domains such as electronic health record data. Similarly, reinforcement learning is discussed in the context of robotic-assisted surgery, and generalized deep-learning methods for genomics are reviewed.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Circulating TNF receptors 1 and 2 predict ESRD in type 2 diabetes.

              Levels of proinflammatory cytokines associate with risk for developing type 2 diabetes but whether chronic inflammation contributes to the development of diabetic complications, such as ESRD, is unknown. In the 1990s, we recruited 410 patients with type 2 diabetes for studies of diabetic nephropathy and recorded their characteristics at enrollment. During 12 years of follow-up, 59 patients developed ESRD (17 per 1000 patient-years) and 84 patients died without ESRD (24 per 1000 patient-years). Plasma markers of systemic inflammation, endothelial dysfunction, and the TNF pathway were measured in the study entry samples. Of the examined markers, only TNF receptors 1 and 2 (TNFR1 and TNFR2) associated with risk for ESRD. These two markers were highly correlated, but ESRD associated more strongly with TNFR1. The cumulative incidence of ESRD for patients in the highest TNFR1 quartile was 54% after 12 years but only 3% for the other quartiles (P<0.001). In Cox proportional hazard analyses, TNFR1 predicted risk for ESRD even after adjustment for clinical covariates such as urinary albumin excretion. Plasma concentration of TNFR1 outperformed all tested clinical variables with regard to predicting ESRD. Concentrations of TNFRs moderately associated with death unrelated to ESRD. In conclusion, elevated concentrations of circulating TNFRs in patients with type 2 diabetes at baseline are very strong predictors of the subsequent progression to ESRD in subjects with and without proteinuria.
                Bookmark

                Author and article information

                Contributors
                yaowufenxi001@sina.com
                Journal
                Clin Transl Med
                Clin Transl Med
                10.1002/(ISSN)2001-1326
                CTM2
                Clinical and Translational Medicine
                John Wiley and Sons Inc. (Hoboken )
                2001-1326
                08 June 2023
                June 2023
                : 13
                : 6 ( doiID: 10.1002/ctm2.v13.6 )
                : e1301
                Affiliations
                [ 1 ] State Key Laboratory of Component‐based Chinese Medicine Tianjin University of Traditional Chinese Medicine Tianjin China
                [ 2 ] Henan Key Laboratory of Rare Diseases Endocrinology and Metabolism Center The First Affiliated Hospital and College of Clinical Medicine of Henan University of Science and Technology Luoyang China
                [ 3 ] School of Chinese Materia Medica Beijing University of Chinese Medicine, Liangxiang Town, Fangshan District Beijing China
                [ 4 ] Second Hospital of Tianjin Medical University Tianjin China
                [ 5 ] Department of Endocrinology and Nephrology PKU Care CNOOC Hospital Tianjin China
                Author notes
                [*] [* ] Correspondence

                Yubo Li, State Key Laboratory of Component‐based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin, China.

                Email: yaowufenxi001@ 123456sina.com

                [#]

                Huan Zhao, Yu Yuan, Siyu Chen and Yaqi Yao contributed equally to this work.

                Author information
                https://orcid.org/0000-0003-0455-0969
                Article
                CTM21301
                10.1002/ctm2.1301
                10248822
                6ae962a1-b9a5-425b-a21e-52768092f87e
                © 2023 The Authors. Clinical and Translational Medicine published by John Wiley & Sons Australia, Ltd on behalf of Shanghai Institute of Clinical Bioinformatics.

                This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

                History
                : 22 May 2023
                : 17 February 2023
                : 28 May 2023
                Page count
                Figures: 2, Tables: 1, Pages: 5, Words: 1886
                Categories
                Letter to the Editor
                Letter to the Editor
                Custom metadata
                2.0
                June 2023
                Converter:WILEY_ML3GV2_TO_JATSPMC version:6.2.9 mode:remove_FC converted:08.06.2023

                Medicine
                Medicine

                Comments

                Comment on this article