24
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Utilizing graph machine learning within drug discovery and development

      review-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Graph machine learning (GML) is receiving growing interest within the pharmaceutical and biotechnology industries for its ability to model biomolecular structures, the functional relationships between them, and integrate multi-omic datasets — amongst other data types. Herein, we present a multidisciplinary academic-industrial review of the topic within the context of drug discovery and development. After introducing key terms and modelling approaches, we move chronologically through the drug development pipeline to identify and summarize work incorporating: target identification, design of small molecules and biologics, and drug repurposing. Whilst the field is still emerging, key milestones including repurposed drugs entering in vivo studies, suggest GML will become a modelling framework of choice within biomedical machine learning.

          Related collections

          Most cited references278

          • Record: found
          • Abstract: found
          • Article: not found

          Deep learning.

          Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets

            Abstract Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein–protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein–protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              The Protein Data Bank.

              The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.
                Bookmark

                Author and article information

                Contributors
                Journal
                Brief Bioinform
                Brief Bioinform
                bib
                Briefings in Bioinformatics
                Oxford University Press
                1467-5463
                1477-4054
                November 2021
                19 May 2021
                19 May 2021
                : 22
                : 6
                : bbab159
                Affiliations
                Relation Therapeutics , London, UK
                Relation Therapeutics , London, UK
                The Computer Laboratory, University of Cambridge , UK
                Relation Therapeutics , London, UK
                The Computer Laboratory, University of Cambridge , UK
                Department of Biochemistry, University of Cambridge , UK
                Relation Therapeutics , London, UK
                Relation Therapeutics , London, UK
                Relation Therapeutics , London, UK
                Relation Therapeutics , London, UK
                Relation Therapeutics , London, UK
                Relation Therapeutics , London, UK
                Juvenescence , London, UK
                Mila, the Quebec AI Institute , Canada
                HEC Montreal , Canada
                Relation Therapeutics , London, UK
                Juvenescence , London, UK
                The Francis Crick Institute , London, UK
                Department of Biochemistry, University of Cambridge , UK
                Relation Therapeutics , London, UK
                Department of Computing, Imperial College London , UK
                Twitter , UK
                Relation Therapeutics , London, UK
                Juvenescence , London, UK
                Author notes
                Corresponding author: Jake P. Taylor-King, Relation Therapeutics, London, UK. Tel.: +44 7387 277904; E-mail: jake@ 123456relationrx.com
                Article
                bbab159
                10.1093/bib/bbab159
                8574649
                34013350
                acb36dde-ace6-4d1b-95d6-eb5d773e3028
                © The Author(s) 2021. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

                History
                : 9 December 2020
                : 1 April 2021
                : 5 April 2021
                Page count
                Pages: 22
                Categories
                Review
                AcademicSubjects/SCI01060

                Bioinformatics & Computational biology
                graph machine learning,drug discovery,drug development

                Comments

                Comment on this article