7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Deep Learning in Mining Biological Data

      review-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Recent technological advancements in data acquisition tools allowed life scientists to acquire multimodal data from different biological application domains. Categorized in three broad types (i.e. images, signals, and sequences), these data are huge in amount and complex in nature. Mining such enormous amount of data for pattern recognition is a big challenge and requires sophisticated data-intensive machine learning techniques. Artificial neural network-based learning systems are well known for their pattern recognition capabilities, and lately their deep architectures—known as deep learning (DL)—have been successfully applied to solve many complex pattern recognition problems. To investigate how DL—especially its different architectures—has contributed and been utilized in the mining of biological data pertaining to those three types, a meta-analysis has been performed and the resulting resources have been critically analysed. Focusing on the use of DL to analyse patterns in data from diverse biological domains, this work investigates different DL architectures’ applications to these data. This is followed by an exploration of available open access data sources pertaining to the three data types along with popular open-source DL tools applicable to these data. Also, comparative investigations of these tools from qualitative, quantitative, and benchmarking perspectives are provided. Finally, some open research challenges in using DL to mine biological data are outlined and a number of possible future perspectives are put forward.

          Related collections

          Most cited references90

          • Record: found
          • Abstract: found
          • Article: not found

          Long Short-Term Memory

          Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Occurrence of the potent mutagens 2- nitrobenzanthrone and 3-nitrobenzanthrone in fine airborne particles

            Polycyclic aromatic compounds (PACs) are known due to their mutagenic activity. Among them, 2-nitrobenzanthrone (2-NBA) and 3-nitrobenzanthrone (3-NBA) are considered as two of the most potent mutagens found in atmospheric particles. In the present study 2-NBA, 3-NBA and selected PAHs and Nitro-PAHs were determined in fine particle samples (PM 2.5) collected in a bus station and an outdoor site. The fuel used by buses was a diesel-biodiesel (96:4) blend and light-duty vehicles run with any ethanol-to-gasoline proportion. The concentrations of 2-NBA and 3-NBA were, on average, under 14.8 µg g−1 and 4.39 µg g−1, respectively. In order to access the main sources and formation routes of these compounds, we performed ternary correlations and multivariate statistical analyses. The main sources for the studied compounds in the bus station were diesel/biodiesel exhaust followed by floor resuspension. In the coastal site, vehicular emission, photochemical formation and wood combustion were the main sources for 2-NBA and 3-NBA as well as the other PACs. Incremental lifetime cancer risk (ILCR) were calculated for both places, which presented low values, showing low cancer risk incidence although the ILCR values for the bus station were around 2.5 times higher than the ILCR from the coastal site.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Deep learning in neural networks: An overview

              In recent years, deep artificial neural networks (including recurrent ones) have won numerous contests in pattern recognition and machine learning. This historical survey compactly summarizes relevant work, much of it from the previous millennium. Shallow and Deep Learners are distinguished by the depth of their credit assignment paths, which are chains of possibly learnable, causal links between actions and effects. I review deep supervised learning (also recapitulating the history of backpropagation), unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.
                Bookmark

                Author and article information

                Contributors
                mufti.mahmud@ntu.ac.uk , muftimahmud@gmail.com
                mskaiser@juniv.edu
                Journal
                Cognit Comput
                Cognit Comput
                Cognitive Computation
                Springer US (New York )
                1866-9956
                1866-9964
                5 January 2021
                5 January 2021
                : 1-33
                Affiliations
                [1 ]GRID grid.12361.37, ISNI 0000 0001 0727 0669, Department of Computer Science, , Nottingham Trent University, ; Clifton, NG11 8NS Nottingham, UK
                [2 ]GRID grid.411808.4, ISNI 0000 0001 0664 5967, Institute of Information Technology, , Jahangirnagar University, ; Savar 1342 Dhaka, Bangladesh
                [3 ]GRID grid.12641.30, ISNI 0000000105519715, Intelligent Systems Research Centre, , Ulster University, ; Northern Ireland BT48 7JL Derry, UK
                [4 ]School of Computing , Edinburgh, EH11 4BN Edinburgh, UK
                [5 ]GRID grid.12361.37, ISNI 0000 0001 0727 0669, Medical Technology Innovation Facility, , Nottingham Trent University, ; NG11 8NS Clifton, Nottingham, UK
                Author information
                http://orcid.org/0000-0002-2037-8348
                Article
                9773
                10.1007/s12559-020-09773-x
                7783296
                33425045
                6bed0589-4590-41ca-a2c3-c57b3894f56c
                © The Author(s) 2020

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 1 May 2020
                : 28 September 2020
                Funding
                Funded by: Nottingham Trent University
                Categories
                Article

                Neurosciences
                brain–machine interfaces,bioimaging,deep learning performance comparison,medical imaging,omics,open access data sources,open-source tools

                Comments

                Comment on this article