16
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Sequential stacking link prediction algorithms for temporal networks

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Link prediction algorithms are indispensable tools in many scientific applications by speeding up network data collection and imputing missing connections. However, in many systems, links change over time and it remains unclear how to optimally exploit such temporal information for link predictions in such networks. Here, we show that many temporal topological features, in addition to having high computational cost, are less accurate in temporal link prediction than sequentially stacked static network features. This sequential stacking link prediction method uses 41 static network features that avoid detailed feature engineering choices and is capable of learning a highly accurate predictive distribution of future connections from historical data. We demonstrate that this algorithm works well for both partially observed and completely unobserved target layers, and on two temporal stochastic block models achieves near-oracle-level performance when combined with other single predictor methods as an ensemble learning method. Finally, we empirically illustrate that stacking multiple predictive methods together further improves performance on 19 real-world temporal networks from different domains.

          Abstract

          Link prediction in temporal networks is relevant for many real-world systems, however, current approaches are usually characterized by high computational costs. The authors propose a temporal link prediction framework based on the sequential stacking of static network features, for improved computational speed, appropriate for temporal networks with completely unobserved or partially observed target layers.

          Related collections

          Most cited references49

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The meaning and use of the area under a receiver operating characteristic (ROC) curve.

            A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented. It is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a randomly chosen non-diseased subject. Moreover, this probability of a correct ranking is the same quantity that is estimated by the already well-studied nonparametric Wilcoxon statistic. These two relationships are exploited to (a) provide rapid closed-form expressions for the approximate magnitude of the sampling variability, i.e., standard error that one uses to accompany the area under a smoothed ROC curve, (b) guide in determining the size of the sample required to provide a sufficiently reliable estimate of this area, and (c) determine how large sample sizes should be to ensure that one can statistically detect differences in the accuracy of diagnostic techniques.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              An introduction to ROC analysis

                Bookmark

                Author and article information

                Contributors
                peter.j.mucha@dartmouth.edu
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                14 February 2024
                14 February 2024
                2024
                : 15
                : 1364
                Affiliations
                [1 ]Department of Mathematics, Dartmouth College, ( https://ror.org/049s0rh22) Hanover, NH USA
                [2 ]Yale Institute for Network Science, Yale University, ( https://ror.org/03v76x132) New Haven, CT USA
                [3 ]Department of Scientific Computing, Pukyong National University, ( https://ror.org/0433kqc49) Busan, South Korea
                [4 ]Department of Computer Science, University of Colorado, ( https://ror.org/02ttsq026) Boulder, CO USA
                [5 ]BioFrontiers Institute, University of Colorado, Boulder, ( https://ror.org/02ttsq026) Boulder, CO USA
                [6 ]Santa Fe Institute, ( https://ror.org/01arysc35) Santa Fe, NM USA
                Author information
                http://orcid.org/0000-0002-4136-9408
                http://orcid.org/0000-0002-3515-3504
                http://orcid.org/0000-0003-3860-3213
                http://orcid.org/0000-0002-3529-8746
                http://orcid.org/0000-0002-0648-7230
                Article
                45598
                10.1038/s41467-024-45598-0
                10866871
                38355612
                e5861d2b-43de-4090-8f12-dfab58cbdd67
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 31 January 2023
                : 29 January 2024
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2024

                Uncategorized
                computational science,applied mathematics,software,interdisciplinary studies
                Uncategorized
                computational science, applied mathematics, software, interdisciplinary studies

                Comments

                Comment on this article