12
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Principles of QSAR Modeling : Comments and Suggestions From Personal Experience

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          At the end of her academic career, the author summarizes the main aspects of QSAR modeling, giving comments and suggestions according to her 23 years' experience in QSAR research on environmental topics. The focus is mainly on Multiple Linear Regression, particularly Ordinary Least Squares, using a Genetic Algorithm for variable selection from various theoretical molecular descriptors, but the comments can be useful also for other QSAR methods. The need for rigorous validation, also external, and for applicability domain check to guarantee predictivity and reliability of QSAR models is particularly highlighted. The commented approach is the “predictive” one, based on chemometrics, and is usefully applied to the prioritization of environmental pollutants. All the discussed points and the author's ideas are implemented in the software QSARINS, as a legacy to the QSAR community.

          Related collections

          Most cited references113

          • Record: found
          • Abstract: not found
          • Article: not found

          A Concordance Correlation Coefficient to Evaluate Reproducibility

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found
            Is Open Access

            Root mean square error (RMSE) or mean absolute error (MAE)? – Arguments against avoiding RMSE in the literature

            Both the root mean square error (RMSE) and the mean absolute error (MAE) are regularly employed in model evaluation studies. Willmott and Matsuura (2005) have suggested that the RMSE is not a good indicator of average model performance and might be a misleading indicator of average error, and thus the MAE would be a better metric for that purpose. While some concerns over using RMSE raised by Willmott and Matsuura (2005) and Willmott et al. (2009) are valid, the proposed avoidance of RMSE in favor of MAE is not the solution. Citing the aforementioned papers, many researchers chose MAE over RMSE to present their model evaluation statistics when presenting or adding the RMSE measures could be more beneficial. In this technical note, we demonstrate that the RMSE is not ambiguous in its meaning, contrary to what was claimed by Willmott et al. (2009). The RMSE is more appropriate to represent model performance than the MAE when the error distribution is expected to be Gaussian. In addition, we show that the RMSE satisfies the triangle inequality requirement for a distance metric, whereas Willmott et al. (2009) indicated that the sums-of-squares-based statistics do not satisfy this rule. In the end, we discussed some circumstances where using the RMSE will be more beneficial. However, we do not contend that the RMSE is superior over the MAE. Instead, a combination of metrics, including but certainly not limited to RMSEs and MAEs, are often required to assess model performance.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              The problem of overfitting.

                Bookmark

                Author and article information

                Contributors
                (View ORCID Profile)
                Journal
                International Journal of Quantitative Structure-Property Relationships
                IGI Global
                2379-7487
                2379-7479
                July 2020
                July 2020
                : 5
                : 3
                : 61-97
                Affiliations
                [1 ]University of Insubria, Varese, Italy
                Article
                10.4018/IJQSPR.20200701.oa1
                4748cf51-d811-43af-87f6-d4bbcc8540bf
                © 2020
                History

                Quantitative & Systems biology,Biophysics
                Quantitative & Systems biology, Biophysics

                Comments

                Comment on this article