5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      ICDXML: enhancing ICD coding with probabilistic label trees and dynamic semantic representations

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Accurately assigning standardized diagnosis and procedure codes from clinical text is crucial for healthcare applications. However, this remains challenging due to the complexity of medical language. This paper proposes a novel model that incorporates extreme multi-label classification tasks to enhance International Classification of Diseases (ICD) coding. The model utilizes deformable convolutional neural networks to fuse representations from hidden layer outputs of pre-trained language models and external medical knowledge embeddings fused using a multimodal approach to provide rich semantic encodings for each code. A probabilistic label tree is constructed based on the hierarchical structure existing in ICD labels to incorporate ontological relationships between ICD codes and enable structured output prediction. Experiments on medical code prediction on the MIMIC-III database demonstrate competitive performance, highlighting the benefits of this technique for robust clinical code assignment.

          Related collections

          Most cited references13

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          MIMIC-III, a freely accessible critical care database

          MIMIC-III (‘Medical Information Mart for Intensive Care’) is a large, single-center database comprising information relating to patients admitted to critical care units at a large tertiary care hospital. Data includes vital signs, medications, laboratory measurements, observations and notes charted by care providers, fluid balance, procedure codes, diagnostic codes, imaging reports, hospital length of stay, survival data, and more. The database supports applications including academic and industrial research, quality improvement initiatives, and higher education coursework.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Medical Subject Headings (MeSH).

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              BioWordVec, improving biomedical word embeddings with subword information and MeSH

              Distributed word representations have become an essential foundation for biomedical natural language processing (BioNLP), text mining and information retrieval. Word embeddings are traditionally computed at the word level from a large corpus of unlabeled text, ignoring the information present in the internal structure of words or any information available in domain specific structured resources such as ontologies. However, such information holds potentials for greatly improving the quality of the word representation, as suggested in some recent studies in the general domain. Here we present BioWordVec: an open set of biomedical word vectors/embeddings that combines subword information from unlabeled biomedical text with a widely-used biomedical controlled vocabulary called Medical Subject Headings (MeSH). We assess both the validity and utility of our generated word embeddings over multiple NLP tasks in the biomedical domain. Our benchmarking results demonstrate that our word embeddings can result in significantly improved performance over the previous state of the art in those challenging tasks.
                Bookmark

                Author and article information

                Contributors
                s.de@surrey.ac.uk
                Journal
                Sci Rep
                Sci Rep
                Scientific Reports
                Nature Publishing Group UK (London )
                2045-2322
                7 August 2024
                7 August 2024
                2024
                : 14
                : 18319
                Affiliations
                [1 ]Department of Computing, Xi’an Jiaotong Liverpool University, ( https://ror.org/03zmrmn05) Suzhou, 21500 China
                [2 ]School of Computer Science and Electronic Engineering, University of Surrey, ( https://ror.org/00ks66431) Surrey, GU2 7XH UK
                [3 ]Department of Computer Science, University of Liverpool, ( https://ror.org/04xs57h96) Liverpool, L69 3BX UK
                [4 ]UCL Social Research Institute, University College London, ( https://ror.org/02jx3x895) London, WC1E 6BT UK
                Author information
                http://orcid.org/0000-0001-7439-6077
                Article
                69214
                10.1038/s41598-024-69214-9
                11306547
                39112791
                15b3e4b8-e433-4fa6-82ef-42f8c3780ca4
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 1 December 2023
                : 1 August 2024
                Funding
                Funded by: FundRef 501100000269, RCUK | Economic and Social Research Council (ESRC);
                Award ID: ES/Z502935/1
                Funded by: FundRef 501100000266, RCUK | Engineering and Physical Sciences Research Council (EPSRC);
                Award ID: EP/W032473/1
                Award Recipient :
                Funded by: FundRef 501100005145, Basic Research Program of Jiangsu Province;
                Award ID: BK20221260
                Award Recipient :
                Funded by: Xi’an Jiaotong-Liverpool University Postgraduate Research Scholarship (contract number PGRS2006013).
                Funded by: FundRef 501100008868, Jiangsu Science and Technology Department;
                Award ID: BK20221260
                Award ID: EP/W032473/1
                Award Recipient :
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2024

                Uncategorized
                natural language processing,icd coding,extreme multi-label classification,few-shot learning,medical knowledge representation,computer science,computational science

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content20

                Most referenced authors180