Show simple item record

dc.contributor.authorZhou, Shang-Ming
dc.contributor.authorFernandez-Gutierrez, F
dc.contributor.authorKennedy, J
dc.contributor.authorCooksey, R
dc.contributor.authorAtkinson, M
dc.contributor.authorDenaxas, S
dc.contributor.authorSiebert, S
dc.contributor.authorDixon, WG
dc.contributor.authorO’Neill, TW
dc.contributor.authorChoy, E
dc.contributor.authorSudlow, C
dc.contributor.authorBrophy, S
dc.date.accessioned2021-02-04T22:23:05Z
dc.date.available2021-02-04T22:23:05Z
dc.date.issued2016-05-02
dc.identifier.issn1932-6203
dc.identifier.issn1932-6203
dc.identifier.otherARTN e0154515
dc.identifier.urihttp://hdl.handle.net/10026.1/16855
dc.description.abstract

OBJECTIVES: 1) To use data-driven method to examine clinical codes (risk factors) of a medical condition in primary care electronic health records (EHRs) that can accurately predict a diagnosis of the condition in secondary care EHRs. 2) To develop and validate a disease phenotyping algorithm for rheumatoid arthritis using primary care EHRs. METHODS: This study linked routine primary and secondary care EHRs in Wales, UK. A machine learning based scheme was used to identify patients with rheumatoid arthritis from primary care EHRs via the following steps: i) selection of variables by comparing relative frequencies of Read codes in the primary care dataset associated with disease case compared to non-disease control (disease/non-disease based on the secondary care diagnosis); ii) reduction of predictors/associated variables using a Random Forest method, iii) induction of decision rules from decision tree model. The proposed method was then extensively validated on an independent dataset, and compared for performance with two existing deterministic algorithms for RA which had been developed using expert clinical knowledge. RESULTS: Primary care EHRs were available for 2,238,360 patients over the age of 16 and of these 20,667 were also linked in the secondary care rheumatology clinical system. In the linked dataset, 900 predictors (out of a total of 43,100 variables) in the primary care record were discovered more frequently in those with versus those without RA. These variables were reduced to 37 groups of related clinical codes, which were used to develop a decision tree model. The final algorithm identified 8 predictors related to diagnostic codes for RA, medication codes, such as those for disease modifying anti-rheumatic drugs, and absence of alternative diagnoses such as psoriatic arthritis. The proposed data-driven method performed as well as the expert clinical knowledge based methods. CONCLUSION: Data-driven scheme, such as ensemble machine learning methods, has the potential of identifying the most informative predictors in a cost-effective and rapid way to accurately and reliably classify rheumatoid arthritis or other complex medical conditions in primary care EHRs.

dc.format.extente0154515-e0154515
dc.format.mediumElectronic-eCollection
dc.languageen
dc.language.isoeng
dc.publisherPublic Library of Science (PLoS)
dc.rightsAttribution-NonCommercial 4.0 International
dc.rightsAttribution-NonCommercial 4.0 International
dc.rightsAttribution-NonCommercial 4.0 International
dc.rightsAttribution-NonCommercial 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.subjectAlgorithms
dc.subjectAntirheumatic Agents
dc.subjectArthritis, Rheumatoid
dc.subjectElectronic Health Records
dc.subjectHumans
dc.subjectMachine Learning
dc.subjectPrimary Health Care
dc.titleDefining Disease Phenotypes in Primary Care Electronic Health Records by a Machine Learning Approach: A Case Study in Identifying Rheumatoid Arthritis
dc.typejournal-article
dc.typeJournal Article
dc.typeResearch Support, Non-U.S. Gov't
plymouth.author-urlhttps://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000375674800025&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=11bb513d99f797142bcfeffcc58ea008
plymouth.issue5
plymouth.volume11
plymouth.publication-statusPublished online
plymouth.journalPLOS ONE
dc.identifier.doi10.1371/journal.pone.0154515
plymouth.organisational-group/Plymouth
plymouth.organisational-group/Plymouth/Faculty of Health
plymouth.organisational-group/Plymouth/Faculty of Health/School of Nursing and Midwifery
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA/UoA03 Allied Health Professions, Dentistry, Nursing and Pharmacy
plymouth.organisational-group/Plymouth/Users by role
plymouth.organisational-group/Plymouth/Users by role/Academics
dc.publisher.placeUnited States
dcterms.dateAccepted2016-04-14
dc.identifier.eissn1932-6203
dc.rights.embargoperiodNot known
rioxxterms.funderMedical Research Council
rioxxterms.identifier.projectDECIPHer: Centre for the Development and Evaluation of Complex Interventions for Public Health Improvement
rioxxterms.versionofrecord10.1371/journal.pone.0154515
rioxxterms.licenseref.urihttp://creativecommons.org/licenses/by-nc/4.0/
rioxxterms.licenseref.startdate2016
rioxxterms.typeJournal Article/Review
plymouth.funderDECIPHer: Centre for the Development and Evaluation of Complex Interventions for Public Health Improvement::Medical Research Council


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial 4.0 International
Except where otherwise noted, this item's license is described as Attribution-NonCommercial 4.0 International

All items in PEARL are protected by copyright law.
Author manuscripts deposited to comply with open access mandates are made available in accordance with publisher policies. Please cite only the published version using the details provided on the item record or document. In the absence of an open licence (e.g. Creative Commons), permissions for further reuse of content should be sought from the publisher or author.
Theme by 
Atmire NV