Show simple item record

dc.contributor.authorHaynes, C
dc.contributor.authorPALOMINO, MARCO
dc.contributor.authorStuart, EJ
dc.contributor.authorViira, D
dc.contributor.authorHannon, F
dc.contributor.authorCrossingham, G
dc.contributor.authorTantam, K
dc.date.accessioned2022-03-25T16:36:44Z
dc.date.available2022-03-25T16:36:44Z
dc.date.issued2022-03-18
dc.identifier.issn2227-7390
dc.identifier.issn2227-7390
dc.identifier.other983
dc.identifier.urihttp://hdl.handle.net/10026.1/18975
dc.description.abstract

<jats:p>Text datasets come in an abundance of shapes, sizes and styles. However, determining what factors limit classification accuracy remains a difficult task which is still the subject of intensive research. Using a challenging UK National Health Service (NHS) dataset, which contains many characteristics known to increase the complexity of classification, we propose an innovative classification pipeline. This pipeline switches between different text pre-processing, scoring and classification techniques during execution. Using this flexible pipeline, a high level of accuracy has been achieved in the classification of a range of datasets, attaining a micro-averaged F1 score of 93.30% on the Reuters-21578 “ApteMod” corpus. An evaluation of this flexible pipeline was carried out using a variety of complex datasets compared against an unsupervised clustering approach. The paper describes how classification accuracy is impacted by an unbalanced category distribution, the rare use of generic terms and the subjective nature of manual human classification.</jats:p>

dc.format.extent983-983
dc.languageen
dc.language.isoen
dc.publisherMDPI AG
dc.subjectNLP
dc.subjectclassification
dc.subjectclustering
dc.subjecttext pre-processing
dc.subjectmachine learning
dc.subjectNational Health Service (NHS)
dc.titleAutomatic Classification of National Health Service Feedback
dc.typejournal-article
dc.typeJournal Article
plymouth.author-urlhttps://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000774106700001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=11bb513d99f797142bcfeffcc58ea008
plymouth.issue6
plymouth.volume10
plymouth.publication-statusPublished online
plymouth.journalMathematics
dc.identifier.doi10.3390/math10060983
plymouth.organisational-group/Plymouth
plymouth.organisational-group/Plymouth/Faculty of Science and Engineering
plymouth.organisational-group/Plymouth/Faculty of Science and Engineering/School of Engineering, Computing and Mathematics
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA/UoA11 Computer Science and Informatics
plymouth.organisational-group/Plymouth/Users by role
plymouth.organisational-group/Plymouth/Users by role/Academics
dcterms.dateAccepted2022-03-16
dc.rights.embargodate2022-3-29
dc.identifier.eissn2227-7390
dc.rights.embargoperiodNot known
rioxxterms.versionofrecord10.3390/math10060983
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserved
rioxxterms.licenseref.startdate2022-03-18
rioxxterms.typeJournal Article/Review
plymouth.funderAGE'IN (Age Independently)::Interreg 2 Seas Mers Zeeën


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record


All items in PEARL are protected by copyright law.
Author manuscripts deposited to comply with open access mandates are made available in accordance with publisher policies. Please cite only the published version using the details provided on the item record or document. In the absence of an open licence (e.g. Creative Commons), permissions for further reuse of content should be sought from the publisher or author.
Theme by 
Atmire NV