Show simple item record

dc.contributor.authorDalla Valle, Luciana
dc.contributor.authorKenett, R
dc.date.accessioned2017-12-26T14:59:09Z
dc.date.available2017-12-26T14:59:09Z
dc.date.issued2018-11-30
dc.identifier.issn0957-4174
dc.identifier.issn1873-6793
dc.identifier.urihttp://hdl.handle.net/10026.1/10453
dc.description.abstract

In recent years, the growing availability of huge amounts of information, generated in every sector at high speed and in a wide variety of forms and formats, is unprecedented. The ability to harness big data is an opportunity to obtain more accurate analyses and to improve decision-making in industry, government and many other organizations. However, handling big data may be challenging and proper data integration is a key dimension in achieving high information quality. In this paper, we propose a novel approach to data integration that calibrates online generated big data with interview based customer survey data. A common issue of customer surveys is that responses are often overly positive, making it difficult to identify areas of weaknesses in organizations. On the other hand, online reviews are often overly negative, hampering an accurate evaluation of areas of excellence. The proposed methodology calibrates the levels of unbalanced responses in different data sources via resampling and performs data integration using Bayesian Networks to propagate the new re-balanced information. In this paper we show, with a case study example, how the novel data integration approach allows businesses and organizations to get a bias corrected appraisal of the level of satisfaction of their customers. The application is based on the integration of online data of review blogs and customer satisfaction surveys from the San Francisco airport. We illustrate how this integration enhances the information quality of the data analytic work in four of InfoQ dimensions, namely, Data Structure, Data Integration, Temporal Relevance and Chronology of Data and Goal.

dc.format.extent76-90
dc.languageen
dc.language.isoen
dc.publisherElsevier
dc.subjectBayesian networks
dc.subjectCalibration
dc.subjectData integration
dc.subjectSocial media
dc.subjectInformation quality (InfoQ)
dc.subjectResampling techniques
dc.titleSocial Media Big Data Integration: a New Approach Based on Calibration
dc.typejournal-article
dc.typeJournal Article
plymouth.author-urlhttps://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000441491500007&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=11bb513d99f797142bcfeffcc58ea008
plymouth.volume111
plymouth.publication-statusPublished
plymouth.journalExpert Systems with Applications
dc.identifier.doi10.1016/j.eswa.2017.12.044
plymouth.organisational-group/Plymouth
plymouth.organisational-group/Plymouth/Faculty of Science and Engineering
plymouth.organisational-group/Plymouth/Faculty of Science and Engineering/School of Engineering, Computing and Mathematics
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA/EXTENDED UoA 10 - Mathematical Sciences
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA/UoA10 Mathematical Sciences
plymouth.organisational-group/Plymouth/Users by role
plymouth.organisational-group/Plymouth/Users by role/Academics
dcterms.dateAccepted2017-12-24
dc.identifier.eissn1873-6793
dc.rights.embargoperiodNot known
rioxxterms.versionofrecord10.1016/j.eswa.2017.12.044
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserved
rioxxterms.licenseref.startdate2018-11-30
rioxxterms.typeJournal Article/Review


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record


All items in PEARL are protected by copyright law.
Author manuscripts deposited to comply with open access mandates are made available in accordance with publisher policies. Please cite only the published version using the details provided on the item record or document. In the absence of an open licence (e.g. Creative Commons), permissions for further reuse of content should be sought from the publisher or author.
Theme by 
Atmire NV