Document Type

Engineering, Computing and Mathematics Article

Abstract

Social network bots are becoming an ever-greater threat to online users. Most studies carried out have looked at bots which generate a lot of tweets known as spam, as these are very common. In recent years research into the area of bots within Twitter has been carried out using machine learning to attempt to find patterns in these ac-counts to aide with detection. However, limited research has been carried out that focuses on a sub set of Twitter bots which are involved in phishing campaigns which tweet very little to avoid detection. In this project an application was developed that combines a variety of commercial tools with machine learning theory to allow a user to collect and analyse public Twitter data using a neural network. The focus of the project is to try and find patterns in these phishing bots’ properties and to use the data collected to train a neural network to recognise these patterns and detect bots. A Twit-ter crawler was developed that harvests data from the Twitter API and stores it in a graph database. The data is then formatted and normalised by a pre-processor mod-ule which is then fed into a neural network. The neural network evaluates the data and creates predictions based on what it has previously learnt, these predictions are then displayed in a graph format within the browser. Experimental results have shown that there is a pattern in the properties of an account, and tests showed a correlation in the friend to follower ratio of bot accounts. With this pattern and other properties of an account, a neural network has been trained to detect bot accounts, with tests showing the neural being able to make predictions for an account with an accuracy of 92%. Whilst these results are still experimental the project has proven that is it possible to detect bots within Twitter using just the properties of an account.

Publication Date

2017-12-01

First Page

208

Last Page

223

Deposit Date

May 2019

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Recommended Citation

Brake, Christopher James (2017) "A machine learning approach to the classification of phishing bot accounts within Twitter," The Plymouth Student Scientist: Vol. 10: Iss. 2, Article 4.
DOI: https://doi.org/10.24382/gbec-nv09
Available at: https://pearl.plymouth.ac.uk/tpss/vol10/iss2/4

license.txt (5 kB)

Download

COinS

The Plymouth Student Scientist

A machine learning approach to the classification of phishing bot accounts within Twitter

Document Type

Abstract

Publication Date

First Page

Last Page

Deposit Date

Creative Commons License

Recommended Citation

About

Search

The Plymouth Student Scientist

A machine learning approach to the classification of phishing bot accounts within Twitter

Authors

Document Type

Abstract

Publication Date

First Page

Last Page

Deposit Date

Creative Commons License

Recommended Citation

Share

About

Search