SARC



Step I

Models trained and used



  1. Binary - SARC: a model trained on the Self-Annotated Reddit Corpus to determine whether text is sarcastic (read more here).
  2. Binary - Netflix Review Model: a sentiment model trained on film and TV show reviews from Netflix.
  3. Binary - Twitter: a sentiment model trained on tweets from Twitter.
  4. Categorical - Stanford Model: a sentiment model trained on data provided by Stanford University.


The SARC Model is a newly trained binary model used to classify text as sarcastic or not sarcastic. More details on the dataset and how it was built can be found here and here. The dataset is rather complex and comprises multiple parts; for the sake of a simpler description, only the most vital parts are discussed here.

First, the balanced training and testing datasets in CSV format were parsed in the same manner to extract the IDs of all the responses to the Reddit comments together with their respective labels (0 for not sarcastic, 1 for sarcastic). With those IDs saved, the main JSON file containing all the metadata was parsed and searched for the saved response IDs. Whenever an ID was found, the text of the response was extracted and labeled according to the previously saved response label. Since the dataset was already divided into training and testing portions, both were simply shuffled and used for their respective purposes. The labels were removed from the testing dataset and saved separately for later evaluation. The dataset was then split into separate files (a format required by our parser) and used for testing the newly trained SARC model as well as the three sentiment models listed above.
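As a rough illustration of this pipeline, the sketch below parses the balanced CSVs for response IDs and labels and then pulls the matching texts out of the metadata JSON. The file names (train-balanced.csv, test-balanced.csv, comments.json), the pipe-separated column layout, and the "text" field are assumptions for the sake of the example, not the exact format of the SARC release.

```python
import csv
import json
import random

def load_id_labels(csv_path):
    """Collect response IDs and their 0/1 sarcasm labels from a balanced CSV.

    Assumed layout: pipe-separated columns, with space-separated response IDs
    in column 1 and matching labels in column 2 (the real release may differ).
    """
    id_labels = {}
    with open(csv_path, newline="") as f:
        for row in csv.reader(f, delimiter="|"):
            ids, labels = row[1].split(), row[2].split()
            for rid, label in zip(ids, labels):
                id_labels[rid] = int(label)  # 0 = not sarcastic, 1 = sarcastic
    return id_labels

def build_examples(id_labels, comments_path):
    """Look up each saved response ID in the metadata JSON and label its text."""
    # comments.json is assumed to map comment ID -> metadata dict with a "text" field.
    with open(comments_path) as f:
        comments = json.load(f)
    examples = [(comments[rid]["text"], label)
                for rid, label in id_labels.items() if rid in comments]
    random.shuffle(examples)  # shuffle each split before use
    return examples

train = build_examples(load_id_labels("train-balanced.csv"), "comments.json")
test = build_examples(load_id_labels("test-balanced.csv"), "comments.json")
```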



The Twitter Model is a newly trained binary sentiment model. The dataset was acquired from this website. The initial dataset contained 1,578,627 classified tweets, each labeled 0 (negative) or 1 (positive). It was then parsed to build a training dataset of the required format and shuffled to avoid potential bias. We then split the resulting dataset 80:20 for training and testing purposes, respectively.
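A minimal sketch of the shuffle-then-split step, assuming the tweets have already been parsed into (text, label) pairs; the example rows below are hypothetical.

```python
import random

def split_dataset(examples, train_frac=0.8, seed=42):
    """Shuffle, then split 80:20; shuffling first avoids any ordering bias in the source file."""
    rng = random.Random(seed)
    shuffled = examples[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_frac)
    return shuffled[:cut], shuffled[cut:]

# Hypothetical parsed rows: (tweet text, label) with 0 = negative, 1 = positive.
tweets = [("great movie night", 1), ("worst service ever", 0),
          ("loving this weather", 1), ("so disappointed", 0),
          ("absolutely brilliant", 1)]
train_set, test_set = split_dataset(tweets)
```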



To find out more about the models we trained, please visit our Models and Datasets pages.





Dataset evaluated



  1. SARC - unlabeled (64,666 responses): the balanced unlabeled dataset (~20% of the corpus) created specifically for testing purposes.




Step II.A

SARC ANALYSIS - the SARC Ground Truth



  1. Correlation between the binary sentiment and the sarcastic nature (according to the Netflix model and SARC ground truth); a cross-tabulation sketch follows this list
    [Charts: SARCASTIC | NOT SARCASTIC]

  2. Correlation between the binary sentiment and the sarcastic nature (according to the Twitter model and SARC ground truth)
    [Charts: SARCASTIC | NOT SARCASTIC]

  3. Correlation between the categorical sentiment and the sarcastic nature (according to the Stanford model and SARC ground truth)
    [Charts: SARCASTIC | NOT SARCASTIC]
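A minimal sketch of the cross-tabulation behind these charts, assuming parallel lists of ground-truth sarcasm labels and binary sentiment predictions; all inputs here are hypothetical.

```python
from collections import Counter

# Hypothetical inputs: SARC ground-truth sarcasm labels (0/1) and the
# sentiment predicted by one of the binary models for the same examples.
sarcasm = [1, 1, 0, 0, 1, 0]
sentiment = ["negative", "positive", "positive", "negative", "negative", "positive"]

# 2x2 contingency table: how often each sentiment co-occurs with each sarcasm label.
table = Counter(zip(sarcasm, sentiment))
for s in (1, 0):
    name = "SARCASTIC" if s else "NOT SARCASTIC"
    total = sum(v for (lab, _), v in table.items() if lab == s)
    for mood in ("negative", "positive"):
        share = table[(s, mood)] / total if total else 0.0
        print(f"{name:>13} / {mood}: {share:.0%}")
```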








Step II.B

SARC ANALYSIS - the SARC Model



  1. Correlation between the binary sentiment and the sarcastic nature (according to SARC + Netflix models); see the sketch after this list for how the SARC model's labels are obtained
    [Charts: SARCASTIC | NOT SARCASTIC]

  2. Correlation between the binary sentiment and the sarcastic nature (according to SARC + Twitter models)
    [Charts: SARCASTIC | NOT SARCASTIC]

  3. Correlation between the categorical sentiment and the sarcastic nature (according to SARC + Stanford models)
    [Charts: SARCASTIC | NOT SARCASTIC]
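Step II.B repeats the analysis above, but with the SARC model's predicted labels standing in for the ground truth. A minimal sketch of that substitution, assuming the model emits a per-example probability of sarcasm (the scores below are hypothetical); the resulting 0/1 labels feed the same cross-tabulation as in Step II.A.

```python
# Hypothetical scores: the SARC model's predicted probability of sarcasm
# for each example in the unlabeled test set.
sarc_scores = [0.82, 0.13, 0.67, 0.45]

# Threshold at 0.5 to obtain binary labels that replace the ground truth
# in the Step II.A cross-tabulation.
predicted_sarcasm = [1 if p >= 0.5 else 0 for p in sarc_scores]
```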








Step III

HT LEAD GENERATION - ANALYSING THE PERFORMANCE OF THE MODELS



  1. SARC ROC Curve


  2. Netflix ROC Curve


  3. Twitter ROC Curve
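The sketch below shows one common way to produce ROC curves like those above with scikit-learn, assuming each model exposes a per-example score for its positive class; the labels and scores here are hypothetical.

```python
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve, auc

# Hypothetical inputs: ground-truth labels and the model's predicted
# probability of the positive class for each test example.
y_true = [0, 0, 1, 1, 0, 1, 1, 0]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7, 0.9, 0.55]

fpr, tpr, _ = roc_curve(y_true, y_score)
roc_auc = auc(fpr, tpr)

plt.plot(fpr, tpr, label=f"ROC curve (AUC = {roc_auc:.2f})")
plt.plot([0, 1], [0, 1], linestyle="--", label="chance")  # diagonal baseline
plt.xlabel("False positive rate")
plt.ylabel("True positive rate")
plt.legend()
plt.show()
```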