The dataset comprises annotations of 225 sentences extracted from 170 Faroese news articles. The analysis was conducted at both the sentence and document levels, incorporating multi-class sentiment labels. The dataset features comparisons between GPT-4’s performance and that of human annotators.
Columns
News article: The full text of the news article.Selected Sentence: The sentence selected for sentiment analysis.Sentence label - GPT-4: GPT-4’s sentiment classification of the selected sentence.Sentence label - Annotator 1: The first human annotator’s sentiment classification of the selected sentence.Sentence label - Annotator 2: The second human annotator’s sentiment classification of the selected sentence.News label - GPT-4: GPT-4’s sentiment classification of the entire news article.News label - Annotator 1: The first human annotator’s sentiment classification of the entire news article.News label - Annotator 2: The second human annotator’s sentiment classification of the entire news article.Topic - GPT4: GPT-4’s classification of the article’s topic.Topic relevance - Annotator 1: The first human annotator’s assessment of the topic’s relevance.Correct topic if not relevant - Annotator 1: The corrected topic by the first annotator if the original classification was deemed not relevant.Topic (National (N) / International (I) / Mixed (M)) - Annotator 1: The topic classification as National, International, or Mixed by the first human annotator.
Release: 05.03.2024
Contact: vesteinn.snaebjarnarson@gmail.com




