Skip to content

UEC-InabaLab/ETCDataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

7 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Emotion Transcription in Conversation Dataset

License: CC BY-NC 4.0

The Emotion Transcription in Conversation (ETC) Dataset is a Japanese dialogue dataset of approximately 1,000 conversations. Each utterance is paired with an emotion transcription, a natural language description of the speaker's internal emotional state at the time of the utterance. The dataset also includes emotion labels corresponding to the emotion transcriptions, as well as speakers' personality traits (TIPI-J).

This dataset was constructed as a benchmark for the task of Emotion Transcription in Conversation (ETC): describing the emotional states behind speakers' utterances in natural language.

Note

A Japanese version of this README is available here.

Note

The published data has been quality-checked, and dialogues considered ethically problematic have been excluded. Please note that the analysis reported in the paper is based on the dataset prior to the exclusion of such dialogues and may differ from the statistics of the published version. Additionally, speaker names have been replaced with anonymous IDs assigned by the dataset creators.

Caution

The dialogue content in this dataset was collected via crowdsourcing and does not represent the beliefs or opinions of the dataset creators or their affiliated institutions.

๐ŸŒŸ Statistics

ETC Dataset
# Dialogues 997
# Speakers 198
# Utterances / emotion transcriptions 9,970
Utterances per dialogue 10
Avg. utterance length (characters) 42.72
โ”” Speaker 44.65
โ”” Listener 40.79
Avg. emotion transcription length (characters) 28.88
โ”” Speaker 28.91
โ”” Listener 28.85
# Emotion categories 7 (Ekman's 6 basic emotions + Neutral)
Language Japanese

๐Ÿ“ Data Structure

The etc/ directory contains the dialogue data (dialogues/*.json) and speaker personality trait data based on TIPI-J1 (personality_traits.json).

etc/
โ”œโ”€โ”€ dialogues/              // Dialogue data (one file per dialogue)
โ”‚   โ”œโ”€โ”€ 0001.json
โ”‚   โ”œโ”€โ”€ 0002.json
โ”‚   โ”œโ”€โ”€ ...
โ”‚   โ””โ”€โ”€ 0997.json
โ”œโ”€โ”€ personality_traits.json // Speaker personality traits data
โ””โ”€โ”€ split.json              // Train/Valid/Test split information

๐Ÿ’ฌ Dialogue Data

The dialogue data includes participant IDs, utterances, emotion transcriptions, and emotion labels. Each dialogue begins with the Speaker's utterance, and the Speaker and Listener take turns alternately (10 utterances per dialogue in total).

For dialogue collection, we adopted the dialogue setup from EmpatheticDialogues2. For each dialogue, a specific emotion label (e.g., "impressed," "disappointed," "confident"โ€”32 types in total) was assigned. The Speaker talks about an experience related to that emotion, while the Listener responds to the Speaker's utterances.

Emotion labels consist of 7 categories: Ekman's 6 basic emotions3 (joy, sadness, fear, anger, surprise, and disgust) plus "Neutral." Each emotion transcription was annotated by 3 annotators in a multi-label format.

Key Type Description
dialogue_id int Dialogue ID
dialogue_emotion str Emotion label assigned to the participant pair for the dialogue
participants dict Dictionary of speaker IDs
participants.speaker str Speaker ID
participants.listener str Listener ID
dialogue list (dict) List of utterance information
dialogue.turn int Turn number (1-indexed)
dialogue.role str Role: speaker or listener
dialogue.utterance str Utterance text
dialogue.emotion_transcription str The participant's emotion transcription for the utterance
dialogue.emotions list (list (str)) List of emotion labels for the emotion transcription (multi-label format by 3 annotators)

Example: etc/dialogues/0945.json

{
    "dialogue_id": 945,
    "dialogue_emotion": "ไฟก้ ผใ™ใ‚‹",
    "participants": {
        "speaker": "FQ",
        "listener": "BN"
    },
    "dialogue": [
        {
            "turn": 1,
            "role": "speaker",
            "utterance": "ไฟก้ ผใŒใชใ„ใจใ€ไบบ้–“้–ขไฟ‚ใฃใฆๆง‹็ฏ‰ใงใใชใ„ใ‚‚ใฎใ‹ใชใจๆ€ใ„ใพใ™ใŒใ€ใใ†ใฏ่จ€ใฃใฆใ‚‚่ฃๅˆ‡ใ‚‰ใ‚Œใ‚‹ใ“ใจใ‚‚ใ‚ใ‚‹ใ—ใ€้›ฃใ—ใ„ใงใ™ใ‚ˆใญใ€‚",
            "emotion_transcription": "ใ„ใใชใ‚Šๆทฑใ„่ณชๅ•ใ‚’ใ—ใ€็›ธๆ‰‹ใฏๅ›ฐใ‚‹ใ‹ใชใจๆ€ใ„ใคใคใ‚‚ใ€ไบบๆŸ„ใ‚’็Ÿฅใ‚‹ใŸใ‚ใซ่žใ„ใฆใฟใŸใใชใ‚Šใพใ—ใŸใ€‚",
            "emotions": [
                ["ๆๆ€–"],
                ["ๆๆ€–"],
                ["่ฉฒๅฝ“ใชใ—"]
            ]
        },
        {
            "turn": 1,
            "role": "listener",
            "utterance": "ไบบใจใฎ้–ขไฟ‚ใฃใฆๆœฌๅฝ“ใซ้›ฃใ—ใ„ใงใ™ใ‚ˆใญใ€‚่‰ฏใ‹ใ‚Œใจๆ€ใฃใฆใ—ใŸไบ‹ใŒ็›ธๆ‰‹ใ‹ใ‚‰ใ™ใ‚Œใฐ่ฟทๆƒ‘ใ ใฃใ‚Šใ€ไปฒใŒ่‰ฏใ„ใจๆ€ใฃใฆใ„ใŸใฎใซ่ฃใงๆ‚ชๅฃใ‚’่จ€ใ‚ใ‚Œใฆใ„ใŸใ‚Šๆญฃ่งฃใŒใชใใฆๆ‰‹ๆŽขใ‚Šใงๆง‹็ฏ‰ใ—ใฆใ„ใใ—ใ‹ใ‚ใ‚Šใพใ›ใ‚“ใ‚ˆใญใ€‚",
            "emotion_transcription": "่‡ชๅˆ†ใฏไบบ้–“้–ขไฟ‚ใฎ่ค‡้›‘ใ•ใซๅคงใ—ใฆๆทฑใๅ…ฑๆ„Ÿใ—ใ€้›ฃใ—ใ„ไบ‹ใ‚‚ๅคšใ„ใ‹ใ‚‰ใ“ใ่ช ๅฎŸใซๅ‘ใๅˆใฃใฆไฟก้ ผ้–ขไฟ‚ใ‚’็ฏ‰ใใ“ใจใŒๅคงๅˆ‡ใ ใจไผใˆใŸใ‹ใฃใŸใงใ™ใ€‚",
            "emotions": [
                ["ๆ‚ฒใ—ใฟ"],
                ["ๆ‚ฒใ—ใฟ"],
                ["่ฉฒๅฝ“ใชใ—"]
            ]
        }
        // ...
    ]
}

๐Ÿ‘ค Participant Personality Trait Data

The personality trait data includes TIPI-J (Japanese version of the Ten-Item Personality Inventory)1 questionnaire items, speaker responses, and Big Five scores computed from those responses.

Key Type Description
item dict Questionnaire items (i01โ€“i10)
personality dict Personality trait data keyed by speaker ID
personality.*.participant_id str Participant ID
personality.*.response dict Responses to each questionnaire item
personality.*.score dict Scores for each Big Five dimension
personality.*.score.openness int Openness (2โ€“14)
personality.*.score.conscientiousness int Conscientiousness (2โ€“14)
personality.*.score.extraversion int Extraversion (2โ€“14)
personality.*.score.agreeableness int Agreeableness (2โ€“14)
personality.*.score.neuroticism int Neuroticism (2โ€“14)
{
    "item": {
        "i01": "ๆดป็™บใง๏ผŒๅค–ๅ‘็š„ใ ใจๆ€ใ†",
        "i02": "ไป–ไบบใซไธๆบ€ใ‚’ใ‚‚ใก๏ผŒใ‚‚ใ‚ใ”ใจใ‚’่ตทใ“ใ—ใ‚„ใ™ใ„ใจๆ€ใ†",
        "i03": "ใ—ใฃใ‹ใ‚Šใ—ใฆใ„ใฆ๏ผŒ่‡ชๅˆ†ใซๅŽณใ—ใ„ใจๆ€ใ†",
        // ...
    },
    "personality": {
        "AA": {
            "participant_id": "AA",
            "response": {
                "i01": "2. ใŠใŠใ‚ˆใ้•ใ†ใจๆ€ใ†",
                "i02": "2. ใŠใŠใ‚ˆใ้•ใ†ใจๆ€ใ†",
                // ...
            },
            "score": {
                "openness": 10,
                "conscientiousness": 2,
                "extraversion": 7,
                "agreeableness": 11,
                "neuroticism": 9
            }
        }
        // ...
    }
}

๐Ÿ—‚๏ธ Split Information

split.json contains the Train / Valid / Test split information used in the experiments reported in the paper. Note that the dataset used in the paper's experiments includes dialogues that were later excluded from this published dataset due to ethical concerns.

๐Ÿ›ก๏ธ Guidelines for Use

Caution

Please observe the following guidelines when using this dataset:

  • Do not attempt to identify individuals from the data in this dataset.
  • Do not use this dataset to impersonate any specific speaker.
  • When using this dataset for purposes such as predicting speakers' personality traits, be mindful of the rights of speakers who may not wish to have their personal information inferred.

๐Ÿ“„ Citation

@inproceedings{tanaka-etal-2026-etcdataset,
  title = "Emotion Transcription in Conversation: A Benchmark for Capturing Subtle and Complex Emotional States through Natural Language",
  author = "Tanaka, Yoshiki and 
    Uehara, Ryuichi and 
    Inoue, Koji and 
    Inaba, Michimasa",
  booktitle = "Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)",
  year = "2026",
  pages = "9692--9709",
  publisher = "European Language Resources Association (ELRA)"
}

@inproceedings{tanaka-etal-2026-etcdataset-ja,
    title = "ๅฏพ่ฉฑใซใŠใ‘ใ‚‹ๅฟƒๆƒ…่จ˜่ฟฐ: ่‡ช็„ถ่จ€่ชžใซใ‚ˆใ‚‹ๆฉŸๅพฎใ‹ใค่ค‡้›‘ใชๅฟƒๆƒ…็†่งฃใฎใŸใ‚ใฎใƒ™ใƒณใƒใƒžใƒผใ‚ฏ",
    author = "็”ฐไธญ ็พฉ่ฆ and ไธŠๅŽŸ ้š†ไธ€ and ไบ•ไธŠ ๆ˜‚ๆฒป and ็จฒ่‘‰ ้€šๅฐ†",
    booktitle = "่จ€่ชžๅ‡ฆ็†ๅญฆไผš็ฌฌ32ๅ›žๅนดๆฌกๅคงไผš็™บ่กจ่ซ–ๆ–‡้›†",
    year = "2026",
    pages = "1328--1333"
}

๐Ÿ™‡ Acknowledgments

This work was supported by JSPS KAKENHI Grant Number 25H01382.

โš–๏ธ License

This dataset is licensed under CC BY-NC 4.0.

CC BY-NC 4.0

Footnotes

  1. Atsushi Oshio, ABE Shingo, and Pino Cutrone. Development, reliability, and validity of the japanese version of ten item personality inventory (tipi-j). Japanese Journal of Personality, Vol. 21, No. 1, 2012. โ†ฉ โ†ฉ2

  2. Hannah Rashkin, Eric Michael Smith, Margaret Li, and Y-Lan Boureau. Towards empathetic open-domain conversation models: A new benchmark and dataset. In Anna Korhonen, David Traum, and Lluรญs Mร rquez, editors, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5370โ€“5381, Florence, Italy, July 2019. Association for Computational Linguistics. โ†ฉ

  3. P. Ekman, W. V. Friesen, M. J. O'Sullivan, A. K. Chan, I. Diacoyanni-Tarlatzis, K. G. Heider, R. Krause, W. A. LeCompte, T. K. Pitcairn, P. E. Ricci-Bitti, K. R. Scherer, M. Tomita, and A. Tzavaras. Universals and cultural differences in the judgments of facial expressions of emotion. Vol. 53, pp. 712โ€“717, 1987. โ†ฉ

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors