MEDIC

Overview

The MEDIC dataset is a valuable resource for the computational analysis of empathy in psychotherapy. It is specifically designed to address the gap in datasets that focus on empathic interactions between counselors and clients, which are crucial for successful psychotherapeutic processes.

Dataset Description

The MEDIC dataset is a multimodal collection derived from face-to-face psychological counseling sessions. It consists of 771 video clips, capturing the dynamic and complex nature of therapeutic interactions.

Dataset Composition

The dataset is organized into five primary components within the MEDIC dataset folder:

Text Folder: Contains textual data corresponding to each sample.
Audio Features Folder: Includes audio features of each sample.
2P Audio Features Folder: Stores audio features with two-party (2p) interactions, where different speakers’ features are separated.
2P Keypoints Features Folder: Consists of visual keypoints features related to two-party interactions.
Labels.csv File: Provides the labels corresponding to each dataset sample.

Labels and Annotations

Three labels are proposed to describe the degree of empathy between counselors and their clients:

Expression of Experience: Assesses whether the client has expressed experiences that can trigger empathy.
Emotional Reaction: Indicates the counselor’s empathic emotional reaction.
Cognitive Reaction: Represents the counselor’s empathic cognitive reaction.

These labels are instrumental in analyzing and understanding the empathic interactions in the counseling sessions.

Detailed Structure

Total Samples: 771
Each sample includes the following:
- Text: Narrative or descriptive text data.
- Visual Features: Data related to visual aspects.
- Audio Features: Comprises complete MFCC (Mel-Frequency Cepstral Coefficients) features for each audio sample.
- Labels: Specific labels associated with each sample for classification or identification purposes.

Audio Features Specifics

The audio features are further distinguished into individual and two-party (2p) components:
- Individual Audio Features: Standard MFCC features for each sample.
- Two-Party Audio Features (2p_audio_feature): Includes MFCC features with the audio of different speakers separated for more nuanced analysis.
  - Filenames ending in ‘_con.npy’ correspond to features associated with the counselor.
  - Filenames ending in ‘_cli.npy’ correspond to features associated with the client.

Visual Features Specifics

Two-Party Keypoints Features (2p_keypoints_feature): This folder contains visual keypoints data, particularly focusing on scenarios involving interactions between two parties (e.g., counselor and client).

Recent Dataset Expansion

In addition to the original 771 samples, the dataset has been expanded with 373 new samples sourced from the 看见心理-双面镜计划 (link). This expansion enriches the diversity of the dataset, providing more instances for the analysis of empathic interactions in therapeutic settings.

Usage

This dataset is ideal for research in fields such as conversational analysis, behavioral studies, and multimodal machine learning. It provides a rich source of data for training models that require a combination of textual, audio, and visual inputs.

License and Agreement

To use the MEDIC dataset, users are required to sign an agreement, which encompasses the terms of use and licensing conditions. To download this database, you need to print the agreement, sign it and send it to us. To ensure the responsible and appropriate use of the MEDIC dataset, it is required that the agreement is signed by an individual holding an official position at their respective institution. This ensures that the signatory has the authority and responsibility to adhere to the terms of use and licensing conditions outlined in the agreement.

Contact Us

For any inquiries about the MEDIC dataset, please contact:

Bingzhao Cai (In charge of the database): cbz_2020@mail.ustc.edu.cn
Shangfei Wang: sfwang@ustc.edu.cn

Official Website of our Research Group: USTC-AC