The MEDIC dataset is a valuable resource for the computational analysis of empathy in psychotherapy. It is specifically designed to address the gap in datasets that focus on empathic interactions between counselors and clients, which are crucial for successful psychotherapeutic processes.
The MEDIC dataset is a multimodal collection derived from face-to-face psychological counseling sessions. It consists of 771 video clips, capturing the dynamic and complex nature of therapeutic interactions.
The dataset is organized into five primary components within the MEDIC dataset folder:
Three labels are proposed to describe the degree of empathy between counselors and their clients:
These labels are instrumental in analyzing and understanding the empathic interactions in the counseling sessions.
In addition to the original 771 samples, the dataset has been expanded with 373 new samples sourced from the 看见心理-双面镜计划 (link). This expansion enriches the diversity of the dataset, providing more instances for the analysis of empathic interactions in therapeutic settings.
This dataset is ideal for research in fields such as conversational analysis, behavioral studies, and multimodal machine learning. It provides a rich source of data for training models that require a combination of textual, audio, and visual inputs.
To use the MEDIC dataset, users are required to sign an agreement, which encompasses the terms of use and licensing conditions. To download this database, you need to print the agreement, sign it and send it to us. To ensure the responsible and appropriate use of the MEDIC dataset, it is required that the agreement is signed by an individual holding an official position at their respective institution. This ensures that the signatory has the authority and responsibility to adhere to the terms of use and licensing conditions outlined in the agreement.
For any inquiries about the MEDIC dataset, please contact:
Official Website of our Research Group: USTC-AC