EmoSense

About

EmoSense is a multi-modal emotion detection system that combines Automatic Speech Recognition (ASR), Text Emotion Analysis, Speech Emotion Recognition (SER), and Facial Emotion Detection. By analyzing speech, text, and facial expressions together, it aims to give a fuller picture of a person's emotional state than any single channel provides. Intended uses include personalized experiences, tailored interventions, and analytics across various domains.

WIP

Building novel models for each emotion detection task.

What is EmoSense

Emotions are integral to human communication and interactions, yet accurately detecting and interpreting them presents significant challenges. Existing emotion detection systems often rely on single modalities, such as text or speech, leading to limited accuracy and depth of analysis. Inconsistent or inaccurate emotion detection can hinder personalized user experiences, effective mental health assessments, and interactive technologies.

Problems addressed:

Limited accuracy and depth of emotion analysis with single-modal systems.
Challenges in understanding emotions expressed through speech, text, and facial expressions.
Inconsistent and inaccurate emotion detection hindering personalized experiences and effective assessments.

EmoSense is a multi-modal emotion detection system that analyzes and interprets human emotions across several channels. By integrating Automatic Speech Recognition (ASR), Text Emotion Analysis, Speech Emotion Recognition (SER), and Facial Emotion Detection, it covers emotional expression in speech, text, and facial cues. The project targets applications in healthcare, education, customer service, and entertainment, where more reliable emotion detection can support tailored experiences and better interactions.

Models used for each task

Automatic Speech Recognition - Whisper Large v3
Text Emotion Analysis - Fine-tuned RoBERTa for Sequence Classification
Face Detection - Fine-tuned YOLOv8
Face Emotion Detection - Fine-tuned VGG19
Speech Emotion Recognition - Novel SER model as described here
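
This README does not spell out how the stages are wired together, so the following is only a minimal sketch of one plausible arrangement, not the actual code in EmoSense.py. It assumes the ASR stage yields timed transcript segments, that each segment is labeled by both the text and speech models, and that faces are detected per video frame before being classified. All names (`Segment`, `LabeledSegment`, `label_audio`, `label_frames`, and the callables passed in) are hypothetical; the real models would be plugged in as those callables.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable, List, Tuple


@dataclass
class Segment:
    """A timed piece of transcript (hypothetical ASR output shape)."""
    start: float  # seconds
    end: float    # seconds
    text: str


@dataclass
class LabeledSegment:
    """A transcript segment with one label per audio-derived modality."""
    segment: Segment
    text_emotion: str
    speech_emotion: str


def label_audio(
    audio_path: str,
    transcribe: Callable[[str], List[Segment]],
    classify_text: Callable[[str], str],
    classify_speech: Callable[[str, float, float], str],
) -> List[LabeledSegment]:
    """Transcribe the audio, then label each segment from its text and its audio span."""
    labeled = []
    for seg in transcribe(audio_path):
        labeled.append(
            LabeledSegment(
                segment=seg,
                text_emotion=classify_text(seg.text),
                speech_emotion=classify_speech(audio_path, seg.start, seg.end),
            )
        )
    return labeled


def label_frames(
    frames: Iterable[Any],
    detect_faces: Callable[[Any], List[Any]],
    classify_face: Callable[[Any], str],
) -> List[Tuple[int, List[str]]]:
    """For each frame, detect face crops and classify each crop's emotion."""
    results = []
    for index, frame in enumerate(frames):
        emotions = [classify_face(crop) for crop in detect_faces(frame)]
        results.append((index, emotions))
    return results
```

Passing the models in as plain callables keeps the sketch independent of any particular library, so each stage can be swapped or stubbed out for testing.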

Examples

Here are EmoSense results for five videos sourced from YouTube:

Input 1 : Dramatic Film Monologue : The Society

Video:

Graphs:

Face Emotions	Speech Emotions

Labeled Transcript: Transcript can be found here

Input 2 : Crazy Rich Asians Mahjong Monologue | Close Up

Video:

Graphs:

Face Emotions	Speech Emotions

Labeled Transcript: Transcript can be found here

Input 3 : Dramatic Monologue | Strong Female Drama Actor, Young Actress Celines Estevez

Video:

Graphs:

Face Emotions	Speech Emotions

Labeled Transcript: Transcript can be found here

Input 4 : Closeup monologue from The Women

Video:

Graphs:

Face Emotions	Speech Emotions

Labeled Transcript: Transcript can be found here

Input 5 : “You Understand?” - short dramatic monologue

Video:

Graphs:

Face Emotions	Speech Emotions

Labeled Transcript: Transcript can be found here
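
As a rough illustration of how outputs like the labeled transcripts and per-modality graphs above might be derived from per-segment predictions, here is a small sketch. The transcript line format and both function names (`format_labeled_transcript`, `emotion_shares`) are assumptions made for this example and may not match the files in the outputs folder.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def format_labeled_transcript(
    segments: Iterable[Tuple[float, float, str, str]]
) -> str:
    """Render (start, end, text, emotion) tuples as one labeled line each."""
    lines = []
    for start, end, text, emotion in segments:
        lines.append(f"[{start:.1f}s - {end:.1f}s] ({emotion}) {text}")
    return "\n".join(lines)


def emotion_shares(labels: Iterable[str]) -> Dict[str, float]:
    """Return the fraction of predictions per emotion label, e.g. for a bar chart."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: count / total for label, count in counts.items()}
```

The shares returned by `emotion_shares` could be fed to any plotting library to draw one chart per modality (face, speech).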


Mujahid087/EmoSense

Folders and files

Latest commit

History

Repository files navigation

EmoSense

About

WIP

What is EmoSense

Models used for each task

Examples

Input 1 : Dramatic Film Monologue : The Society

Input 2 : Crazy Rich Asians Mahjong Monologue | Close Up

Input 3 : Dramatic Monologue | Strong Female Drama Actor, Young Actress Celines Estevez

Input 4 : Closeup monologue from The Women

Input 5 : “You Understand?” - short dramatic monologue

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages