A computer vision project for recognizing and classifying Kathakali classical dance mudras (hand gestures) using Meta's DINOv3 vision transformer model.
Go to project
A computer vision project for recognizing and classifying Kathakali classical dance mudras (hand gestures) using Metaโs DINOv3 vision transformer model.
Kathakali is a highly stylized classical Indian dance-drama from Kerala, known for its elaborate costumes, detailed gestures, and expressive movements. Central to this art form are the intricate hand gestures called Mudras, which form a sophisticated vocabulary for storytelling. This project leverages state-of-the-art computer vision to automatically recognize and classify these mudras, serving as a tool for learning, preservation, and digital analysis of this ancient art form.
The project utilizes DINOv3, Meta AIโs self-supervised vision transformer, fine-tuned specifically for Kathakali mudra recognition.
Clone the repository:
git clone https://github.com/Sree14hari/Kathakali-Mudra_DinoV3.git
cd Kathakali-Mudra_DinoV3
Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
Install dependencies:
pip install -r requirements.txt
Organize your dataset in the following structure:
data/
train/
pataka/
image_001.jpg
image_002.jpg
...
tripataka/
mayura/
...
val/
pataka/
tripataka/
...
test/
pataka/
tripataka/
...
This project uses DINOv3 (Vision Transformer) as the feature extractor with a custom classification head:
This project is licensed under the MIT License - see the LICENSE file for details.
We welcome contributions to improve Kathakali mudra recognition! Please feel free to:
Development workflow:
git checkout -b feature/your-feature-name
git commit -m "Add your feature"
git push origin feature/your-feature-name
For questions or collaborations regarding this project:
Bridging centuries-old classical art with cutting-edge computer vision technology ๐ญ๐ค
Subscribe to our newsletter!