We introduce the continuous fingerspelling dataset for Indian Sign Language, comprising 1,308 video segments with aligned text annotations. The dataset captures authentic coarticulation patterns from professional signers, supporting research in fingerspelling recognition and sign language processing.
Fingerspelling enables signers to represent proper nouns and technical terms letter-by-letter using manual alphabets, yet remains severely under-resourced for Indian Sign Language (ISL). We present the first continuous fingerspelling dataset for ISL, extracted from the ISH News YouTube channel in which fingerspelling is accompanied by synchronized on-screen text cues. The dataset comprises 1,308 segments from 499 videos, totaling 70.85 minutes and 14,814 characters, with aligned video-text pairs capturing authentic coarticulation patterns. We validate dataset quality through annotation by a proficient ISL interpreter, achieving a 90.67% exact match rate for 150 samples. We further establish baseline recognition benchmarks using a ByT5-small encoder-decoder model, which attains 82.91% Character Error Rate after fine-tuning. This resource supports multiple downstream tasks including fingerspelling transcription, temporal localization, and sign generation.
ISL-Fingerspelling/ ├── videos/ # 1,308 MP4 video files ├── fingerspelling_annotations.csv # Segment annotations ├── localization_timestamps.csv # Temporal localization in source videos └── README.md
Maps video segments to their transcriptions:
uid: Unique segment ID (format: {video_id}_seg{index})text: Fingerspelled textContains temporal boundaries of fingerspelling segments in the original YouTube videos:
video_id: YouTube video IDsegment_index: Segment index within the videostart_time, start_sec: Start timestamp and secondsend_time, end_sec: End timestamp and secondsduration_str, duration_sec: Duration in timestamp and secondstranscript: Fingerspelled textIf you use this dataset in your research, please cite:
@inproceedings{kirandevraj2025islfingerspelling,
title={Continuous Fingerspelling Dataset for Indian Sign Language},
author={Kirandevraj, R and Kurmi, Vinod K and Namboodiri, Vinay P and Jawahar, CV},
booktitle={Proceedings of the Workshop on Sign Language Processing (WSLP) at AACL-IJCNLP},
year={2025}
}
Data sourced from publicly available ISH News YouTube videos. 407 of 499 videos overlap with the iSign dataset. This dataset is released under CC BY-NC 4.0 license for research purposes only.