I/O🔗
callcut.io provides utilities for loading audio files and their annotations,
converting annotations to frame labels, and building PyTorch datasets for training.
The audio loading relies on torchaudio which under-the-hood delegates to
torchcodec and ffmpeg.
Important
ffmpeg must be installed on your system to use callcut.io.
Audio and Annotation Loading🔗
|
Load an audio file. |
|
Load call annotations from a CSV file. |
Recording Metadata🔗
Scan and validate recordings before building datasets. This pre-validation step ensures all recordings have valid annotations and computes metadata (duration, annotation count) without loading the full audio.
|
Scan recordings and return metadata for valid ones. |
|
Metadata about a recording for dataset construction. |
Label Generation🔗
Convert annotation intervals to per-frame binary labels for training.
|
Convert annotation intervals to per-frame binary labels. |
Dataset🔗
PyTorch Dataset for training call detection models.
|
PyTorch Dataset for frame-level call detection. |