Skip to Content
Experimental: Contact Us

This surface is directional and may change before public release. Contact Spiral if you want to use it in an early workflow.

The Audio Extension is intended for speech, music, and event-audio datasets where users need clip metadata, temporal windows, transcripts, features, and deferred waveform materialization.

Planned table functions

FunctionPurpose
audio.scan(path)Discover audio files or read a manifest.
audio.windows(path, window, stride)Produce fixed-length deferred audio windows.
audio.transcripts(path)Read transcript rows aligned to clips or windows.
audio.features(audio_ref)Produce preview feature rows such as duration or sample rate.
audio.decode(audio_ref)Materialize waveform windows.

These names are preview syntax. The functions are not registered in the current default CLI session.

Output shape

Window functions should produce rows like:

ColumnMeaning
clip_idStable clip id.
pathLocal path or object URI.
start_msWindow start time.
duration_msWindow duration.
sample_rateSample rate when known.
labelOptional label.
audio_refDeferred audio window reference.

Example shape

SELECT clip_id, window_ref, label FROM audio.windows('s3://bucket/audio/', window => '10s', stride => '5s');

Notes

  • Metadata queries should avoid waveform decode.
  • Transcript joins should use stable clip/window ids.
  • Materialization options should document sample rate, channel layout, waveform dtype, spectrogram policy, and device placement once implemented.
Last updated on