speechbench

Cross-model ASR comparison — every model × every dataset × 30 clips, on a single GCP T4 spot VM.
Hardware: T4 spot
Project: safecare-maps
Generated: -
Source: github.com/jasontitus/speechbench

Tables are sortable — click any column header. Green = best WER in the dataset. Red = hallucination (WER > 100% means the model generated more output than the reference). Model names link to their HuggingFace pages. Dataset titles link to their HF datasets.