Model Gallery

3 models from 1 repositories

Filter by type:

Filter by tags:

localvqe-v1.1-1.3m
LocalVQE v1.1 (1.3 M parameters, F32) — joint acoustic echo cancellation, noise suppression, and dereverberation for 16 kHz mono speech. DeepVQE-style architecture with an S4D bottleneck and an in-graph DCT-II filterbank. ~9.6× realtime on a desktop CPU; 16 ms algorithmic latency. ~5 MB on disk. v1.1 ships the v16 echoaware checkpoint with improved double-talk and near-end single-talk AECMOS scores.

Repository: localaiLicense: apache-2.0

localvqe-v1.2-1.3m
LocalVQE v1.2 (1.3 M parameters, F32) — compact joint acoustic echo cancellation, noise suppression, and dereverberation for 16 kHz mono speech. Shares the same DeepVQE-style architecture (arch_version 3) as v1.3 but with narrower encoder/decoder widths, so it runs at ~9.7× realtime (~1.6 ms per 16 ms frame on a 4-thread Zen4 CPU) — about ¼ the per-hop cost of v1.3. Widens the echo-search window to 1024 ms (v1.1 used 512 ms). ~5 MB on disk. The budget-friendly choice for low-core or power-constrained devices.

Repository: localaiLicense: apache-2.0

localvqe-v1.3-4.8m
LocalVQE v1.3 (4.8 M parameters, F32) — current default release. Joint acoustic echo cancellation, noise suppression, and dereverberation for 16 kHz mono speech, with a wider encoder/decoder trained from scratch under a noise-floor-aware loss recipe. ~4.7× realtime (~3.3 ms per 16 ms frame on a 4-thread Zen4 CPU); ~19 MB on disk. Improves doubletalk speech quality (+0.25 deg MOS) and far-end echo cancellation (ERLE +5.2–9.3 dB) over v1.2; on far-end-only scenes some users may still prefer v1.2's gentler trade-off. Same 16 ms algorithmic latency as the compact models.

Repository: localaiLicense: apache-2.0