LocalAI - Models

nemo-parakeet-tdt-0.6b

NVIDIA NeMo Parakeet TDT 0.6B v3 is an automatic speech recognition (ASR) model from NVIDIA's NeMo toolkit. Parakeet models are state-of-the-art ASR models trained on large-scale English audio data.

Links

Tags

parakeet-cpp-tdt_ctc-110m

Hybrid TDT+CTC FastConformer, 110M. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-realtime_eou_120m-v1

Cache-aware streaming RNNT FastConformer with end-of-utterance (EOU) detection, 120M. Use with streaming transcription. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-ctc-0.6b

CTC FastConformer, 0.6B. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-rnnt-0.6b

RNNT FastConformer, 0.6B. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-tdt-0.6b-v2

TDT FastConformer, 0.6B (v2). F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-tdt-0.6b-v3

TDT FastConformer, 0.6B (v3, multilingual). F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-ctc-1.1b

CTC FastConformer, 1.1B. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-rnnt-1.1b

RNNT FastConformer, 1.1B. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-tdt-1.1b

TDT FastConformer, 1.1B. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-tdt_ctc-1.1b

Hybrid TDT+CTC FastConformer, 1.1B. F16 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo Parakeet), byte-identical to NeMo at WER 0. Faster than NeMo on CPU and GPU.

Links

Tags

parakeet-cpp-nemotron-3.5-asr-streaming-0.6b

Multilingual (40+ locales), prompt-conditioned, cache-aware streaming FastConformer RNN-T, 0.6B. Q8_0 GGUF for the parakeet-cpp backend (C++/ggml port of NVIDIA NeMo). Byte-identical to NeMo at WER 0 offline and streaming, about 2.5x faster than NeMo on CPU with no GPU. Select a language with the request "language" field (for example en, de, es, ja-JP), or leave it empty for automatic detection. License OpenMDW-1.1.

Links

Tags

parakeet-crispasr

NVIDIA Parakeet TDT 0.6B v3 (FastConformer + Token-and-Duration Transducer), 25-language ASR. Runs via the CrispASR backend. Default GGUF size ~467 MB.

Links

https://huggingface.co/cstr/parakeet-tdt-0.6b-v3-GGUF

Tags

parakeet-v2-crispasr

NVIDIA Parakeet TDT 0.6B v2 (FastConformer + TDT), English-only ASR. Runs via the CrispASR backend. Default GGUF size ~468 MB.

Links

https://huggingface.co/cstr/parakeet-tdt-0.6b-v2-GGUF

Tags

parakeet-ja-crispasr

NVIDIA Parakeet TDT 0.6B Japanese ASR (F16 default; Q4_K is quantisation-sensitive for this model). Runs via the CrispASR backend. Default GGUF size ~1.24 GB.

Links

https://huggingface.co/cstr/parakeet-tdt-0.6b-ja-GGUF

Tags

parakeet-tdt-1.1b-crispasr

NVIDIA Parakeet TDT 1.1B (42-layer FastConformer encoder), English-only ASR. Runs via the CrispASR backend. Default GGUF size ~808 MB.

Links

https://huggingface.co/cstr/parakeet-tdt-1.1b-GGUF

Tags

parakeet-tdt_ctc-110m-crispasr

NVIDIA Parakeet hybrid TDT+CTC 110M (smallest, CTC decode), English-only ASR. Runs via the CrispASR backend. Default GGUF size ~91 MB.

Links

https://huggingface.co/cstr/parakeet-tdt_ctc-110m-GGUF

Tags

parakeet-tdt_ctc-1.1b-crispasr

NVIDIA Parakeet hybrid TDT+CTC 1.1B (multilingual, casing + punctuation) ASR. Runs via the CrispASR backend. Default GGUF size ~810 MB.

Links

https://huggingface.co/cstr/parakeet-tdt_ctc-1.1b-GGUF

Tags

parakeet-rnnt-0.6b-crispasr

NVIDIA Parakeet RNN-Transducer 0.6B (24-layer FastConformer) ASR. Runs via the CrispASR backend. Default GGUF size ~447 MB.

Links

https://huggingface.co/cstr/parakeet-rnnt-0.6b-GGUF

Tags

parakeet-rnnt-1.1b-crispasr

NVIDIA Parakeet RNN-Transducer 1.1B (42-layer FastConformer) ASR. Runs via the CrispASR backend. Default GGUF size ~770 MB.

Links

https://huggingface.co/cstr/parakeet-rnnt-1.1b-GGUF

Tags

Model Gallery

Filter by type:

Filter by tags:

nemo-parakeet-tdt-0.6b

parakeet-cpp-tdt_ctc-110m

parakeet-cpp-realtime_eou_120m-v1

parakeet-cpp-ctc-0.6b

parakeet-cpp-rnnt-0.6b

parakeet-cpp-tdt-0.6b-v2

parakeet-cpp-tdt-0.6b-v3

parakeet-cpp-ctc-1.1b

parakeet-cpp-rnnt-1.1b

parakeet-cpp-tdt-1.1b

parakeet-cpp-tdt_ctc-1.1b

parakeet-cpp-nemotron-3.5-asr-streaming-0.6b

parakeet-crispasr

parakeet-v2-crispasr

parakeet-ja-crispasr

parakeet-tdt-1.1b-crispasr

parakeet-tdt_ctc-110m-crispasr

parakeet-tdt_ctc-1.1b-crispasr

parakeet-rnnt-0.6b-crispasr

parakeet-rnnt-1.1b-crispasr