Built on next-generation Kaldi and ONNX Runtime, sherpa-onnx is a versatile speech processing toolkit. It provides comprehensive support for streaming and non-streaming ASR, TTS, speaker diarization, speech enhancement, and voice activity detection (VAD). All processing is performed locally, ensuring privacy and functionality without an internet connection.
Speech Processing Capabilities:
The toolkit is designed for cross-platform compatibility across various operating systems and architectures:
The official documentation provides a compatibility matrix detailing supported combinations. sherpa-onnx also extends to embedded hardware, including Raspberry Pi, NVIDIA Jetson, RV1126, and LicheePi4A. Additionally, it supports WebAssembly for in-browser execution.
Developers can integrate sherpa-onnx into their projects using 11 different languages: C++, C, Python, JavaScript, Java, C#, Kotlin, Swift, Go, Dart, Rust, and Pascal (Object Pascal).
The repository includes comprehensive examples and build scripts for each target platform:
android/: Android implementation details.ios-swift/: Samples for iOS development using Swift.python-api-examples/: Python-based demonstrations.wasm/: WebAssembly support for web integration.Specific build scripts, such as build-android-arm64-v8a.sh and build-wasm-simd-asr.sh, facilitate compilation for various architectures.
A wide range of ready-to-use models is available for diverse languages and deployment scenarios:
ASR Models
Other Models The ecosystem includes models for VAD, speech enhancement (GTCRN), and source separation (Spleeter). Users can download these from the official links, with several models offering support for multiple languages and regional dialects.
sherpa-onnx-unity project brings advanced speech processing capabilities directly into the Unity engine.For comprehensive implementation guides and technical details, visit the official documentation: https://k2-fsa.github.io/sherpa/onnx/
NOF0 Open Source AI Trading Arena Puts Crypto Models Head to Head
Open Computer Use: AI Agents with Hands-On Desktop Control
SPV VPN: Fast, Stable, and One-Click Unlimited Access
Feiniao VPN: Free Trial, 4K Streaming & Unblock Netflix (2026 Guide)
AhaSpeed VPN Review: High-Speed Performance, No Ads, and Unlimited Bandwidth
AoxVPN 8.8 Member Day Sale | No-Log VPN Featuring IEPL Private Lines
Space Adventure Story Voice Mode: Build an AI-Powered Voice Game
Agentic-Trading: Multi-Agent Simulator with A2A Protocol and ADK
Agents From Scratch: AI Email Assistant with Human-in-the-Loop Approval
Microsandbox Guide: Secure MicroVM Code Execution in 200ms
DeerFlow: Modular Multi-Agent Research With LangGraph and MCP
PyVideoTrans: Open-Source Video Translation & Dubbing Tool