Qwen3-ASR-Toolkit is a Python-based command-line utility designed to extend the capabilities of the Qwen-ASR API. By employing intelligent Voice Activity Detection (VAD), the tool segments long audio or video files into chunks shorter than three minutes. These segments are then processed concurrently using multiple threads, allowing users to bypass the official API’s duration limits and transcribe hours of content in a fraction of the time.
The toolkit supports nearly any audio or video format through its FFmpeg integration. It handles technical requirements automatically, such as resampling audio to 16kHz mono to ensure compatibility with the API. With a straightforward command-line interface, users only need to provide a DashScope API Key to access its full range of features.
Key Features
How It Works
sudo apt update && sudo apt install ffmpegbrew install ffmpegSetting your API Key as an environment variable is recommended:
# Linux/macOS
export DASHSCOPE_API_KEY="your_api_key_here"
# Windows (PowerShell)
$env:DASHSCOPE_API_KEY="your_api_key_here"
Option 1: Via PyPI (recommended)
pip install qwen3-asr-toolkit
Option 2: From source
git clone https://github.com/QwenLM/Qwen3-ASR-Toolkit.git
cd Qwen3-ASR-Toolkit
pip install .
The basic command syntax is as follows:
qwen3-asr -i <input_file_or_url> [-key <api_key>] [-j <num_threads>] [-c <context>] [-t <tmp_dir>] [-s]
| Parameter | Short | Description | Required |
|---|---|---|---|
--input-file |
-i |
Path to a local file or a remote URL | Yes |
--context |
-c |
Provide context/keywords to improve recognition of specific terms | No |
--dashscope-api-key |
-key |
DashScope API key | No (if env variable is set) |
--num-threads |
-j |
Number of concurrent threads (default: 4) | No |
--tmp-dir |
-t |
Directory for temporary files (default: ~/qwen3-asr-cache) |
No |
--silence |
-s |
Silent mode – suppresses progress information | No |
1. Transcribe a local file
qwen3-asr -i "/path/to/my/long_lecture.mp4"
2. Transcribe a remote audio file
qwen3-asr -i "https://somewebsite.com/audios/podcast_episode.mp3"
3. Increase concurrency and provide an API key manually
qwen3-asr -i "/path/to/my/podcast.wav" -j 8 -key "your_api_key_here"
4. Improve accuracy with context hints
qwen3-asr -i "/path/to/my/tech_talk.mp4" -c "Qwen-ASR, DashScope, FFmpeg, VAD"
5. Execute in silent mode
qwen3-asr -i "/path/to/my/meeting_recording.m4a" -s
SPV VPN: Fast, Stable, and One-Click Unlimited Access
openAgent: Open Source Enterprise AI Platform With RAG and Agent Workflows
Lanjing VPN Review: Unlimited Traffic, CN2 Lines, and Smart Routing
Prompt Optimizer: Write Better AI Prompts in Seconds
Clueless: A Native AI Meeting Assistant for Mac with Live Transcription
InvenTree Inventory: Self-Hosted Stock Control with REST API
AingDesk: Run Local AI Models and Build a Private Knowledge Base
AG-UI Protocol: The Open Standard for Connecting AI Agents to Frontends
ACI.dev: 600+ Tools for AI Agents with Built-In Auth and MCP Support
ALLinSSL: Automated SSL Certificate Lifecycle Management
n8n Autoscaling: Scaling Workers via Redis Queue Without Kubernetes
DeerFlow: Modular Multi-Agent Research With LangGraph and MCP