Releases · collabora/WhisperLive · GitHub

17 Mar 09:38

makaveli10

v0.8.0 Latest


What's Changed

make setup.sh to work on macos by @adamsz-lume in #379
Add hi transcription video; add blog post section by @makaveli10 in #381
Auto convert hf custom whisper to ct2(faster-whisper) by @makaveli10 in #378
Add iOS client for WhisperLive (Audio-Transcription-iOS) by @ParkMazorika in #382
Add srt download opt browser ext by @makaveli10 in #385
issue 371 by @klonikar in #384
Change max_clients max_connection_time from server only by @makaveli10 in #387
Integrate live translation by @makaveli10 in #391
fix(run_server.py): help text for max_connection_time argument by @locnnil in #397
feat: support HuggingFace model IDs for faster_whisper_custom_model_path by @makaveli10 in #412
Add support for new REST API matching OpenAI transcription API by @boxerab in #414
README.md: add instructions for running client by @JenySadadia in #415
Enable timestamps for transcribed text by @JenySadadia in #418
feat: update to support faster whisper 1.2.0 by @AlexStansfield in #398
Bump openai-whisper version to 20250625 by @makaveli10 in #419
Add cross-client GPU batch inference for faster_whisper backend by @ianwh02 in #422
Fix NoneType crash in _process_single when VAD filters all audio by @ianwh02 in #427
feat: make display_segments configurable in Client/TranscriptionClient by @nightcityblade in #425
Expose version in package root and update dependencies in setup.py by @makaveli10 in #429
Fix crash when no --files provided; use microphone input instead by @makaveli10 in #430

New Contributors

@adamsz-lume made their first contribution in #379
@ParkMazorika made their first contribution in #382
@klonikar made their first contribution in #384
@locnnil made their first contribution in #397
@boxerab made their first contribution in #414
@JenySadadia made their first contribution in #415
@AlexStansfield made their first contribution in #398
@ianwh02 made their first contribution in #422
@nightcityblade made their first contribution in #425

Full Changelog: v0.7.1...v0.8.0

Contributors

boxerab, klonikar, and 8 other contributors


15 May 04:56

makaveli10

v0.7.1

What's Changed

Integrate openvino by @makaveli10 in #348
Fix typo in setup.py by @Perseus14 in #353
Remove clip_audio from faster_whisper backend by @makaveli10 in #354
Remove blank segment feature by @makaveli10 in #359
Resolve daemon warnings for threading methods by @emmanuel-ferdman in #366
Upgrade tensorrt_llm to v0.18.2 by @makaveli10 in #368
Add possibility to configure more parameters by @giubots in #367
Add: support for secure WebSocket (WSS) connections by @rover0811 in #373
Add transcription callback to Client for handling transcription results by @xXLosKrachosXx in #374
Update tensorrt_llm docker setup by @makaveli10 in #375
Add transcription callback parameter to TranscriptionClient #361 by @xXLosKrachosXx in #376

Full Changelog: v0.6.3...v0.7.1

Contributors

Perseus14, emmanuel-ferdman, and 4 other contributors


26 Feb 14:12

makaveli10

v0.6.3

What's Changed

Fix setup.py onnxruntime version for py312 pypi installation support by @makaveli10 in #341

Full Changelog: v0.6.2...v0.6.3

Contributors

makaveli10


07 Feb 11:37

makaveli10

v0.6.2

What's Changed

Revert to 12.4.1 base image by @makaveli10 in #321
Replace ffmpeg with av lib for resampling, rtsp & hls streams by @makaveli10 in #331
Add support py 312 by @makaveli10 in #333
Add option to mute audio playback for file input by @makaveli10 in #334

Full Changelog: v0.6.1...v0.6.2

Contributors

makaveli10


16 Jan 05:13

makaveli10

v0.6.1

What's Changed

Fix skipped audio chunk by @makaveli10 in #318
Upgrade silero vad v5 by @makaveli10 in #319
Fix package name by @makaveli10 in #320

Full Changelog: v0.6.0...v0.6.1

Contributors

makaveli10


07 Jan 12:40

makaveli10

v0.6.0

What's Changed

Upgrade tensorrt-llm==0.10.0 by @makaveli10 in #276
Expose client manager args by @makaveli10 in #284
Fix srt file missing segments by @makaveli10 in #292
Upgrade faster-whisper 1.1.0rc0 by @makaveli10 in #296
Upgrade faster_whisper==1.1.0 official release by @makaveli10 in #298
Support loading hf models by @makaveli10 in #297
Fix requirements & tests for py38 by @makaveli10 in #299
Upgrade tensorrt_llm==0.15.0 by @makaveli10 in #301
Set docker tensorrt job timeout to 60 mins by @makaveli10 in #307
Add lock to thread shared variables updates/reads by @makaveli10 in #316

Full Changelog: v0.5.1...v0.6.0

Contributors

makaveli10


05 Sep 04:07

makaveli10

v0.5.1

What's Changed

Backslash missing in example by @t-nil in #236
Making backend arg safer by @berkaybilik in #243
Pin silero VAD onnx model version to v4.0 by @makaveli10 in #247
Fix deadlock issue in FFmpeg subprocess by ensuring stderr is consumed by @sondt2709 in #257
Discard no speech segments by @makaveli10 in #262
Initialize last_segment to None by @makaveli10 in #272
Set compute_type based on device capability by @makaveli10 in #274

Full Changelog: v0.5.0...v0.5.1

Contributors

t-nil, sondt2709, and 2 other contributors


07 Jun 05:57

makaveli10

v0.5.0

What's Changed

Improve cpu and gpu Dockerfiles, resulting in much smaller images by @peldszus in #206
Add option: save network stream to local file while transcribing by @fraic in #192
fix: limit CPU usage for VAD onnxruntime inference session by setting… by @makaveli10 in #215
Add support for RTSP stream by @dshepelev15 in #212
Make writing audio frames optional by @makaveli10 in #216
Expose the srt file location of Transcription client by @chien-liu in #224
Update tensorrt llm to v0.9.0 by @makaveli10 in #227
Single model mode by @peldszus in #223

Full Changelog: v0.4.1...v0.5.0

Contributors

peldszus, dshepelev15, and 3 other contributors


22 Mar 07:01

makaveli10

v0.4.1

What's Changed

Don't clear server_error flag on close. by @jsichi in #187
Fix client close by @makaveli10 in #190

Full Changelog: v0.4.0...v0.4.1

Contributors

jsichi and makaveli10


20 Mar 06:35

makaveli10

v0.4.0

What's Changed

build & push docker image on every new tag by @makaveli10 in #161
Upgrade faster whisper==1.0.1 by @makaveli10 in #163
Handle failure on systems without microphones by @FlippFuzz in #173
Fix faster whisper version in setup.py by @FlippFuzz in #175
Add support for processing same audio stream via multiple clients running different tasks. by @jsichi in #174

New Contributors

@FlippFuzz made their first contribution in #173
@jsichi made their first contribution in #174

Full Changelog: v0.3.0...v0.4.0

Contributors

jsichi, makaveli10, and FlippFuzz
