release v0.4.0

- 更新 README 和用户手册，增加 Vosk 引擎的使用说明 - 修改构建配置，支持 Vosk 引擎的打包 - 更新版本号至 0.4.0，准备发布新功能
2026-02-04 12:24:42 +08:00 · 2025-07-11 01:33:04 +08:00
parent d354a6fefa
commit 0b8b823b2e
33 changed files with 283 additions and 93 deletions
--- a/docs/engine-manual/en.md
+++ b/docs/engine-manual/en.md
@@ -1,6 +1,6 @@
 # Caption Engine Documentation

-Corresponding Version: v0.3.0
+Corresponding Version: v0.4.0

 ![](../../assets/media/structure_en.png)

@@ -80,6 +80,10 @@ def convert_audio_to_text(s_lang, t_lang, audio_type, chunk_rate, api_key):
            break
 ```

+### Caption Translation
+
+Some speech-to-text models don't provide translation functionality, requiring an additional translation module. This part can use either cloud-based translation APIs or local translation models.
+
 ### Data Transmission

 After obtaining the text of the current audio stream, it needs to be transmitted to the main program. The caption engine process passes the caption data to the Electron main process through standard output.
@@ -149,4 +153,4 @@ Data receiver code is as follows:

 ## Reference Code

-The `main-gummy.py` file under the `caption-engine` folder in this project serves as the entry point for the default caption engine. The `src\main\utils\engine.ts` file contains the server-side code for acquiring and processing data from the caption engine. You can read and understand the implementation details and the complete execution process of the caption engine as needed.
+The `main-gummy.py` file under the `caption-engine` folder in this project serves as the entry point for the default caption engine. The `src\main\utils\engine.ts` file contains the server-side code for acquiring and processing data from the caption engine. You can read and understand the implementation details and the complete execution process of the caption engine as needed.
--- a/docs/engine-manual/ja.md
+++ b/docs/engine-manual/ja.md
@@ -1,6 +1,6 @@
 # 字幕エンジンの説明文書

-対応バージョン：v0.3.0
+対応バージョン：v0.4.0

 この文書は大規模モデルを使用して翻訳されていますので、内容に正確でない部分があるかもしれません。

@@ -82,6 +82,10 @@ def convert_audio_to_text(s_lang, t_lang, audio_type, chunk_rate, api_key):
            break
 ```

+### 字幕翻訳
+
+音声認識モデルによっては翻訳機能を提供していないため、別途翻訳モジュールを追加する必要があります。この部分にはクラウドベースの翻訳APIを使用することも、ローカルの翻訳モデルを使用することも可能です。
+
 ### データの伝送

 現在の音声ストリームのテキストを得たら、それをメインプログラムに渡す必要があります。字幕エンジンプロセスは標準出力を通じて電子メール主プロセスに字幕データを渡します。
@@ -121,4 +125,4 @@ sys.stdout.reconfigure(line_buffering=True)
 ...
 ```

-データ受信側のコードは
+データ受信側のコードは
--- a/docs/engine-manual/zh.md
+++ b/docs/engine-manual/zh.md
@@ -1,6 +1,6 @@
 # 字幕引擎说明文档

-对应版本：v0.3.0
+对应版本：v0.4.0

 ![](../../assets/media/structure_zh.png)

@@ -80,6 +80,10 @@ def convert_audio_to_text(s_lang, t_lang, audio_type, chunk_rate, api_key):
            break
 ```

+### 字幕翻译
+
+有的语音转文字模型并不提供翻译，需要再添加一个翻译模块。这部分可以使用云端翻译 API 也可以使用本地翻译模型。
+
 ### 数据传递

 在获取到当前音频流的文字后，需要将文字传递给主程序。字幕引擎进程通过标准输出将字幕数据传递给 electron 主进程。