Commit Graph

46 Commits

Author SHA1 Message Date
yaofanguk
bc31d1ff2d Remove ProPainter 2026-04-10 16:08:17 +08:00
flavioy
0bae013097 Optimize subtitle detection algorithm; add multilingual translation support
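- Adaptive sampling interval: adjusted to the video frame rate (60fps+→4, 30fps+→3, low frame rate→2)
- filter_and_merge_intervals complexity reduced from O(n²) to O(n log n)
- detect_subtitle region filtering: fast path for a single region; with multiple regions, stop at the first match
- Interpolation logic now uses zip to precompute max_gap, which is more efficient
- SubtitleDetectMode enum values changed to English keys; localized names are shown through the translation system
- Added SubtitleDetectMode translations to 7 language files (Simplified Chinese/Traditional Chinese/English/Japanese/Korean/Vietnamese/Spanish)
- Old config values are migrated automatically for compatibility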

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-04-09 00:17:01 +08:00
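
The adaptive sampling interval and the O(n log n) interval merge described in commit 0bae013097 can be illustrated with a minimal sketch. This is not the project's code: the names `sampling_interval` and `merge_intervals` are hypothetical, "60fps+" and "30fps+" are assumed to mean "at least 60" and "at least 30", and the merge shown is the standard sort-then-sweep technique, which may differ from what `filter_and_merge_intervals` actually does.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start_frame, end_frame), inclusive


def sampling_interval(fps: float) -> int:
    """Pick how many frames to step between OCR samples.

    Thresholds follow the commit message; treating them as ">=" is an assumption.
    """
    if fps >= 60:
        return 4
    if fps >= 30:
        return 3
    return 2


def merge_intervals(intervals: List[Interval], max_gap: int = 0) -> List[Interval]:
    """Merge intervals that overlap or lie within max_gap frames of each other.

    Sorting costs O(n log n); the single sweep afterwards is O(n).
    """
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + max_gap:
            # Extend the last merged interval instead of starting a new one.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged
```

Sorting first means each interval only has to be compared with the most recently merged one, which is where the saving over a pairwise O(n²) comparison comes from.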
flavioy
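93d822d067 Fix LAMA mode hanging at 100%: frame intervals expanded beyond the video's total frame count caused a FramePrefetcher deadlock
- Clamp the subtitle interval end to frame_count, so the inner loop can no longer consume the sentinel and leave the outer loop blocked forever
- LAMA batch inference now runs in mini-batches (4 frames) to avoid GPU OOM
- Each inpaint model returns the original frames when inpaint_area is empty
- Add a 600s timeout to FFmpeg subprocesses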

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-04-08 23:34:53 +08:00
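
Two of the fixes in commit 93d822d067, clamping interval ends to the frame count and splitting inference into mini-batches of 4 frames, can be sketched as follows. The helper names `clamp_intervals` and `mini_batches` are hypothetical, and dropping intervals that start past the last frame is an assumption rather than something the commit message states.

```python
from typing import Iterator, List, Sequence, Tuple, TypeVar

T = TypeVar("T")
Interval = Tuple[int, int]  # (start_frame, end_frame)


def clamp_intervals(intervals: Sequence[Interval], frame_count: int) -> List[Interval]:
    """Limit every interval end to frame_count.

    Intervals that begin after the last frame are dropped (an assumption),
    so a consumer never waits for frames that do not exist.
    """
    clamped: List[Interval] = []
    for start, end in intervals:
        end = min(end, frame_count)
        if start <= end:
            clamped.append((start, end))
    return clamped


def mini_batches(items: Sequence[T], batch_size: int = 4) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]
```

Feeding a model one small slice at a time bounds peak memory by the batch size instead of by the length of the whole subtitle interval.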
flavioy
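1ab2eb96cf Polish console output: timestamps, colored labels, thread-safety improvements
- Replace PlainTextEdit with TextEdit to support HTML rich text
- Add a [HH:MM:SS] timestamp to every log entry
- Color messages automatically by type (error red / success green / warning orange / info blue)
- Fix empty parentheses being printed when the subtitle detection model has no ONNX providers
- Escape HTML special characters to prevent injection
- Remove leftover commented-out code from closeEvent in gui.py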

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-04-07 23:29:16 +08:00
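
The log formatting described in commit 1ab2eb96cf (timestamp, color by message type, HTML escaping) might look roughly like the sketch below. The function name `format_log_line`, the `LEVEL_COLORS` table, and the explicit `level` argument are assumptions for illustration; the commit says the type is detected automatically, and how it does so is not shown in the message.

```python
import html
from datetime import datetime
from typing import Optional

# Color per message type, following the commit message (error/success/warning/info).
LEVEL_COLORS = {
    "error": "red",
    "success": "green",
    "warning": "orange",
    "info": "blue",
}


def format_log_line(message: str, level: str = "info",
                    now: Optional[datetime] = None) -> str:
    """Return one HTML-formatted log line with a [HH:MM:SS] prefix."""
    stamp = (now or datetime.now()).strftime("%H:%M:%S")
    # Unknown levels fall back to the info color.
    color = LEVEL_COLORS.get(level, LEVEL_COLORS["info"])
    # Escape <, >, & and quotes so log text cannot inject markup.
    safe = html.escape(message)
    return f'<span style="color:{color}">[{stamp}] {safe}</span>'
```

Escaping has to happen before the text is wrapped in markup; otherwise a file name or error message containing `<` would be interpreted as a tag by the rich-text widget.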
flavioy
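6c61ddc6b0 Performance optimizations: frame sampling, FFmpeg encoding, frame prefetching, removal of redundant copies
- Subtitle detection: run OCR on every 3rd frame and fill the frames in between by interpolation, making detection about 3x faster
- Video encoding: replace cv2.VideoWriter (mp4v) with piped FFmpeg libx264 encoding for better quality and smaller files
- Frame prefetching: a background thread pre-decodes video frames so that I/O overlaps with model inference
- Eliminate deepcopy: numpy arrays use .copy() instead of copy.deepcopy, reducing memory overhead
- Remove redundant np.array() wrapping in color space conversions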

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-04-07 22:58:52 +08:00
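
The frame prefetching idea from commit 6c61ddc6b0, a background thread that decodes ahead while the consumer runs inference, can be sketched with a bounded queue and a sentinel. This is a generic illustration, not the project's FramePrefetcher: the function name `prefetch` and the `buffer_size` parameter are hypothetical, and any iterable stands in for the video decoder.

```python
import queue
import threading
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")
_SENTINEL = object()  # marks the end of the stream


def prefetch(source: Iterable[T], buffer_size: int = 8) -> Iterator[T]:
    """Yield items from source while a background thread reads ahead."""
    buf: queue.Queue = queue.Queue(maxsize=buffer_size)

    def producer() -> None:
        try:
            for item in source:
                buf.put(item)  # blocks while the buffer is full
        finally:
            # Always signal the end, even if the source raised,
            # so the consumer never waits forever.
            buf.put(_SENTINEL)

    threading.Thread(target=producer, daemon=True).start()
    while True:
        item = buf.get()
        if item is _SENTINEL:
            return
        yield item
```

The sentinel is produced exactly once, so it must be consumed exactly once. A loop that reads past the real end of the stream will swallow it and leave any later reader blocked.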
Jason
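50c89c1217 Support both PP-OCRv4 and PP-OCRv5 models 2025-05-22 19:52:36 +08:00
Jason
afcc93b9cc Support setting the video save path 2025-05-22 19:50:24 +08:00
Jason
9edeb437fb Support different selection regions for different tasks 2025-05-22 15:09:59 +08:00
Jason
65888f4274 Force the command line to use English 2025-05-22 15:09:59 +08:00
Jason
e26e23ad6a Support setting a time range selection
Support fast-forward and rewind with the arrow keys (Ctrl + -> or Shift + -> or ->)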
2025-05-22 15:09:59 +08:00
Jason
7cf4e882cd Initial support for macOS (Apple Silicon)
Upgrade to PP-OCRv5
Tested with Python 3.13
Thanks to user "期望" for providing the macOS device used for development and testing
2025-05-22 13:02:28 +08:00
Jason
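c673b5ccd1 Support multiple subtitle regions 2025-05-22 08:42:00 +08:00
Jason
f78e985e1c Rebuild the entire UI with PySide6-Fluent-Widgets
Add a task list component and optimize video loading logic
Support visual display of subtitle regions
Consolidate the models into STTN smart erase, STTN subtitle detection, LAMA, ProPainter, and OpenCV
Improve processing performance
Add a CPU run mode and improve multilingual support
Fix errors on some videos in ProPainter mode

This commit adds a CPU run mode for scenarios without GPU acceleration. It also improves multilingual support by adding language configuration files for Japanese, Korean, Vietnamese, and other languages, and updates the README to reflect the new run mode and multilingual support. In addition, it fixes some code logic, improving the system's stability and compatibility.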
2025-05-22 08:41:59 +08:00
Jason
746db4bced DirectML version supports running the STTN model (Windows) 2025-04-25 13:01:31 +08:00
Jason
3770ccdcfd Switch to PaddleOCR, tracking mainline updates 2025-04-25 13:00:30 +08:00
天涯古巷
53baf28326 Revert "Switch to PySimpleGUI-4-foss because the PySimpleGUI author deliberately removed the free older versions" 2025-04-25 11:03:16 +08:00
Jason
97b4159d38 DirectML version supports running the STTN model (Windows) 2025-04-24 15:56:13 +08:00
Jason
c60234f4ec Switch to PaddleOCR, tracking mainline updates 2025-04-24 15:46:38 +08:00
天涯古巷
3dc8f3bfe0 Update main.py 2024-10-23 16:41:01 +08:00
YaoFANGUK
d6736d9206 Add STTN training code 2024-01-08 17:48:21 +08:00
YaoFANGUK
2d1eb11fd6 Enlarge the field of view to ensure removal quality 2024-01-05 16:57:40 +08:00
YaoFANGUK
f9dd30fddf Work around generated videos not being shareable on Android phones 2024-01-04 14:33:33 +08:00
YaoFANGUK
a7e62db98a Suppress file deletion errors on Windows 2023-12-29 09:33:07 +08:00
YaoFANGUK
945aeb9bc8 Add file type check 2023-12-29 09:23:42 +08:00
YaoFANGUK
6ea7482344 minor 2023-12-29 08:46:36 +08:00
YaoFANGUK
ba396d9569 Process the full frame when no subtitle region is passed in 2023-12-29 08:45:20 +08:00
YaoFANGUK
d25e34f621 vsr v1.1.0 2023-12-28 14:24:17 +08:00
YaoFANGUK
0496e06cb8 Improve results 2023-12-28 12:04:32 +08:00
YaoFANGUK
125a06ca50 Update config comments 2023-12-28 10:59:46 +08:00
YaoFANGUK
0d12922b50 minor 2023-12-28 08:55:59 +08:00
YaoFANGUK
efa49df6ab Fix error when the image contains no text 2023-12-28 08:45:33 +08:00
YaoFANGUK
491b4089ec Update comments 2023-12-27 20:39:06 +08:00
YaoFANGUK
313c3d37a7 More bug fixes 2023-12-27 20:32:00 +08:00
YaoFANGUK
4d3d4b59bd Fix process hanging during automatic text detection when the mask height exceeds its width 2023-12-27 09:22:48 +08:00
YaoFANGUK
979595c5fb Fix progress bar error 2023-12-26 19:28:34 +08:00
YaoFANGUK
18d57f2a18 Fix bug 2023-12-26 19:12:48 +08:00
YaoFANGUK
f7ce8013e0 Disable preview when not running the GUI to reduce computation 2023-12-26 17:27:53 +08:00
YaoFANGUK
935c341c32 Fix hang bug 2023-12-26 17:21:30 +08:00
YaoFANGUK
41a95dac0f Add video inpaint method 2023-12-26 10:11:09 +08:00
YaoFANGUK
ceb44ba034 STTN optimization 2023-12-22 18:05:32 +08:00
YaoFANGUK
43c1c5113b Optimize code 2023-12-22 12:42:14 +08:00
天涯古巷
921902aa2b Update main.py 2023-12-14 22:14:54 +08:00
YaoFANGUK
29c5317a69 Fix bug 2023-12-13 19:50:15 +08:00
YaoFANGUK
88de54f20e Use scene detection algorithm 2023-12-12 17:24:44 +08:00
YaoFANGUK
6d741cd9e4 Add video scene detection 2023-12-12 17:06:05 +08:00
YaoFANGUK
5e048175d3 vsr v1.0.0 2023-12-08 17:12:26 +08:00