v0.0.1
This is version 0.0.1 of PillOCR, which includes the following features:
- Converts screenshots into Markdown/LaTeX. Screenshots can come from the system screenshot tool, WeChat screenshots, and similar tools. When using the system screenshot tool, configure it to copy screenshots to the clipboard automatically.
- Formula delimiters can be selected or typed into the input box (separate the left and right delimiters with a space).
- The model URL and API can be customized. The model must have vision capabilities, such as GPT-4o(-mini) or Volcano Engine's visual understanding models.
- The Windows version can be started and stopped with hotkeys, and the hotkeys are customizable. The macOS version does not yet support starting and stopping with hotkeys.
- A proxy can be configured (e.g., for GPT models that require a proxy).
Note: The macOS version currently cannot hide the Dock icon.
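
The delimiter setting takes the left and right delimiters as one string separated by a space. The following is a minimal sketch of how such a string could be split and applied to a recognized formula. The function names `parse_wrappers` and `wrap_formula` are hypothetical and are not taken from PillOCR's source; the actual implementation may differ.

```python
def parse_wrappers(text: str) -> tuple[str, str]:
    """Split a 'left right' delimiter string into its two parts.

    Hypothetical helper for illustration; not PillOCR's actual code.
    """
    parts = text.split()  # splits on any run of whitespace
    if len(parts) != 2:
        raise ValueError("expected exactly two delimiters separated by a space")
    return parts[0], parts[1]


def wrap_formula(latex: str, wrappers: str) -> str:
    """Surround a LaTeX formula with the configured delimiters."""
    left, right = parse_wrappers(wrappers)
    return f"{left}{latex}{right}"


print(wrap_formula("E = mc^2", "$ $"))      # inline math with dollar signs
print(wrap_formula("E = mc^2", r"\( \)"))   # inline math with \( ... \)
```

Under this assumption, entering `$$ $$` would produce display-style math, and entering a single token without a space would be rejected.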