Version 0.3.1
Changelog
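EasySpider (易采集): a visual, no-code web crawler/spider. It is a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawling tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service wrapping system for web applications.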
Last updated about 2 months ago
If the download is slow, consider using the download address hosted in mainland China.
The Windows build supports Windows 10 and above. This release has no ready-to-run build for Windows 7, because Chrome 109 is the last Chrome version that supports Windows 7. The 32-bit build of v0.2.0 does work on Windows 7, and the software can also be compiled from source. To collect data on Windows 7, download the 32-bit build of v0.2.0 or download the code and compile it yourself.
The macOS build supports all chips, including Intel, M1, M2, and other processors. The minimum supported operating system is macOS 11.1; for earlier versions, please download the code and compile it yourself. An example compilation method can be found in this post.
Similarly, the Linux build is only compatible with Ubuntu 20.04 and above, Deepin, Debian, and their derivatives. To collect data on other Linux distributions, please download the code and compile it yourself.
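A video introducing the features of the latest version has been uploaded to Bilibili. It is very useful and well worth watching.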
Note that the .json files in the tasks folder of v0.3.1 are not compatible with any previous version. Please redesign your tasks in v0.3.1.
Custom scripts can now also be run in judgment conditions and loop conditions: whether the script's return value is true is used as the condition for the branch or loop, which makes tasks much more flexible. Loops can also be exited from code with a break, and custom operations can act on elements inside the loop.
Multiple XPath expressions are now generated at once for the user to choose from, and the XPath Helper extension is pre-installed for debugging XPath.
Added extraction of an element's background image URL, the current page title, and the current page URL.
Added saving screenshots of an element or of the entire web page. This works best in headless mode.
Added image downloading.
Added OCR recognition of elements. To use this feature, the Tesseract library must be installed first: https://tesseract-ocr.github.io/tessdoc/Installation.html (Chinese-language guide: https://blog.csdn.net/u010454030/article/details/80515501)
The return value of JavaScript code executed on an element can now be extracted directly, enabling things like regular-expression matching or reading an element's background color.
Added the capability to switch dropdown options and extract the selected value and text of dropdown options.
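As an illustration of the kind of post-processing mentioned above (regular-expression matching on an extracted value, or reading a background image or background color), here is a minimal Python sketch. It is not EasySpider's implementation, and in EasySpider itself this logic would be written in JavaScript and run on the element. The function names and sample strings are hypothetical; the input formats assumed are those a browser typically reports for computed `background-image` and `background-color` styles.

```python
import re
from typing import Optional

# Matches url(...) with optional single or double quotes, as it usually
# appears in a computed `background-image` CSS value.
_CSS_URL = re.compile(r"""url\(\s*(['"]?)(.*?)\1\s*\)""")

# Matches the first three numeric channels of rgb(...) or rgba(...).
_CSS_RGB = re.compile(r"rgba?\(\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)")


def background_image_url(css_value: str) -> Optional[str]:
    """Return the URL inside a CSS url(...) value, or None if there is none."""
    match = _CSS_URL.search(css_value)
    return match.group(2) if match else None


def rgb_to_hex(css_value: str) -> Optional[str]:
    """Convert 'rgb(r, g, b)' or 'rgba(r, g, b, a)' to '#rrggbb' (alpha is ignored)."""
    match = _CSS_RGB.search(css_value)
    if match is None:
        return None
    red, green, blue = (int(channel) for channel in match.groups())
    return "#{:02x}{:02x}{:02x}".format(red, green, blue)


if __name__ == "__main__":
    print(background_image_url('url("https://example.com/a.png")'))
    print(rgb_to_hex("rgb(255, 0, 0)"))
```

Both helpers return `None` when the value does not match, so a caller can tell "no background image" (for example the value `none`) apart from an empty string.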
Significantly improved the in-app guidance and explanations to make the software more user-friendly, including instructions on handling iframe tags, explanations of what the parameters of the various options mean, and guidance on modifying the XPath for loop items.
Added instructions on how to execute tasks from the command line.
Added a parallel mode that can run different tasks concurrently.
Added headless mode configuration, allowing the software to run without a browser interface.
Fixed an issue where paths containing Chinese characters were not recognized correctly in user-configured browser mode.
Fixed an issue where the program would freeze when a set of conditional branches contained no unconditional branch.
Fixed the issue where the input box would freeze after saving a task.
Added an option to set the maximum waiting time for page load in the "Open Page" and "Click Element" operations.
Added the functionality to move the mouse to an element.
Added a prompt that is displayed when an element cannot be found.
Fixed the webpage scrolling bug.
Added a new field function to the "Extract Data" operation.
The task name is now initialized to the page title on the first visit.
Added version update prompts.
Added publisher information, as requested.
Updated Chrome version to 113.
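Because the OCR feature in this release depends on a separately installed Tesseract, it can be useful to confirm that Tesseract is reachable before designing a task that uses it. The following is a minimal Python sketch and not part of EasySpider. It assumes the `tesseract` executable is expected on the system `PATH`; how EasySpider itself locates Tesseract is not stated in these notes. The helper name `tesseract_version` is hypothetical.

```python
import shutil
import subprocess
from typing import Optional


def tesseract_version(executable: str = "tesseract") -> Optional[str]:
    """Return the first line printed by `<executable> --version`, or None if unavailable."""
    path = shutil.which(executable)  # Look the executable up on PATH.
    if path is None:
        return None
    try:
        result = subprocess.run(
            [path, "--version"], capture_output=True, text=True, check=False
        )
    except OSError:
        # The file exists but could not be executed.
        return None
    # Some Tesseract releases print the version banner to stderr instead of stdout.
    output = (result.stdout or result.stderr).strip()
    return output.splitlines()[0] if output else None


if __name__ == "__main__":
    version = tesseract_version()
    if version is None:
        print("Tesseract was not found on PATH; install it before using OCR.")
    else:
        print("Found:", version)
```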