jsoup Java HTML Parser v1.12.1 (latest free version) download
2025-08-29
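jsoup 1.12.1 has been released. This version includes numerous usability improvements, faster parsing, better memory efficiency, and many bug fixes.
jsoup is a Java HTML parser that can parse HTML directly from a URL or from HTML text. It provides a very convenient API for extracting and manipulating data using DOM traversal, CSS selectors, and jQuery-like methods.
Main features of jsoup:
Parse HTML from a URL, a file, or a string;
Find and extract data using DOM traversal or CSS selectors;
Manipulate HTML elements, attributes, and text;
jsoup is released under the MIT license, so it can be used in commercial projects.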
Sample code:
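The page's original sample did not survive, so what follows is only a minimal sketch of the three features listed above, assuming jsoup 1.12.1 or later is on the classpath. The class name JsoupQuickStart, the sample markup, and the example.com base URI are hypothetical and chosen so the example runs offline instead of fetching a live URL.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public class JsoupQuickStart {
    // Hypothetical sample markup, used instead of a live URL so the sketch runs offline.
    static final String HTML =
        "<html><head><title>Demo</title></head><body>"
        + "<div id=news><a href='/a'>First</a><a href='/b'>Second</a></div>"
        + "</body></html>";

    static Document parse() {
        // The second argument is the base URI used to resolve relative links.
        return Jsoup.parse(HTML, "https://example.com/");
    }

    public static void main(String[] args) {
        Document doc = parse();

        // Find data with a CSS selector.
        Elements links = doc.select("#news a");
        for (Element link : links) {
            System.out.println(link.text() + " -> " + link.absUrl("href"));
        }

        // Manipulate an attribute and the text of the first match.
        links.first().attr("rel", "nofollow").text("Updated");
        System.out.println(doc.select("#news").html());
    }
}
```

To parse a remote page instead, Jsoup.connect(url).get() returns a Document in the same way, at the cost of needing network access.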
The full changelog follows:
Changes
Change: removed deprecated method to disable TLS cert checking in Connection.validateTLSCertificates().
Change: some internal methods have been rearranged; if you extended any of the Jsoup internals you may need to make updates.
Updated jetty-server (which is used for integration tests) to latest 9.2 series (9.2.28).
Improvements
Improvement: documents now remember their parser, so when later manipulating them, the correct HTML or XML tree builder is reused, as are the parser settings like case preservation.
Improvement: Jsoup now detects the character set of the input if specified in an XML Declaration, when using the HTML parser. Previously that only happened when the XML parser was specified.
Improvement: if the document's input character set does not support encoding, flip it to one that does.
Improvement: if a start tag is missing a > and a new tag is seen with a <, treat that as a new tag. (This differs from the HTML5 spec, which would make an attribute with a name beginning with <, but in practice this impacts too many pages.)
Improvement: performance tweaks when parsing start tags, data, tables.
Improvement: added Element.nextElementSiblings() and Element.previousElementSiblings()
Improvement: treat center tags as block tags.
Improvement: allow forms to be submitted with Content-Type=multipart/form-data without requiring a file upload; automatically set the mime boundary.
Improvement: Jsoup will now detect if an input file or URL is binary, and will refuse to attempt to parse it, with an IO Exception. This prevents runaway processing time and wasted effort creating meaningless parsed DOM trees.
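A few of the improvements above can be observed from application code. The sketch below, assuming jsoup 1.12.1 or later on the classpath, touches three of them: the remembered parser, character set detection from an XML declaration under the HTML parser, and the new sibling methods. The class name ImprovementsDemo, its helper methods, and the sample markup are hypothetical and exist only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.parser.Parser;

public class ImprovementsDemo {

    // Remembered parser: a fragment appended later is parsed with the same
    // tree builder and case settings that built the document.
    static String appendedTagName(Parser parser) {
        Document doc = Jsoup.parse("<One>One</One>", "", parser);
        Element one = doc.getElementsByTag("one").first();
        one.append("<Two>Two</Two>");
        return one.child(0).tagName();
    }

    // Charset detection: the bytes are ISO-8859-1 and only the XML declaration
    // says so. Passing null as the charset asks jsoup to detect it.
    static String latin1Text() {
        String markup = "<?xml version=\"1.0\" encoding=\"iso-8859-1\"?>"
            + "<html><body>Hell\u00f6 W\u00f6rld!</body></html>";
        byte[] bytes = markup.getBytes(StandardCharsets.ISO_8859_1);
        try {
            Document doc = Jsoup.parse(new ByteArrayInputStream(bytes), null, "");
            return doc.body().text();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    static final String LIST_HTML =
        "<ul><li id=a>A</li><li id=b>B</li><li id=c>C</li></ul>";

    // New sibling methods: ids of all element siblings after the given element.
    static List<String> nextIds(String id) {
        Document doc = Jsoup.parse(LIST_HTML);
        return doc.getElementById(id).nextElementSiblings().eachAttr("id");
    }

    // Number of element siblings before the given element.
    static int previousCount(String id) {
        Document doc = Jsoup.parse(LIST_HTML);
        return doc.getElementById(id).previousElementSiblings().size();
    }

    public static void main(String[] args) {
        System.out.println(appendedTagName(Parser.xmlParser()));  // case preserved
        System.out.println(appendedTagName(Parser.htmlParser())); // case normalized
        System.out.println(latin1Text());
        System.out.println(nextIds("a"));
        System.out.println(previousCount("c"));
    }
}
```

With the XML parser the appended tag should keep its original case, while the HTML parser should normalize it to lower case, which is a convenient way to see which parser a document was built with.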
Bug Fixes
Bugfix: when using the tag case preserving parsing settings, certain HTML tree building rules were not followed for upper case tags.
Bugfix: when converting a Jsoup document to a W3C DOM, if an element is namespaced but not in a defined namespace, set it to the global namespace.
Bugfix: attributes created with the Attribute constructor with just spaces for names would incorrectly pass validation.
Bugfix: some pseudo XML Declarations were incorrectly handled when using the XML Parser, leading to an IOOB exception when parsing.
Bugfix: when parsing URL parameter names in an attribute that is not correctly HTML encoded, and near the end of the current buffer, those parameters may be incorrectly dropped. (Improved CharacterReader mark/reset support.)
Bugfix: boolean attribute values would be returned as null, vs an empty string, when accessed via the Attribute#getValue() method.
Bugfix: orphan Attribute objects (i.e. created outside of a parse or an Element) would throw an NPE on Attribute#setValue(val).
Bugfix: Element.shallowClone() was not making a clone of its attributes.
Bugfix: fixed an ArrayIndexOutOfBoundsException in HttpConnection.looksLikeUtf8() when testing small strings in specific character ranges.
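Two of the fixes above are easy to check from user code. The following is a rough sketch, assuming jsoup 1.12.1 or later on the classpath, of the boolean attribute value fix and the Element.shallowClone() fix. The class name BugfixDemo, its helper methods, and the sample markup are hypothetical.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Attribute;
import org.jsoup.nodes.Element;

public class BugfixDemo {

    // Boolean attribute fix: Attribute#getValue() on a boolean attribute such
    // as "disabled" is described as returning an empty string, not null.
    static String booleanAttributeValue() {
        Element input = Jsoup.parse("<input name=agree disabled>").selectFirst("input");
        for (Attribute attr : input.attributes()) {
            if (attr.getKey().equals("disabled")) {
                return attr.getValue();
            }
        }
        throw new IllegalStateException("disabled attribute not found");
    }

    // shallowClone fix: the clone gets its own copy of the attributes and no
    // child nodes, so changing the clone leaves the original untouched.
    static Element[] cloneAndModify() {
        Element div = Jsoup.parse("<div id=box class=card><p>Text</p></div>").selectFirst("div");
        Element copy = div.shallowClone();
        copy.attr("id", "copy");
        return new Element[] { div, copy };
    }

    public static void main(String[] args) {
        System.out.println("[" + booleanAttributeValue() + "]");
        Element[] pair = cloneAndModify();
        System.out.println(pair[0].outerHtml());
        System.out.println(pair[1].outerHtml());
    }
}
```

On versions before the fix, changing an attribute on the shallow clone could also change the original element, because the clone did not receive its own copy of the attributes.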