Unclaimed project
Are you a maintainer of stormcrawler? Claim this project to take control of your public changelog and roadmap.
Claim this projectChangelog
stormcrawler
A scalable, mature and versatile web crawler based on Apache Storm
Back to changelogNew
stormcrawler-3.2.0
What's Changed
- Release 3.1.0 by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1316
- Bump Apache Storm from 3.1.1 to 2.6.4 & archetype 3.0 to 3.1.0 by @kunalpal97 in https://github.com/apache/incubator-stormcrawler/pull/1319
- #1299 - Add DISCLAIMER to JAR files by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1320
- #1300 - Fix "files in jars have odd dates" by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1321
- Bump org.yaml:snakeyaml from 2.2 to 2.3 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1307
- Bump org.awaitility:awaitility from 4.2.0 to 4.2.2 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1310
- Bump org.jacoco:jacoco-maven-plugin from 0.8.11 to 0.8.12 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1305
- Bump org.netpreserve:jwarc from 0.29.0 to 0.30.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1304
- Bump org.apache.maven.plugins:maven-surefire-plugin from 3.2.1 to 3.5.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1308
- Bump aws.version from 1.12.663 to 1.12.772 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1302
Bump org.apache.solr:solr-solrj from 9.6.1 to 9.7.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1309Bump com.microsoft.playwright:playwright from 1.46.0 to 1.47.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1306Bump org.wiremock:wiremock from 3.5.4 to 3.9.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1311Bump selenium.version from 4.24.0 to 4.25.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1314#1323 Update archetype Storm version from 2.6.4 by @mvolikas in https://github.com/apache/incubator-stormcrawler/pull/1325Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1322Bump OpenSearch to 2.17 + fix archetype version in README by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1324Bump org.mockito:mockito-core from 5.13.0 to 5.14.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1334Bump junit.version from 5.11.0 to 5.11.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1333Bump org.apache.maven.plugins:maven-archetype-plugin from 3.2.1 to 3.3.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1332Bump org.apache.maven.archetype:archetype-packaging from 3.2.1 to 3.3.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1330Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1326Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1335Bump log4j2.version from 2.23.0 to 2.24.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1328Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1337Bump org.jetbrains:annotations from 24.1.0 to 25.0.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1331Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1338Bump com.github.crawler-commons:urlfrontier-API from 2.3.1 to 2.4 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1327Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1340Store metadata as WARC Metadata records by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1341Improve robustness of WARC generation by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1342Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.0 to 3.5.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1350Bump junit.version from 5.11.1 to 5.11.2 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1345Fix configuration for Github's linguist by @mvolikas in https://github.com/apache/incubator-stormcrawler/pull/1344Bump testcontainers.version from 1.20.1 to 1.20.2 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1346Bump org.mockito:mockito-core from 5.14.0 to 5.14.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1349Bump aws.version from 1.12.772 to 1.12.773 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1351Bump org.apache.maven.plugins:maven-javadoc-plugin from 3.10.0 to 3.10.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1347Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1352#1354 Fix: fix some typos in project by @psxjoy in https://github.com/apache/incubator-stormcrawler/pull/1355Fix #1312 "Sha512 hash of source release is missing the file part " by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1356Bump de.thetaphi:forbiddenapis from 3.7 to 3.8 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1359Bump org.jetbrains:annotations from 25.0.0 to 26.0.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1358Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1360Trivial: version number in warc/README fix #1317 by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1363Bugfix nofollow instructions in rel tags ignored by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1362Bump org.jetbrains:annotations from 26.0.0 to 26.0.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1368Bump com.microsoft.playwright:playwright from 1.47.0 to 1.48.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1366Connect to a remote instance using web sockets by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1361Bump aws.version from 1.12.773 to 1.12.776 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1367Bump org.mockito:mockito-core from 5.14.1 to 5.14.2 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1369Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1370Bump tika.version from 2.9.2 to 3.0.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1365Apache Storm 2.7.0 by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1371Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1372#1353 Fix for URLFrontier spout not taking into account the crawl ID by @klockla in https://github.com/apache/incubator-stormcrawler/pull/1373Bump junit.version from 5.11.2 to 5.11.3 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1375Bump com.ibm.icu:icu4j from 75.1 to 76.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1376Bump aws.version from 1.12.776 to 1.12.777 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1377Bump org.wiremock:wiremock from 3.9.1 to 3.9.2 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1378Bump testcontainers.version from 1.20.2 to 1.20.3 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1379Remove references to ES in OpenSearch module by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1374Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1380Fix #1313 "Exclude "__files" from Source Release Artifacts"" by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1384#1301 - add build doc for the source release by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1383[1385] bugfix - check for null before the for-each loop by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1386Sync conf files in root and archetype + explicit values for sniff conf by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1388Detect multi addresses separated by ; in a single String. Fixes #1382 by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1387Bump org.apache.maven.plugins:maven-archetype-plugin from 3.3.0 to 3.3.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1390Bump selenium.version from 4.25.0 to 4.26.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1393Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.1 to 3.5.2 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1392Bump org.apache.maven.plugins:maven-javadoc-plugin from 3.10.1 to 3.11.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1394Bump org.apache.maven.archetype:archetype-packaging from 3.3.0 to 3.3.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1395Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1398#620 Add support for shards - SolrSpout by @mvolikas in https://github.com/apache/incubator-stormcrawler/pull/1343#1403 - Downgrade log4j2 to Storm's version. Fixes #1403 by @tballison in https://github.com/apache/incubator-stormcrawler/pull/1404#1401 Drop Java-based Topologies by @mvolikas in https://github.com/apache/incubator-stormcrawler/pull/1402#1405 - bump development version to 3.2.0-SNAPSHOT by @tballison in https://github.com/apache/incubator-stormcrawler/pull/1406#1409 - remove wrapper element by @tballison in https://github.com/apache/incubator-stormcrawler/pull/1410Fixes Issues mentioned in IPMC Vote by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1417Bump aws.version from 1.12.777 to 1.12.778 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1415Bump com.microsoft.playwright:playwright from 1.48.0 to 1.49.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1419Bump opensearch.version from 2.17.0 to 2.18.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1400Bump testcontainers.version from 1.20.3 to 1.20.4 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1422Bump org.netpreserve:jwarc from 0.30.0 to 0.31.1 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1420Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1424Update to Storm 2.7.1 by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1425Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1426Adress IPMC feedback by @rzo1 in https://github.com/apache/incubator-stormcrawler/pull/1423Prevent Dependabot from suggesting dependency updates for Jackson by @jnioche in https://github.com/apache/incubator-stormcrawler/pull/1433Bump org.jsoup:jsoup from 1.18.1 to 1.18.3 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1431Bump aws.version from 1.12.778 to 1.12.779 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1427Bump org.codehaus.mojo:license-maven-plugin from 2.4.0 to 2.5.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1430Bump org.wiremock:wiremock from 3.9.2 to 3.10.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1432Bump selenium.version from 4.26.0 to 4.27.0 by @dependabot in https://github.com/apache/incubator-stormcrawler/pull/1428Regenerated License file after dependency upgrades by @github-actions in https://github.com/apache/incubator-stormcrawler/pull/1434[MINOR] update URLs in tests by @pjfanning in https://github.com/apache/incubator-stormcrawler/pull/1435New Contributors
- @kunalpal97 made their first contribution in https://github.com/apache/incubator-stormcrawler/pull/1319
- @psxjoy made their first contribution in https://github.com/apache/incubator-stormcrawler/pull/1355
- @klockla made their first contribution in https://github.com/apache/incubator-stormcrawler/pull/1373
Full Changelog: https://github.com/apache/incubator-stormcrawler/compare/stormcrawler-3.1.0...stormcrawler-3.2.0