Java / Crawlers

webmagic 🍂
8744 (+2) ⭐

A scalable web crawler framework for Java.

crawler4j 🌿
3805 (+1) ⭐

Open Source Web Crawler for Java

WebCollector 🌿
2560 (+0) ⭐

WebCollector is an open source web crawler framework based on Java. It provides simple interfaces for crawling the Web, and you can set up a multi-threaded web crawler in less than 5 minutes.

storm-crawler 🌿
609 (+0) ⭐

Scalable web crawler based on Apache Storm

356 (+0) ⭐

Open-source Enterprise Grade Search Engine Software

sitemapgen4j 🌿
122 (+0) ⭐

SitemapGen4j is a library to generate XML sitemaps in Java.

woothee-java 🌿
48 (+0) ⭐

Woothee Java implementation and Hive UDF

TACIT 🌿
101 (+0) ⭐

We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components: 1. Crawling plugins 2. Corpus management 3. Analysis plugins. TACIT's open-source plugin platform allows the architecture to adapt easily to rapid developments in text analysis.

12 (+0) ⭐

REST and STREAMING crawlers of Twitter (java)

serritor 🌿
12 (+0) ⭐

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.
