Java / Crawlers

0
webmagic πŸ‚
8843 (+2) ⭐

A scalable web crawler framework for Java.

0
crawler4j 🌿
3860 (+1) ⭐

Open Source Web Crawler for Java

0
WebCollector 🌿
2595 (+0) ⭐

WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.

0
storm-crawler 🌿
627 (+0) ⭐

Scalable web crawler based on Apache Storm

0
365 (+0) ⭐

Open-source Enterprise Grade Search Engine Software

0
sitemapgen4j 🌿
125 (+0) ⭐

SitemapGen4j is a library to generate XML sitemaps in Java.

0
woothee-java 🌿
50 (+0) ⭐

Woothee Java implementation and Hive UDF

0
TACIT 🌿
101 (+0) ⭐

We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components: 1. Crawling plugins 2. Corpus management 3. Analysis plugins. TACIT's open-source plugin platform allows the architecture to easily adapt with the rapid developments text analysis.

0
14 (+0) ⭐

REST and STREAMING crawlers of Twitter (java)

0
serritor 🌿
12 (+0) ⭐

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.

68159 Java libraries
(20802 libraries)
Go
(96074 libraries)
(52298 libraries)
(17256 libraries)
(29174 libraries)
C#
(46548 libraries)
(24378 libraries)
(44166 libraries)
(14269 libraries)
(10288 libraries)
(25309 libraries)
(16454 libraries)
(164133 libraries)
(15873 libraries)
Vue
(15557 libraries)
CSS
(77603 libraries)
(73245 libraries)
(59395 libraries)
(12516 libraries)
C++
(99557 libraries)
C
(82125 libraries)
(48221 libraries)
(44242 libraries)
(11017 libraries)
(68159 libraries)
PHP
(101203 libraries)
(130651 libraries)
(142431 libraries)
(6805 libraries)
Nim
(4497 libraries)
D
(11405 libraries)
(41105 libraries)
(2694 libraries)