Java / Crawlers

0
webmagic πŸ‚
8616 (+3) ⭐

A scalable web crawler framework for Java.

0
WebCollector 🌿
2538 (+0) ⭐

WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.

0
crawler4j 🌿
3741 (+0) ⭐

Open Source Web Crawler for Java

0
storm-crawler 🌿
597 (+0) ⭐

Scalable web crawler based on Apache Storm

0
351 (+0) ⭐

Open-source Enterprise Grade Search Engine Software

0
sitemapgen4j 🌿
122 (+0) ⭐

SitemapGen4j is a library to generate XML sitemaps in Java.

0
woothee-java 🌿
48 (+0) ⭐

Woothee Java implementation and Hive UDF

0
TACIT 🌿
99 (+0) ⭐

We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components: 1. Crawling plugins 2. Corpus management 3. Analysis plugins. TACIT's open-source plugin platform allows the architecture to easily adapt with the rapid developments text analysis.

0
12 (+0) ⭐

REST and STREAMING crawlers of Twitter (java)

0
serritor 🌿
12 (+0) ⭐

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.

64686 Java libraries
(19110 libraries)
Go
(88361 libraries)
(48259 libraries)
(15059 libraries)
(27814 libraries)
C#
(42806 libraries)
(23578 libraries)
(43157 libraries)
(13718 libraries)
(9691 libraries)
(23822 libraries)
(16030 libraries)
(153671 libraries)
(14353 libraries)
Vue
(12975 libraries)
CSS
(71503 libraries)
(61821 libraries)
(54344 libraries)
(10986 libraries)
C++
(91706 libraries)
C
(76662 libraries)
(44720 libraries)
(36300 libraries)
(10788 libraries)
(64686 libraries)
PHP
(96432 libraries)
(119328 libraries)
(120206 libraries)
(6177 libraries)
Nim
(3864 libraries)
D
(10938 libraries)
(39393 libraries)
(2447 libraries)