Java / Crawlers

0
webmagic 🌿
9880 (+3) ⭐

A scalable web crawler framework for Java.

0
crawler4j 🌿
4114 (+0) ⭐

Open Source Web Crawler for Java

0
WebCollector 🌿
2798 (+0) ⭐

WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.

0
415 (+0) ⭐

Open-source Enterprise Grade Search Engine Software

0
storm-crawler 🌿
713 (+0) ⭐

A scalable, mature and versatile web crawler based on Apache Storm

0
sitemapgen4j 🌿
143 (+0) ⭐

SitemapGen4j is a library to generate XML sitemaps in Java.

0
woothee-java 🌿
55 (+0) ⭐

Woothee Java implementation and Hive UDF

0
201 (+0) ⭐

The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)

0
serritor 🌿
19 (+0) ⭐

Serritor is an open source web crawler framework built upon Selenium and written in Java. It can be used to crawl dynamic web pages that require JavaScript to render data.

0
14 (+0) ⭐

REST and STREAMING crawlers of Twitter (java)

83116 Java libraries
(30416 libraries)
Go
(132778 libraries)
(71793 libraries)
(28411 libraries)
(37492 libraries)
C#
(64677 libraries)
(27479 libraries)
(47622 libraries)
(17494 libraries)
(13278 libraries)
(34798 libraries)
(19165 libraries)
(201662 libraries)
(20659 libraries)
Vue
(27612 libraries)
CSS
(108094 libraries)
(138685 libraries)
(84866 libraries)
(20301 libraries)
C++
(137016 libraries)
C
(105384 libraries)
(63603 libraries)
(84939 libraries)
(11972 libraries)
(83116 libraries)
PHP
(119093 libraries)
(189124 libraries)
(265844 libraries)
(10941 libraries)
Nim
(7220 libraries)
D
(13578 libraries)
(47357 libraries)
(3627 libraries)