Code Repositories

Find and share code repositories
Labels (1)
Super Guru
Repo Description

A web crawler bot written in Spark with Kafka and Tika to replace Nutch. It renders Javascript and processes files with Tika.

https://github.com/uscdataScience/sparkler/wiki/sparkler-0.1

Repo Info
Github Repo URL https://github.com/USCDataScience/sparkler
Github account name USCDataScience
Repo name sparkler
1,517 Views
0 Kudos
Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.
Version history
Last update:
‎12-18-2016 03:00 AM
Updated by:
Contributors
Top Kudoed Authors