Code Repositories

Find and share code repositories
Labels (1)
Super Guru
Repo Description

A web crawler bot written in Spark with Kafka and Tika to replace Nutch. It renders Javascript and processes files with Tika.

Repo Info
Github Repo URL
Github account name USCDataScience
Repo name sparkler
0 Kudos
Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.
Version history
Last update:
‎12-18-2016 03:00 AM
Updated by:
Top Kudoed Authors