Code Repositories

Repo Description
  • ambari_blueprints.py - Ambari Blueprint tool using Ambari API
  • hadoop_hdfs_time_block_reads.jy - Hadoop HDFS per-block read timing debugger
  • hadoop_hdfs_files_native_checksums.jy - fetches native HDFS checksums via API
  • hadoop_hdfs_files_stats.jy - fetches HDFS file stats via API
  • pig-text-to-elasticsearch.pig / pig-text-to-solr.pig - bulk-index unstructured files in Hadoop into Elasticsearch or Solr/SolrCloud clusters, using Pig with Jython UDFs
  • ipython-notebook-pyspark.py - per-user authenticated IPython Notebook + PySpark integration
  • spark-json-to-parquet.py
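
As an illustration of what "fetches HDFS file stats via API" can look like, here is a minimal sketch that uses the WebHDFS REST endpoint (op=GETFILESTATUS). It is not code from the repository: the listed scripts are Jython and may use a different API. The function names, the host name, and the port in the usage note are assumptions made for this example.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import urlopen


def webhdfs_url(host, port, path, op="GETFILESTATUS"):
    """Build a WebHDFS REST URL for an absolute HDFS path (hypothetical helper)."""
    if not path.startswith("/"):
        raise ValueError("HDFS path must be absolute")
    # quote() keeps "/" intact and percent-encodes spaces and other unsafe characters
    return "http://{}:{}/webhdfs/v1{}?{}".format(
        host, port, quote(path), urlencode({"op": op})
    )


def parse_file_status(body):
    """Pick a few fields out of a GETFILESTATUS JSON response body."""
    status = json.loads(body)["FileStatus"]
    return {
        "type": status["type"],              # "FILE" or "DIRECTORY"
        "length": status["length"],          # size in bytes
        "replication": status["replication"],
        "block_size": status["blockSize"],
    }


def fetch_file_status(host, port, path):
    """Query a NameNode over HTTP; needs a reachable cluster with WebHDFS enabled."""
    with urlopen(webhdfs_url(host, port, path)) as resp:
        return parse_file_status(resp.read().decode("utf-8"))
```

For example, fetch_file_status("namenode.example.com", 9870, "/data/file.txt") would query a placeholder NameNode; substitute the host and HTTP port of your own cluster.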
Repo Info
GitHub Repo URL: https://github.com/harisekhon/pytools
GitHub account name: harisekhon
Repo name: pytools