Member since 09-18-2015
3274 Posts
1159 Kudos Received
426 Solutions
05-20-2016
03:52 AM
@Ali Bajwa Just created this https://www.linkedin.com/pulse/hawqhdb-hadoop-hive-hbase-neeraj-sabharwal
05-18-2016
12:07 PM
Chronos is a replacement for cron: a fault-tolerant job scheduler for Mesos that handles dependencies and ISO 8601-based schedules.
Marathon is a framework for Mesos that is designed to launch long-running applications and, in Mesosphere, serves as a replacement for a traditional init system.
In Mesosphere, Chronos complements Marathon by providing another way to run applications: according to a schedule or other conditions, such as the completion of another job. It can also schedule jobs on multiple Mesos slave nodes, and it provides statistics about job failures and successes. Source
Install https://mesos.github.io/chronos/docs/ and gist
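To make the schedule and dependency ideas concrete, here is a minimal Python sketch of what two job definitions might look like. It assumes the job fields name, command, schedule and parents, and the repeating-interval form R[n]/start/period; the job names and commands are made up for illustration, and a real job definition may need more fields, so check the Chronos docs linked above before using it.

```python
import json
import re

# Assumed schedule shape: ISO 8601 repeating interval, R[n]/<start>/<period>.
_SCHEDULE = re.compile(r"^R(?P<repeat>\d*)/(?P<start>[^/]*)/(?P<period>P.+)$")


def parse_schedule(schedule):
    """Split a repeating-interval string into repeat count, start, and period."""
    match = _SCHEDULE.match(schedule)
    if match is None:
        raise ValueError("not a repeating interval: %r" % schedule)
    repeat = match.group("repeat")
    return {
        # An empty repeat count is treated here as "repeat forever".
        "repeat": int(repeat) if repeat else None,
        "start": match.group("start"),
        "period": match.group("period"),
    }


def scheduled_job(name, command, schedule):
    """Build a minimal job that runs on a schedule."""
    parse_schedule(schedule)  # fail early on a malformed schedule
    return {"name": name, "command": command, "schedule": schedule}


def dependent_job(name, command, parents):
    """Build a minimal job that runs after its parent jobs complete."""
    return {"name": name, "command": command, "parents": list(parents)}


if __name__ == "__main__":
    # Hypothetical jobs: names and commands are illustrative only.
    nightly = scheduled_job(
        "nightly-export", "echo export", "R/2016-05-18T12:00:00Z/PT24H"
    )
    report = dependent_job("nightly-report", "echo report", [nightly["name"]])
    print(json.dumps([nightly, report], indent=2))
```

The first job repeats every 24 hours from the given start time; the second has no schedule of its own and is tied to the completion of the first.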
05-17-2016
11:31 AM
1 Kudo
Original Post
DC/OS - a new kind of operating system that spans all of the servers in a physical or cloud-based datacenter, and runs on top of any Linux distribution.
Source
Projects
More details https://docs.mesosphere.com/overview/components/
Let's cover Mesos in this post.
Frameworks (applications running on Mesos): http://mesos.apache.org/documentation/latest/frameworks/
I used http://mesos.apache.org/gettingstarted/ to install Mesos on my local machine. In this demo I launch the C++, Java, and Python frameworks.
Slide Share http://www.slideshare.net/tomasbart/introduction-to-apache-mesos
04-15-2016
09:09 AM
4 Kudos
Original Post
Calcite is a highly customizable engine for parsing and planning queries on data in a wide variety of formats. It allows database-like access, and in particular a SQL interface and advanced query optimization, for data not residing in a traditional database.
Apache Calcite is a dynamic data management framework.
It contains many of the pieces that comprise a typical database management system, but omits some key functions: storage of data, algorithms to process data, and a repository for storing metadata.
Calcite intentionally stays out of the business of storing and processing data. As we shall see, this makes it an excellent choice for mediating between applications and one or more data storage locations and data processing engines. It is also a perfect foundation for building a database: just add data. Source
Tutorial https://calcite.apache.org/docs/tutorial.html
Demo:
Read DEPT and EMPS table
Create a test table based on the existing CSV example. Read the tutorial link to understand model.json and the schema.
In the demo, I run explain plan on the queries and then use smart.json to change the plan.
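As a rough guide to what the two model files contain, here is a minimal Python sketch that generates a CSV-adapter model. The factory class name and the directory operand follow the tutorial's CSV example as I recall it; the flavor operand and its "translatable" value are my assumption about what distinguishes smart.json, so verify the exact spelling against the tutorial. The schema name and directory are placeholders.

```python
import json


def csv_model(schema_name, directory, flavor=None):
    """Build a Calcite model dict with one custom schema backed by CSV files."""
    operand = {"directory": directory}
    if flavor is not None:
        # Assumption: a flavor operand selects the table implementation,
        # which in turn changes the plan the optimizer can produce.
        operand["flavor"] = flavor
    return {
        "version": "1.0",
        "defaultSchema": schema_name,
        "schemas": [
            {
                "name": schema_name,
                "type": "custom",
                "factory": "org.apache.calcite.adapter.csv.CsvSchemaFactory",
                "operand": operand,
            }
        ],
    }


if __name__ == "__main__":
    # Placeholder schema name and directory; adjust to your CSV location.
    basic = csv_model("SALES", "sales")
    smart = csv_model("SALES", "sales", flavor="translatable")
    print(json.dumps(basic, indent=2))
    print(json.dumps(smart, indent=2))
```

Each CSV file in the directory becomes a table in the schema, which is how DEPTS and EMPS show up without any table definitions in the model.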
Watch the demo and then read the following links
model.json https://calcite.apache.org/docs/tutorial.html#schema-discovery
Query tuning https://calcite.apache.org/docs/tutorial.html#optimizing-queries-using-planner-rules
Calcite https://calcite.apache.org/
This page describes the SQL dialect recognized by Calcite’s default SQL parser.
Adapters
JDBC driver
Calcite is embedded in Drill, Hive and Kylin.
02-18-2016
03:38 PM
@dtraver Is it OK to share the slide deck in PDF format? Or could you attach the deck to this article?
02-15-2016
08:19 PM
1 Kudo
Use case: A user wants to map the AD group hdpadmin to a queue using the YARN Queue Manager view.
Environment: HDP 2.3.4 and Ambari 2.2.0
The original request was made by an HCC user. Thread link
Question: How to assign a Capacity Scheduler queue based on an AD group.
Solution/Demo:
[root@phdns02 scripts]# id neeraj
uid=29800018(neeraj) gid=29800018(neeraj) groups=29800018(neeraj),29800017(hdpadmin)
[root@phdns02 scripts]#
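For reference, the group-to-queue mapping lives in the yarn.scheduler.capacity.queue-mappings property, which takes comma-separated entries of the form u:user:queue or g:group:queue. Below is a minimal Python sketch of how such a mapping string could resolve the user above; the queue name "admin" is hypothetical, and the resolver only mimics first-match-wins behaviour (it ignores placeholders such as %user), so treat it as an illustration rather than the scheduler's actual logic.

```python
def mapping_entry(kind, name, queue):
    """Format one queue-mapping entry: u:<user>:<queue> or g:<group>:<queue>."""
    if kind not in ("u", "g"):
        raise ValueError("kind must be 'u' (user) or 'g' (group)")
    return "%s:%s:%s" % (kind, name, queue)


def resolve_queue(mappings, user, groups):
    """Return the queue of the first mapping that matches the user or a group."""
    for entry in mappings.split(","):
        kind, name, queue = entry.strip().split(":")
        if kind == "u" and name == user:
            return queue
        if kind == "g" and name in groups:
            return queue
    return None  # no mapping matched; the caller falls back to its default


if __name__ == "__main__":
    # "admin" is a hypothetical queue name; hdpadmin is the AD group from `id neeraj`.
    mappings = mapping_entry("g", "hdpadmin", "admin")
    print("yarn.scheduler.capacity.queue-mappings=" + mappings)
    print(resolve_queue(mappings, "neeraj", ["neeraj", "hdpadmin"]))
```

Because neeraj is a member of hdpadmin, jobs submitted by that user would land in the mapped queue without the user naming it explicitly.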
02-15-2016
03:21 PM
5 Kudos
Use case: We want to control Kafka broker, producer, and consumer policies using Ranger without Kerberos. "What is a recommended way to set-up policies when trying to control access to Kafka over a non-secure channel?"
Original
doc
Demo
I have defined 3 policies as shown below:
Broker, publisher, and consumer access is controlled at the IP level. With one click you can revoke access from the consumer.
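The idea behind IP-level control can be sketched in a few lines of Python. This is not Ranger's API or its policy format: the dictionary layout, the topic name, and the IP addresses are all made up for illustration, and only the access names publish and consume are borrowed from the Kafka policy screen.

```python
import ipaddress


def is_allowed(policy, client_ip, access):
    """Allow the request only if the access type and the client IP both match."""
    if access not in policy["accesses"]:
        return False
    address = ipaddress.ip_address(client_ip)
    return any(
        address in ipaddress.ip_network(ip_range, strict=False)
        for ip_range in policy["ip_ranges"]
    )


if __name__ == "__main__":
    # Hypothetical policy: one consumer host may consume from one topic.
    consumer_policy = {
        "topic": "test",
        "accesses": {"consume"},
        "ip_ranges": ["192.168.1.10"],
    }
    print(is_allowed(consumer_policy, "192.168.1.10", "consume"))

    # Revoking access amounts to removing the IP from the policy.
    consumer_policy["ip_ranges"] = []
    print(is_allowed(consumer_policy, "192.168.1.10", "consume"))
```

Since there is no Kerberos identity to check, the client address is the only thing the policy can match on, which is why each of the three policies is keyed on IP.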
Demo
commands
Happy Hadooping!!!
02-15-2016
03:19 AM
1 Kudo
Bug: https://issues.apache.org/jira/browse/AMBARI-14466
[root@phdns01 ~]# ambari-server start
Using python /usr/bin/python2
Starting ambari-server
Ambari Server running with administrator privileges.
Organizing resource files at /var/lib/ambari-server/resources...
WARNING: setpgid(31734, 0) failed - [Errno 13] Permission denied
Server PID at: /var/run/ambari-server/ambari-server.pid
Server out at: /var/log/ambari-server/ambari-server.out
Server log at: /var/log/ambari-server/ambari-server.log
Waiting for server start....................
Ambari Server 'start' completed successfully.
[root@phdns01 ~]# wget https://issues.apache.org/jira/secure/attachment/12779059/AMBARI-14466.patch
patch -p1 < AMBARI-14466.patch
File to patch: /usr/sbin/ambari_server_main.py
Issue resolved.