Member since
05-30-2018
1322
Posts
715
Kudos Received
148
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 4051 | 08-20-2018 08:26 PM | |
| 1952 | 08-15-2018 01:59 PM | |
| 2377 | 08-13-2018 02:20 PM | |
| 4111 | 07-23-2018 04:37 PM | |
| 5021 | 07-19-2018 12:52 PM |
08-06-2016
01:55 PM
1 Kudo
if you want to do it inside docker here is article on how to https://community.hortonworks.com/content/kbentry/42229/sparkpycharmpybuilder-on-docker.html
... View more
08-05-2016
03:18 PM
Can you check the following setting: XAAUDIT.HDFS.IS_ENABLED Flag to enable/disable hdfs audit logging. If the hdfs audit logging is turned off, it will not log any access control to hdfs
... View more
08-05-2016
02:52 PM
I am confused by your last comment. Have you configured ranger and audit?
... View more
08-05-2016
05:04 AM
And i assume you are looking for the audit logs? If so look under audit tab. If you did that and nothing shows up, you need to configure the audit db. Look at link i provided.
... View more
08-04-2016
05:29 PM
I would use the executeprocess processor. You would execute for example a python or shell script to do the logic I shared above. make sense?
... View more
08-04-2016
05:27 PM
"Does ambari support the cluster that is distributed across multiple machines?" As long as the machines are isolated with correct IP routing and forwarding I don't see this as a blocker. Each essentially will require its own DNS/IP which you will have to configure through virtualbox. Each have same ip can get extremely tricky with port forwarding.
... View more
08-04-2016
05:23 PM
@John Jackson I am not aware of any capabilities for multipule secondary namenodes. If that would be possible then I assume enhancements to the journal keeper service will need to be made to keep multipule instances in sync. Would like to see others opinion if that is possible and supported. To the best of my knowledge it is not. One possibility you have is HDFS federation. where you can have multipule name nodes and each once having secondary name node. Each namenode would be responsible for slice of your namespace. here is the architecture Multiple Namenodes/Namespaces In order to scale the name service horizontally, federation uses multiple independent Namenodes/namespaces. The Namenodes are federated; the Namenodes are independent and do not require coordination with each other. The Datanodes are used as common storage for blocks by all the Namenodes. Each Datanode registers with all the Namenodes in the cluster. Datanodes send periodic heartbeats and block reports. They also handle commands from the Namenodes. Users may use ViewFs to create personalized namespace views. ViewFs is analogous to client side mount tables in some Unix/Linux systems. More on federation here.
... View more
08-04-2016
03:29 PM
2 Kudos
@K Hajjar Sharing some thoughts you could try. You may have option to use execute process which would fetch size from file and store size to local disk. Run the loop again and check new size against size stored to local disk. If size has not changed you can routeonstragegy thereafter.
... View more
08-04-2016
02:10 PM
@Joshua Adeleke Ranger supports audit logging hdfs and solr in 2.4.2. More info here. Please advise why you are using hbase for audit logs?
... View more