Member since
10-06-2015
273
Posts
202
Kudos Received
81
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3129 | 10-11-2017 09:33 PM | |
2663 | 10-11-2017 07:46 PM | |
2025 | 08-04-2017 01:37 PM | |
1786 | 08-03-2017 03:36 PM | |
1679 | 08-03-2017 12:52 PM |
06-22-2017
05:55 AM
1 Kudo
@heta desai Take a look at the below demo using Nifi to fetch the tweets and Spark and Zeppelin for the analysis. https://community.hortonworks.com/articles/30213/us-presidential-election-tweet-analysis-using-hdfn.html
... View more
06-22-2017
12:47 AM
1 Kudo
@mel mendoza Kafka is a message broker so it only receives files/events from publishers and makes them available for consumption by consumers. It does not do any processing. Spark streaming would dictate how files/events are read. Since Spark Streaming does micro-batching it will read several files/events from Kafka and process them together in a micro-batch. I believe this will achieve what you are asking to do, it'll be on the Spark side though, not Kafka. As always, if you find this post helpful, don't forget to "accept" answer.
... View more
06-20-2017
10:21 PM
1 Kudo
@Deepak Rawat In VMware, right click on your sandbox VM, choose settings and then modify the number of CPUs and RAM that are configured to 1 CPU, also set the RAM to half of what your machine has. Now you should be able to start your sandbox VM. Please note that the Sandbox is configured to run with 4 CPUs and 8GB RAM. Running a configuration less than that will cause the sandbox to run very slow and possibly some services/jobs fail. As always, if you find this post helpful, don't forget to "accept" answer.
... View more
06-19-2017
01:44 PM
1 Kudo
@Gaurav Vats You can reset the Ambari admin password by using the below steps. 1. Start your sandbox, ssh using a terminal or the web browser link 2. Run the following commands: ambari-admin-password-reset The following text will appear Please set the password for admin: Please retype the password for admin: 3. After setting and retyping your new password, type the command: ambari-agent restart 4. Ambari Admin password should be reset 5. Open Ambari login page. Verify your new password allows you to login as admin user. As always, if you find this post helpful, don't forget to "accept" answer.
... View more
06-16-2017
02:35 PM
1 Kudo
@Dinesh Das
Prohibition against dataset combinations: https://community.hortonworks.com/content/kbentry/63664/how-to-create-a-ranger-policy-that-prohibits-combi.html
Data Expiry-based access policy: https://community.hortonworks.com/articles/92083/using-atlas-and-ranger-to-enforce-data-expiration.html Location-specific access policies: https://community.hortonworks.com/articles/57314/customizing-ranger-policies-with-dynamic-context.html As always, if you find this post helpful, don't forget to "accept" answer.
... View more
06-14-2017
01:45 PM
1 Kudo
@Qinglin Xia You are correct. When you look at the lineage graph, anything leading up to the entity in consideration (which is highlighted in red) is called "lineage". "Impact" is how that particular entity influences others down the graph.
... View more
06-14-2017
12:00 PM
A side note: You should not partition on any columns with high cardinality such as IDs. You would use bucketing instead
... View more
06-14-2017
11:57 AM
A side note: You should not partition on any columns with high cardinality such as IDs. You would use bucketing instead
... View more
06-14-2017
11:22 AM
@Prakhar Agrawal Take a look at the DistributedMapCache. It allows you to save flowfile data for consumption by processors across nodes and processors. There are two controller components, the DistributesMapCacheServer (runs on one node) and the DistricutedMapCacheClient (runs on all nodes if you've got a cluster). There are also the processors that interact with the DistributedMapCache, namely FetchDistributedMapCache and PutDistributedMapCache Take a look here for some elaboration/documentation: https://nifi.apache.org/docs.html https://community.hortonworks.com/questions/35223/distributedmapcacheclientservice-nifi-wecrawlerxml.html Here for examples using DistributedMapCache: https://github.com/hortonworks-gallery/nifi-templates/tree/master/templates (webcrawler.xml template) http://funnifi.blogspot.ca/2016/04/
... View more
06-11-2017
08:39 PM
@Subrahmanya Oruganti To use PutSQL you need to pass it the built SQL string, as you've already figured. One processor you can use for that is the ReplaceText processor. You can use regex to parmeterize and create the output string using the "Search Value" and "Replacement Value" fields. To find out more about the ReplaceText processor see the link below. http://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.3.0/org.apache.nifi.processors.standard.ReplaceText/index.html As always, if you find this post helpful, don't forget to "accept" answer.
... View more