Member since: 08-28-2014
Posts: 17
Kudos Received: 0
Solutions: 2
My Accepted Solutions
Title | Views | Posted |
---|---|---|
 | 9670 | 03-17-2015 04:26 AM |
 | 3216 | 01-14-2015 08:53 PM |
07-09-2015 06:03 AM
Thanks for the quick response. After I enable the Key-Value Store Indexer service and edit the service-wide morphline (textbox) configuration as you suggested above, should I still maintain the morphlines.conf file in the /etc/hbase-solr/conf directory that I created for batch indexing? In other words, if I create morphlinesX.conf, morphlinesY.conf and morphlinesZ.conf, should I also update the service-wide morphline configuration on the KV Store Indexer? My observation is that when I have those 3 morphline files in the /etc/hbase-solr/conf directory and enable the KV Store Indexer service with the default configuration, the corresponding 3 collections are active and start generating Solr index documents. Further, after I update KV Store Indexer service --> Configuration --> Service-Wide --> Morphlines --> Morphlines File (textbox) and deploy the client configuration, where is the change written? Where can I verify the deployed configuration from the CM admin console? Please clarify.
07-06-2015 05:55 AM
Dear Team, I'm currently using Solr batch indexing on HBase tables to create Solr documents. I have to enable the NRT feature on 3 HBase tables. How do I handle multiple morphlines for the "Key-Value Store Indexer" service? I know the morphlines file holds an array of 'id's, but how do we handle the SOLR_LOCATOR section, where we have to specify the collection name? Can we give a comma-separated list of collection names? I'm using CDH 5.3.1 and CM 5.3.0. At present I have 3 morphline files, say morphlineX.conf, morphlineY.conf and morphlineZ.conf. Thanks, YBSNR
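One possible arrangement for the question above, offered as a sketch and not a confirmed answer: as far as I understand the Lily HBase Indexer that backs the Key-Value Store Indexer, each indexer is described by a small XML file that names the morphline file and a `morphlineId`, while the target collection is supplied per indexer (for example through a `solr.collection` connection parameter when the indexer is registered) instead of as a comma-separated list in SOLR_LOCATOR. The table, morphline id and collection names below are hypothetical placeholders.

```python
import xml.etree.ElementTree as ET

# Mapper class used by the morphline-based HBase indexer.
MAPPER = "com.ngdata.hbaseindexer.morphline.MorphlineResultToSolrMapper"

# Hypothetical mapping: HBase table -> (morphline id, Solr collection).
INDEXERS = {
    "tableX": ("morphlineX", "collectionX"),
    "tableY": ("morphlineY", "collectionY"),
    "tableZ": ("morphlineZ", "collectionZ"),
}


def indexer_xml(table, morphline_id, morphline_file="morphlines.conf"):
    """Return an indexer definition that selects one morphline by id."""
    root = ET.Element("indexer", {"table": table, "mapper": MAPPER})
    ET.SubElement(root, "param", {"name": "morphlineFile", "value": morphline_file})
    ET.SubElement(root, "param", {"name": "morphlineId", "value": morphline_id})
    return ET.tostring(root, encoding="unicode")


for table, (morphline_id, collection) in INDEXERS.items():
    print(indexer_xml(table, morphline_id))
    # Assumption: the collection is passed when the indexer is registered.
    print(f"# connection parameter: solr.collection={collection}")
```

With this layout a single morphlines file can hold all three morphlines, and each indexer definition picks its own by id.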
03-22-2015 11:43 PM
Hi Harsh, The TTL option works well for most tables and cases. However, Flume agents load data into the staging tables continuously. In this case, when we run a compaction, the regions go offline and the data load fails, so I had to turn off major compaction. Can you help me with how to handle major compaction on these tables so that old data is purged using TTL? Thanks
03-17-2015 04:26 AM
Thank you. It looks like TTL is a good option. But I remember that major compaction was running for days. When we kept frequent/periodic compaction enabled, regions were going offline. How can we optimize and control the compactions? To enable TTL, do we have to compromise on region availability? Please guide me.
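For reference, the TTL arithmetic for the 200-day retention mentioned in this thread can be sketched as follows. HBase column family TTLs are given in seconds; the table name `staging_events` and the family name `cf` are hypothetical, and the generated statement is meant to be reviewed before it is pasted into the HBase shell.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def ttl_seconds(days):
    """Convert a retention period in days to a TTL in seconds."""
    return days * SECONDS_PER_DAY


def alter_ttl_statement(table, family, days):
    """Build an HBase shell statement that sets the TTL on one column family."""
    return f"alter '{table}', {{NAME => '{family}', TTL => {ttl_seconds(days)}}}"


# Hypothetical table and column family names.
print(alter_ttl_statement("staging_events", "cf", 200))
# prints: alter 'staging_events', {NAME => 'cf', TTL => 17280000}
```

Expired cells stop being returned to readers once their TTL has passed; the disk space is reclaimed later, when compaction rewrites the store files.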
03-17-2015 12:17 AM
Hi All, Since my Hadoop cluster capacity is low and there is no business need to keep old data, I'm trying to find and delete records older than 200 days in HBase tables. I could not find a tool or ready-to-use program that does this. Can someone suggest the best approach? Should I write an MR job? If yes, is there any pseudocode or algorithm? Thanks
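The age check such a job would need can be sketched as below. This is only the selection logic, under the assumption that cell timestamps are the default write time in epoch milliseconds; the `(row_key, timestamp_ms)` pairs stand in for whatever a real scan would return, and the function names are hypothetical.

```python
from datetime import datetime, timedelta, timezone

RETENTION_DAYS = 200  # retention window from the question


def cutoff_millis(now, days=RETENTION_DAYS):
    """Epoch milliseconds before which a cell counts as too old."""
    return int((now - timedelta(days=days)).timestamp() * 1000)


def expired_row_keys(cells, now):
    """Return row keys whose timestamp is older than the cutoff.

    cells: iterable of (row_key, timestamp_ms) pairs, e.g. from a table scan.
    """
    limit = cutoff_millis(now)
    return [row for row, ts in cells if ts < limit]


now = datetime(2015, 3, 17, tzinfo=timezone.utc)
sample = [("row-old", 0), ("row-new", cutoff_millis(now) + 1)]
print(expired_row_keys(sample, now))  # prints: ['row-old']
```

A real job would issue a delete for each selected row key; restricting the scan to timestamps below the cutoff would avoid reading newer cells at all.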
Labels: Apache Hadoop, Apache HBase
01-14-2015 08:53 PM
Hi Gautam, I'm trying to download the software on the client network, which has a proxy. I don't think the proxy is blocking it, because I was able to download the CDH package from the same website. I was able to download the complete package on my personal laptop, but unfortunately copying files in from outside the client network is not permitted. Thanks, Surya
01-14-2015 08:41 PM
Yes, I cleared the cache, temporary internet files, etc. and tried again. I also tried wget on a Linux box. No luck. Thanks, Surya
01-12-2015 09:58 PM
Thanks Gautam, I tried to download it from archive.cloudera.com as well. No luck; it again stopped at 347,647 KB. http://archive.cloudera.com/cm5/repo-as-tarball/5.3.0/cm5.3.0-sles11.tar.gz Are there any alternative mirrors for the CM download? The CDH download was successful.
01-12-2015 08:10 PM
Hi, I'm trying to download the Cloudera Manager 5.3.0 repo as a tarball from the following URL: http://archive-primary.cloudera.com/cm5/repo-as-tarball/5.3.0/cm5.3.0-sles11.tar.gz The original size is 680 MB, but the download stops after about 50% of that, at 347,647 KB. I tried several times to download it on a UK-based Windows 7 machine, intending to copy it to a Linux box with WinSCP afterwards, but every time the download stopped at 347,647 KB. Are there alternative URLs or mirrors to download it from? Thanks, Surya
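One thing that may be worth trying when a transfer always stops at the same offset is to resume it from the partial file instead of starting over. The sketch below does this with an HTTP Range request. It assumes the server honours range requests, which is not confirmed here, and it does not explain why the transfer stops; the function names are hypothetical.

```python
import os
import urllib.request


def range_header(bytes_on_disk):
    """Build a Range header asking for everything after the bytes already saved."""
    return {"Range": f"bytes={bytes_on_disk}-"} if bytes_on_disk > 0 else {}


def resume_download(url, dest, chunk_size=1 << 20):
    """Append the missing part of url to dest and return the final file size."""
    have = os.path.getsize(dest) if os.path.exists(dest) else 0
    request = urllib.request.Request(url, headers=range_header(have))
    with urllib.request.urlopen(request) as response:
        # 206 means the server sent only the requested range;
        # 200 means it sent the whole file again, so start from scratch.
        mode = "ab" if response.status == 206 else "wb"
        with open(dest, mode) as out:
            while True:
                chunk = response.read(chunk_size)
                if not chunk:
                    break
                out.write(chunk)
    return os.path.getsize(dest)


# Example call (not executed here):
# resume_download("http://archive.cloudera.com/cm5/repo-as-tarball/5.3.0/cm5.3.0-sles11.tar.gz",
#                 "cm5.3.0-sles11.tar.gz")
```

If the partial file is already complete, the server is likely to answer with an error status, which `urlopen` raises as an exception. On the Linux box, `wget -c` with the same URL performs the same kind of resume.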
Labels: Cloudera Manager