Member since
01-18-2016
163
Posts
32
Kudos Received
19
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1399 | 04-06-2018 09:24 PM | |
1424 | 05-02-2017 10:43 PM | |
3909 | 01-24-2017 08:21 PM | |
23898 | 12-05-2016 10:35 PM | |
6570 | 11-30-2016 10:33 PM |
05-02-2017
10:43 PM
If you mean Solr on HDFS, the answer is "it depends." If you have a high number of frequent updates to your index, I usually recommend local storage. On the other hand, if your updates are more batch, and not a constant stream, then using HDFS is a convenient option. If you mean installing Solr on HDF, the only supported option and use case is installing Ambari Infra. The Ambari Infra component is Solr under the covers, but it is only supported for use with HDP and HDF components such as Ranger for User Audit records. There's no support to use Ambari Infra for indexing your own data.
... View more
02-24-2017
03:29 AM
@Tony Bolt It seems like the solr.hdfs.confdir is not being used. It's unlikely to be the problem, have you checked permissions?
... View more
02-03-2017
09:05 PM
@mqureshi made great points. Also, note that you do not have to store any fields in Solr. You can choose True for either or both: stored=true/false, indexed=true/false. Of course if stored=false, you won't see the value in results but you will at a minimum, see the "uniqueKey" which would be your "id" field. You could also look at the HBase Indexer: https://community.hortonworks.com/articles/1181/hbase-indexing-to-solr-with-hdp-search-in-hdp-%2023.html
... View more
01-24-2017
08:21 PM
2 Kudos
I have a solution.... We had this issue, caused by this: https://issues.apache.org/jira/browse/RANGER-1249 To fix it, create a file: /etc/ranger/admin/ranger-admin-env-tz.sh Add this: JAVA_OPTS="$JAVA_OPTS -Duser.timezone=UTC" Now chmod +x /etc/ranger/admin/ranger-admin-env-tz.sh Now restart ranger admin. I know this was asked long ago, but if this solves your problem (which it did for me), please accept the answer.
... View more
12-14-2016
05:00 PM
Hi Maxime, Did you start ambari-agent on each node? Ssh to each node as root (probably) or ambari (depending on how you set it up) and run ambari-agent status and/or ambari-agent-start. I'm pretty sure you will see an error if there is no communication with the agents. If an operation failed, you will see a red line. As you drill in, you will probably see a number of green lines, but you need to find the failed ones which are red and drill into them.
... View more
12-14-2016
04:29 PM
@Maxime Savary - the logs to start each service can be found by looking in Ambari "Ops" button near the top by the "Alerts" button. Drill down to the failed (red) action and eventually you will get to the logs. there are two logs for each operation in this window - stderr and stdout. You don't have to go to the file system. These are the ambari-agent logs for specific operations. The other half of the equation is to look on the server where the service is starting. For example, look at /var/log/hadoop/hdfs/hadoop-hdfs-datanode.log or hadoop-hdfs-namenode.log. These files are going to be on whatever server the service is supposed to run on. Please click Reply rather than entering a new answer when you are replying.
... View more
12-14-2016
02:58 PM
@Maxime Savary - Once ambari-server is up, and ambari-agent is up on each node, in Ambari s you can stop and start the cluster using the Action button below the list of services. https://docs.hortonworks.com/HDPDocuments/Ambari-2.1.0.0/bk_Ambari_Users_Guide/content/_starting_and_stopping_all_services.html If this does not work, you need to look at the logs. You can post logs here and we will try to help.
... View more
12-14-2016
02:35 PM
Did you restart all services? If so, you'll need to look at the logs to see what's going on. Start by looking at the logs in Ambari from the start commands for each service. If that doesn't give answers, you may need to look at the logs on one of the data nodes.
... View more
12-12-2016
06:23 PM
What is your environment? Are you using the HDP Sandbox in VirtualBox?
... View more
12-12-2016
06:23 PM
@Avijeet Dash - Terry made all good points. Note that using SolrCloud does not require using HDFS. SolrCloud can also use local storage and it is not uncommon. Sometimes people misunderstand when we don't point this out. The optimal choice of HDFS vs local depends on the use case, but local storage is usually preferred over HDFS if your index has a high level of updates/adds. SolrCloud automatically replicates your data and is fault tolerant, but still, SolrCloud has the advantages Terry mentioned.
... View more