Member since
03-16-2016
707
Posts
1753
Kudos Received
203
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
5126 | 09-21-2018 09:54 PM | |
6489 | 03-31-2018 03:59 AM | |
1968 | 03-31-2018 03:55 AM | |
2176 | 03-31-2018 03:31 AM | |
4816 | 03-27-2018 03:46 PM |
07-29-2016
02:21 PM
4 Kudos
@knatarasan If by "HDP 2.2.9" you mean Hortonworks Data Platform 2.2.9, then the response is No. The Web UI comes with Hive 2.0 which is only available in the HDP 2.5 Technical Preview sandbox. The Web UI has been implemented by this ticket: https://issues.apache.org/jira/browse/HIVE-12338. If this response is acceptable, please vote/accept it as best answer.
... View more
07-29-2016
02:01 PM
5 Kudos
@Jay Kumar No, unless the user has its own leaf queue, which is not the case here. This a feature that everyone wants to see added. Anyway, look at https://community.hortonworks.com/questions/13296/yarn-que-utilization-report.html to see how you can see the resource utilization for a given period per queue. One work around for you would be to create a leaf queue for each user in each parent queue. It is ugly, but it will address your needs until the expected feature is in place. If this response is acceptable, please accept it as best answer.
... View more
07-29-2016
09:19 AM
9 Kudos
Introduction This
article is a complement to Geo-spatial Queries with Hive using ESRI Geometry and Spatial Framework for Hadoop and includes a few more findings, mainly documenting the differences
between ST_Geometry functions supported in Hive and those in commercial spatial packages for Oracle, SQL Server or Netezza. The Hive UDF's are modeled after existing
implementations of ST_Geometry. Some functions exist only in Hive’s
implementation, a few behave different or don’t exist. Additional ST_Geometry Functions in Hive ST_GeomFromJSON - Create a geometry from Esri
JSON ST_AsJSON -
Return Esri JSON representation of geometry ST_GeomFromGeoJSON - Create a geometry from GeoJSON ST_AsGeoJSON -
Return GeoJSON representation of geometry (requires geometry-api-java version
1.1) ST_PointZ -
constructor for 3D points from direct position ST_SetSRID -
set the spatial-reference ID, for a geometry constructed without it ST_GeodesicLengthWGS84 - geodesic length in meters
rather than in angles ST_Bin, ST_BinEnvelope - aggregate into
regular grid, tutorial here ST_Geometry Functions with Different Behavior Overloaded
constructors - These overloaded constructors differ from other
ST_Geometry implementations in how the caller can specify the spatial-reference
ID. Default SRID is plane, when the SRID is not specified. Hive does not accept SRID in second argument - wrap with ST_SetSRID or use ST_GeomFromText. Applies to ST_Point, ST_LineString, ST_Polygon, ST_MultiPoint, ST_MultiLineString, ST_MultiPolygon. ST_PointN - Return type varies in
the case of index out of range - Hive: null; ST_AsText -
The OGC WKT standard dictates that a MultiPoint is represented as MULTIPOINT
((1 2),(3 4)); however some
existing WKT parsers accept only MULTIPOINT (1 2, 3 4). ST_AsText outputs the former,
compliant format, with the nested parentheses. ST_Envelope -
In the case of a point or a vertical or horizontal line, ST_Envelope may either
apply a tolerance or return an empty envelope. ST_Intersection - In the case where the two
geometries intersect in a lower dimension, ST_Intersection may drop the
lower-dimension intersections, or output a closed linestring. ST_Intersection(ST_Polygon(2,0,3,1, 2,1), ST_Polygon(1,1, 4,1, 4,4, 1,4))) -- MULTIPOLYGON EMPTY
or
LINESTRING(2 1, 3 1, 2 1) ST_Union -
may drop lower-dimension members of the union ST_Union(ST_LineString(2,3,4,5), ST_Point(1,1)) -- MULTILINESTRING ((2 3, 4 5)) ST_SymmetricDiff - Hive-spatial follows the naming
in the Esri implemention of ST_Geometry. For the OGC naming, use an alias: create temporary function ST_SymDifference as 'com.esri.hadoop.hive.ST_SymmetricDiff'; ST_Geometry Functions Not Supported in Hive ST_GeomCollection, ST_NumGeometries, ST_GeometryN -
collection of varying geometry types is not supported; hive supports arrays ST_Geometry - no constructor of this name -
use one of the other constructors, e.g. ST_GeomFromText ST_Curve, ST_Surface, ST_MultiCurve, ST_MultiSurface -
Curve and Surface constructors not supported ST_PointOnSurface - ST_PointOnSurface is not
supported on Hive ST_GeoSize - ST_GeoSize is not supported on
Hive ST_Transform - ST_Transform is not supported
on Hive ST_Equalsrs - ST_Equalsrs is not supported on
Hive Key Resources Complete list of functions
supported in Hive: https://github.com/Esri/spatial-framework-for-hadoop/blob/master/hive/function-ddl.sql UDF documentation: https://github.com/Esri/spatial-framework-for-hadoop/wiki/UDF-Documentation Tips To share functions globally across
sessions create them without “temporary” option. This has the advantage that you do not need to declare the functions for every session. create function ST_AsBinary as'com.esri.hadoop.hive.ST_AsBinary' You can also include the jar file in the create
function statement which makes it easier to create a permanent declaration. For example, for the definition of
the ST_Point function you would write the following SQL statement: create function ST_Point as ‘com.esri.hadoop.hive.ST_Point‘ using jar ‘hdfs://YourHDFSClientNode:8020/esri/spatial-sdk-hive-1.1.1-SNAPSHOT.jar’;
Final Notes As discussed with ESRI recently, there are no plans to open source all spatial functions currently available for
traditional RDBMS like Oracle, SQL Server, or Netezza, as those are commercially
licensed packages. The best option to compensate for the 5-10% missing
functions is to contribute to ESRI’s open source repository: https://github.com/Esri/spatial-framework-for-hadoop. ESRI does not provide a commercial library
for Hive including all spatial functions. Thanks to Mansour Raad, Big Data Expert at ESRI for his candide and valuable input. Check Mansour's last article: Geospatial Big Data - the next big trend in analytics
... View more
Labels:
07-28-2016
05:42 PM
5 Kudos
@Srinivas Santhanam Add semicolon at the end and make sure that your ambari user is mapped to an OS user that has access to the path add jar /tmp/udfs/esri-geometry-api.jar; add jar /tmp/udfs/spatial-sdk-hadoop.jar; My suggestion is to place these libraries in HDFS with your Ambari user that has hdfs priviles. Such you can have access to the libraries from any node with hive client. Example: add jar hdfs://whateverhostname:8020/tmp/esri/esri-geometry-api.jar; If the response addresses your problem, don't forget to vote and accept best answer. It is a stimulus for the effort. Thank you.
... View more
07-28-2016
05:24 PM
3 Kudos
@k v vittal hebbar I assume that lines as you actually wrote are like the below (in two lines with semicolon at the end): ADD JAR /tmp/udfs/esri-geometry-api.jar; ADD JAR /tmp/udfs/spatial-sdk.jar; If that is true, then check whether your ambari view user is mapped to an OS user that has privileges on /tmp/udfs/ located files. Example: You also have a better option to add these libraries to HDFS as such you can access them from any node with a Hive client. add jar hdfs://whateverhost:8020/tmp/esri/esri-geometry-api.jar; Try the hdfs approach. If this addresses your response please vote and accept it as the best answer.
... View more
07-28-2016
02:20 AM
5 Kudos
@Carles San Agustin You need to increase your OS ulimit. Most likely you have some tables with multiple partitions and processes that access them. You will need to restart your servers and change the ulimit on all nodes. This requires downtime. It is a good practice to do it upfront estimating how the cluster will be used in regard to file descriptors. See this: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.0/bk_installing_manually_book/content/ref-729d1fb0-6d1b-459f-a18a-b5eba4540ab5.1.html Also section 1.2.8 here. I cannot tell you what is the magic number for you, it depends on what you do and what the servers can provide as resources, but I have seen ulimit being set from tens of thousands to hundreds of thousands. The minimum requirement for installing Hortonworks Data Platform is 10,000. Try various numbers. If this response helps, please vote/accept it as the best answer.
... View more
07-28-2016
02:10 AM
3 Kudos
@kanwar singh user is a reserved word in hive. You need to use escape characters for it. They are the back ticks, e.g. `user`. Since you are passing the statement as a string, you may have to escape the back-tick ` as well using a back-slash, \`. Additionally, commands issued in hive require a semi-colon at the end. To test the above theory just try to drop a table that does not have a reserved keyword in hive, e.g. MyTable. If this helps, please vote/accept answer.
... View more
07-26-2016
06:41 PM
@Prakash Punj Good lesson learned for all of us. It is a good practice to save a file with a name like hue.ini.old for anything unix. Hue is not the only that reads all .ini files from a directory.
... View more
07-25-2016
11:27 PM
2 Kudos
@sidharth mishra This is just a WARN and it is kind of expected by design. org.apache.zookeeper.server.quorum.QuorumCnxManager class implements a connection manager for leader election using TCP. It maintains one connection for every pair of servers. The tricky part is to guarantee that there is exactly one connection for every pair of servers that are operating correctly and that can communicate over the network. If two servers try to start a connection concurrently, then the connection manager uses a very simple tie-breaking mechanism to decide which connection to drop based on the IP addressed of the two parties. For every peer, the manager maintains a queue of messages to send. If the connection to any particular peer drops, then the sender thread puts the message back on the list. As this implementation currently uses a queue implementation to maintain messages to send to another peer, we add the message to the tail of the queue, thus changing the order of messages. Although this is not a problem for the leader election, it could be a problem when consolidating peer communication. If this response clarifies it, vote/accept it as the answer.
... View more