Member since
09-09-2018
5
Posts
1
Kudos Received
0
Solutions
02-19-2019
10:57 AM
You will need to use both dot notation and join notation: https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_complex_types.html When complex types are nested inside each other, you use a combination of joins, pseudocolumn names, and dot notation to refer to specific fields at the appropriate level. This is the most frequent form of query syntax for complex columns, because the typical use case involves two levels of complex types, such as an ARRAY of STRUCT elements. SELECT id, phone_numbers.area_code FROM contact_info_many_structs INNER JOIN contact_info_many_structs.phone_numbers phone_numbers LIMIT 3; You can express relationships between ARRAY and MAP columns at different levels as joins. You include comparison operators between fields at the top level and within the nested type columns so that Impala can do the appropriate join operation.
... View more
01-02-2019
07:59 AM
I think it is indexed in the internal search index of the Hue service. But this is just a guess.. did not checked the source code.
... View more
09-10-2018
07:03 AM
1 Kudo
I think this part of the certification page may be helpful as to why the documentation is available. I added bold for emphasis on the sentence that applies here.
Exam Question Format
You are given eight to twelve customer problems with a unique large data set, a CDH cluster, and 120 minutes. For each problem, you must implement a technical solution with a high degree of precision that meets all the requirements. You may use any tool or combination of tools on the cluster (see list below) -- you get to pick the tool(s) that are right for the job. You must possess enough knowledge to analyze the problem and arrive at an optimal approach given the time allowed. You need to know what you should do and then do it on a live cluster, including a time limit and while being watched by a proctor.
... View more