Member since
09-24-2015
816
Posts
488
Kudos Received
189
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 3173 | 12-25-2018 10:42 PM | |
| 14198 | 10-09-2018 03:52 AM | |
| 4764 | 02-23-2018 11:46 PM | |
| 2481 | 09-02-2017 01:49 AM | |
| 2914 | 06-21-2017 12:06 AM |
05-09-2016
10:54 AM
2 Kudos
I've been advising users to truncate the table by removing entries older than 1 or 2 months. The DB is used to support search in Ranger UI, and the events are independent, so, IMO, no harm is done. They are also storing audit data in hdfs as well, and all data remains available there if needed. When you truncate the table by date, you can use the event_time field, for example in case of Mysql delete from xa_access_audit where event_time < DATE_SUB(now(), INTERVAL 1 MONTH);
... View more
05-09-2016
09:46 AM
In your Avro table declaration in Hive, can you try to replace VARCHAR(n) with STRING? I just tried to import as Avro one of my tables from Mysql with 3 columns: int, varchar(16) and timestamp. When I declared my varchar(16) as string in Hive it worked, when I used varchar(16) in Hive I got an error when I tried "SELECT *" and it failed. Also, on import, Sqoop creates an .avsc file in your current directory, you can inspect that file to find out what types Sqoop used when creating Avro files.
... View more
05-09-2016
07:17 AM
Okay, no problems, please consider to accept the answer, to help us manage answered questions. Tnx!
... View more
05-09-2016
05:45 AM
Yes, that's correct. There is also a script /usr/lib/python2.6/site-packages/ambari_agent/HostCleanup.py which you could use to remove users, and remove folders on each node, but you have already done that manually. You can still run it though. Also make sure to remove hdp-select on each node: "yum remove -y hdp-select". After that you have to recreate Mysql ambari database, and Oozie database if you are using Mysql. After all that, run "ambari-server setup" and "start" and when you open Ambari next time you will be able to create a new cluster.
... View more
05-08-2016
09:58 PM
You haven't cleared Ambari DB. Stop ambari-server and do this
ambari-server reset # answer yes to all prompts
dropdb -U ambari ambari # default PW is "bigdata"
ambari-server setup # continue with Ambari install
... View more
05-06-2016
03:34 PM
Yeah, doesn't look good, can you try to "get" one avro file to your local files system and check is it correct. You can use Avro Tools. Or if Avro is not a must, import the table as text file, and explore the Avro path later.
... View more
05-06-2016
01:23 PM
Okay, that's exactly what Ambari does for you when you use "localhost" (though Ambari is using FQDNs not IPs).
... View more
05-06-2016
01:19 PM
2 Kudos
Q1) Why only two Nodes have all the Mapper tasks running. Why are the other nodes not running any mapper? A: Mappers and reducers only run in Yarn containers on nodes running Node Manager. Click on your Yarn service in Ambari, and in the Summary tab check how many NMs do you have. You most likely have only 2 NMs on Machine1 and Machine2. Now, why only 2 mappers per machine? That depends on your Yarn and MapReduce settings. If no other jobs are running it means each node can run only 2 mappers at a time. To confirm check your yarn.nodemanager.resource.memory-mb in Yarn, and mapreduce.map.memory.mb in Mapreduce. Q2) If one mapper is per 128 MB file block, why the mapper task on machine 1 is showing 21627027 byes (21120 MB) of data ? A: 21627027 bytes is 21,627,027 or about 21M, not 21120M and so less than 128M. Note also that all blocks are not 128M, some are smaller (if a file has 150M one block will be "full" 128M, another one will be only 22M).
... View more
05-06-2016
12:49 PM
1 Kudo
Try this -- Imported table
CREATE EXTERNAL TABLE DimSampleDesc_avro(SmapiName_ver varchar(30),...) STORED AS AVRO LOCATION '/dataload/tohdfs/reio/odpdw/may2016/DimSampleDesc';
-- External ORC table
CREATE EXTERNAL TABLE DimSampleDesc(SmapiName_ver varchar(30), ...) STORED AS ORC LOCATION '/datastore/hdfstohive/reio/odpdw/may2016';
INSERT OVERWRITE TABLE DimSampleDesc SELECT * FROM DimSampleDesc_avro;
... View more