Member since
09-29-2015
44
Posts
33
Kudos Received
8
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1493 | 03-07-2017 02:30 PM | |
720 | 12-14-2016 02:53 AM | |
6758 | 12-07-2015 03:58 PM | |
2554 | 11-06-2015 07:40 PM | |
1141 | 10-26-2015 05:59 PM |
06-16-2016
05:35 AM
Same issue Mark reported on HDP 2.4 Sandbox using sqoop import on a single table. Example command sqoop import --connect jdbc:mysql://192.168.1.17:3306/test --username drice --password hadoop --table client --hive-table default.client --hive-import -m 1 NOTE Marks workaround worked new command sqoop import --connect jdbc:mysql://192.168.1.17:3306/test --username drice --password hadoop --table client --hive-table default.client --hive-import -m 1 --driver com.mysql.jdbc.Driver
... View more
04-07-2016
01:38 PM
1 Kudo
Tom is correct, but if using virtual client such as vmware fusion you'll have to do it differently. Grab the MAC address of the VM by selecting the VM->Virtual Machine->Settings->Network Adapter->Advanced Options and copy the MAC Address vi /Library/Preferences/VMware Fusion/vmnet8/dhcp.conf add the line at the bottom of the conf file host hdp24 { hardware ethernet 00:0C:29:42:61:D7; fixed-address 192.168.245.133; }
... View more
03-22-2016
04:39 PM
@Ram Note that disks are required for NN also. See post related to sizing of NN. https://community.hortonworks.com/questions/1692/any-recommendation-on-how-to-partition-disk-space.html#answer-1762
... View more
12-07-2015
03:58 PM
2 Kudos
Hey @Wes Floyd , not that I'm aware of but if you look at RMP-3737 looks like this is coming in DAL-M20 or HDP 2.3.4
... View more
11-25-2015
08:43 PM
Yes syncsort is a partner look here.
... View more
11-25-2015
08:41 PM
@pbalasundaram I've seen a customer use JRecord to build a mapreduce inputformat for HDFS, mapreduce, etc. Look around within github and you should see examples. Obviously this would take more work than just using something off the shelf. Besides syncsort, there are capabilities within attunity also.
... View more
11-12-2015
01:40 AM
1 Kudo
Martin, I have some contacts over at PepperData that I can introduce you to. Also I do know of a customer successfully using PepperData. Give me a call tomorrow to discuss.
... View more
11-06-2015
07:40 PM
1 Kudo
@vsomani@hortonworks.com Similar question here Answer I copied below. each HDFS block occupies ~250 bytes of RAM on NameNode (NN), plus an additional ~250 bytes will be required for each file and directory. Block size by default is 128 MB so you can do the calculation pertaining to how much RAM will support how many files. To guarantee persistence of the filesystem metadata the NN has to keep a copy of its memory structures on disk also the NN dirs and they will hold the fsimage and editlogs. Editlogs captures all changes that are happening to HDFS (such as new files and directories), think redo logs that most RDBM's use. The fsimage is a full snapshot of the metadata state. The fsimage file will not grow beyond the allocated NN memory set and the edit logs will get rotated once it hits a specific size. It always safest to allocate significantly more capacity for NN directory then needed example say 4 times what is configured for NN memory, but if disk capacity isn't an issue allocate 500 GB+ if can spare (more capacity is very common especially when setting up a 3+3 or 4+4 RAID 10 mirrored set). Setting up RAID at the disk level like RAID1 or RAID 1/0 makes sense and thus having RAID allows for a single directory to be just fine.
... View more
10-26-2015
05:59 PM
Got this information from Sai Nukavarapu and wanted to add it back to the thread in case anyone else needs to have more context to these meanings. - What MemorySeconds means and vCoreSeconds mean. 1. memorySeconds - The amount of memory the application has allocated (megabyte-seconds); Aggregated amount of memory (in megabytes) the application has allocated X the number of seconds the application has been running. 2. The amount of CPU resources the application has allocated (virtual core-seconds) Aggregated number of vcores that the application has allocated X the number of seconds the application has been running.
... View more
10-21-2015
03:38 AM
1 Kudo
@hfaouaz@hortonworks.com - each HDFS block occupies ~250 bytes of RAM on NameNode (NN), plus an additional ~250 bytes will be required for each file and directory. Block size by default is 128 MB so you can do the calculation pertaining to how much RAM will support how many files. To guarantee persistence of the filesystem metadata the NN has to keep a copy of its memory structures on disk also the NN dirs as you mentioned and they will hold the fsimage and editlogs. Editlogs captures all changes that are happening to HDFS (such as new files and directories), think redo logs that most RDBM's use. The fsimage is a full snapshot of the metadata state. The fsimage file will not grow beyond the allocated NN memory set and the edit logs will get rotated once it hits a specific size. It always safest to allocate significantly more capacity for NN directory then needed say 4 times what is configured for NN memory, but if disk capacity isn't and issue allocate 500 GB+ if can spare (more capacity is very common especially when setting up a 3+3 or 4+4 RAID 10 mirrored set). Setting up RAID at the disk level like RAID1 or RAID 1/0 makes sense and thus having RAID allows for a single directory to be just fine.
... View more
- « Previous
-
- 1
- 2
- Next »