Member since 06-20-2016

488 Posts
433 Kudos Received
118 Solutions

My Accepted Solutions

| Title | Views | Posted |
|---|---|---|
| | 3604 | 08-25-2017 03:09 PM |
| | 2515 | 08-22-2017 06:52 PM |
| | 4197 | 08-09-2017 01:10 PM |
| | 8977 | 08-04-2017 02:34 PM |
| | 8949 | 08-01-2017 11:35 AM |
			
    
	
		
		
09-22-2016 01:41 AM

Thank you for confirming.
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-21-2016 06:37 PM

Syncsort is a third-party partner of ours. We work very closely with them, but we do not provide support for their products. I suggest you head over to their support site: http://bigdatakb.syncsort.com/Default.htm#KB/FAQ/297_Creating_DMX-h_Job_Tutorial.htm
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-21-2016 10:48 AM
1 Kudo

@Gaurab D Just to be sure: are you using DMX-h or simply DMX? The "h" in the first one stands for Hadoop. That is the one that integrates with Hadoop and YARN.
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-20-2016 07:15 PM
3 Kudos

The HDP 2.5 release notes list Hive 2.1 as a Technical Preview (TP): http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_release-notes/content/tech_previews.html

Hive

New in this release:

- Stored Procedures Client-Side
- LLAP Daemons (HIVE-7193)
- LLAP / Spark security interface
- Hive 2.1

The HDP 2.5 release email to customers stated that Hive 2.1 is TP but that Hive ACID is certified for production with Hive 1.2.1:

Apache Hive

- Includes Apache Hive 1.2.1 for production and Hive 2.1 (Technical Preview) for cutting-edge performance
- Hive LLAP (Technical Preview): Persistent query servers and optimized in-memory caching for blazing fast SQL. Up to 25x faster for BI workloads. 100% compatible with existing Hive workloads
- Hive ACID and Streaming Ingest certified for production use with Hive 1.2.1
- Dynamic user-based security policies for data masking and filtering
- HPL/SQL: Procedural programming within Hive
- Hive View v1.5.0, improved robustness and security
- Parquet format fully certified with Hive 1.2.1 / 2.1

The Hive overview section on Hortonworks.com states (confusingly) that ACID is GA in Hive 2.1, though it originated in 0.14: http://hortonworks.com/apache/hive/#section_3
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-20-2016 05:38 PM

(I am using VirtualBox and do not encounter this problem.)
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-20-2016 04:33 PM
1 Kudo

Awesome, thanks Michael. I just had the same problem. (I had to start HBase as well. Is there any way to know the sequence of dependencies? I understand that Atlas has many.)
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-20-2016 02:00 PM

@Saumitra Buragohain Thank you for putting this into proper perspective before parachuting into the weeds!
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-20-2016 01:36 PM
3 Kudos

Syncsort DMX-h is an excellent ETL tool that is YARN-enabled: http://www.syncsort.com/en/Products/BigData/DMXh
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-20-2016 12:37 PM
2 Kudos

You need to take three approaches, in this order:

1. Minimize your data before the join (e.g. load only the columns needed for the join and the output, and filter before the join), then
2. optimize your joins, then
3. optimize settings (including compressing intermediate results).

For 1, see: https://pig.apache.org/docs/r0.7.0/cookbook.html

For 1 and 2, see: https://pig.apache.org/docs/r0.9.1/perf.html

After performing these optimizations, for 3 see: http://chimera.labs.oreilly.com/books/1234000001811/ch08.html#pig_tuning

Also, be sure you are running Pig on Tez.
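
To make the three approaches concrete, here is a minimal Pig Latin sketch. Everything specific in it is an assumption for illustration: the relation names, paths, schemas, and the `status == 'SHIPPED'` filter are hypothetical, and the replicated join only applies if the second relation is small enough to fit in memory. Treat it as a starting point, not a tuned script.

```pig
-- Sketch only: relation names, paths, and schemas below are hypothetical.

-- Approach 3 (settings): compress intermediate results between jobs.
SET pig.tmpfilecompression true;
SET pig.tmpfilecompression.codec gz;

-- Approach 1 (minimize data): filter early and keep only the columns
-- needed for the join and the output.
orders = LOAD '/data/orders' USING PigStorage(',')
         AS (order_id:long, cust_id:long, amount:double,
             status:chararray, notes:chararray);
shipped     = FILTER orders BY status == 'SHIPPED';
orders_slim = FOREACH shipped GENERATE cust_id, amount;

customers = LOAD '/data/customers' USING PigStorage(',')
            AS (cust_id:long, name:chararray, region:chararray,
                address:chararray);
customers_slim = FOREACH customers GENERATE cust_id, region;

-- Approach 2 (optimize the join): a replicated join loads the relation
-- listed last into memory and avoids a reduce phase. This assumes
-- customers_slim is small; otherwise use the default join.
joined = JOIN orders_slim BY cust_id,
              customers_slim BY cust_id USING 'replicated';

STORE joined INTO '/data/orders_by_region';
```

Assuming the script is saved as `join_example.pig` (a hypothetical file name), it can be run on Tez with `pig -x tez join_example.pig`.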