Member since 
    
	
		
		
		06-16-2014
	
	
	
	
	
	
	
	
	
	
	
	
	
	
			
      
                40
            
            
                Posts
            
        
                2
            
            
                Kudos Received
            
        
                1
            
            
                Solution
            
        My Accepted Solutions
| Title | Views | Posted | 
|---|---|---|
| 11621 | 07-24-2014 08:36 PM | 
			
    
	
		
		
		10-09-2014
	
		
		07:16 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							    There are also hue.ini files like /var/run/cloudera-scm-agent/process/XXXX-hue-BEESWAX_SERVER/hue.ini.     Maybe these are the exact configure files for hue? But each time I restart the hue service, it will generate a new one such hue.ini file, and the changes in files like /opt/cloudera/parcels/CDH-4.7.0-1.cdh4.7.0.p0.40/etc/hue/hue.ini have no effect on these files. 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		10-09-2014
	
		
		06:43 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							    Thanks for reply.     There are a number of hue.ini files in my system. I'm not sure which one is the configure file in use.     /opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/share/hue/desktop/conf/hue.ini  /opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/etc/hue/hue.ini  /opt/cloudera/parcels/CDH-4.7.0-1.cdh4.7.0.p0.40/share/hue/desktop/conf/hue.ini  /opt/cloudera/parcels/CDH-4.7.0-1.cdh4.7.0.p0.40/etc/hue/hue.ini  /opt/cloudera/parcels/CDH-4.6.0-1.cdh4.6.0.p0.26/share/hue/desktop/conf/hue.ini  /opt/cloudera/parcels/CDH-4.6.0-1.cdh4.6.0.p0.26/etc/hue/hue.ini     Is there any way I can figure out which one HiveServer is used by Beeswax now? 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		10-09-2014
	
		
		04:52 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							    Maybe I should give some details about my case.     I'm now using CDH4.7. When I download the result of query in Beeswax, sometimes the hue service would crash. Therefore, I'm thinking about using HiveServer2 to access hive data through JDBC. However, I'm not sure that if starting HiveServer2 would has any unwanted effect on Beeswax.     Since I know little about HiveServer2, so maybe some of my ideas about that is totally incorrect. 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		10-09-2014
	
		
		04:40 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							    Hi, I have some question about HiveServer and Beeswax.     As far as I known, Hive CLI access hive via HiveServer1 while Beeline though HiveServer2.  I wonder how Beeswax access hive? Using HiveServer1 or HiveServer2?     Is that HiveServer1 always runing? Or it only starts when Hive CLI launches?     If I start HiveServer2 as a service, are there any conflicts between these two servers that would lead to the failure of Beeswax? 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
			
	
					
			
		
	
	
	
	
				
		
	
	
- Labels:
- 
						
							
		
			Apache Hive
			
    
	
		
		
		09-15-2014
	
		
		02:34 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 Thanks so much for resolving my long time confusion!     I know that HAR can lead to smaller metadata, however, I still do not understand why HAR can save disk space.     8 1m size files would occupy 8 1m HDFS blocks, and the disk space used is 24m. HAR combines these files into a 8m har file occupying one 8m block, but the disk space used is still 24m. Or is any kind of compression used in HAR?       
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		09-15-2014
	
		
		01:32 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							    If I use HAR to archive these 8 files, would they be placed into one HDFS block (assuming that they are all less than 1m) ?     If it is true, I can save 7/8 disk space in this case.    
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		09-15-2014
	
		
		01:25 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 Thanks for your reply.     The -ls command tells me the size of the file, but what I want to know is the occupied disk space. The jar file is 3922 bytes long, but it actually occupy one HDFS block (128M) according to your first anwser. Is it right?     Is there any way I can check the actual occupied space?   
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		09-14-2014
	
		
		10:57 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							    The HDFS block size in my system is set to be 128m. Does it mean that if I put 8 files less than 128m to HDFS, they would occupy 3G disk space (replication factor = 3) ?  When I use "hadoop fs -count ", it only show the size of files. How could I know the actual occupied space of HDFS file ?     And how about I use HAR to archive these 8 files ? Can it save some space ? 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
			
	
					
			
		
	
	
	
	
				
		
	
	
- Labels:
- 
						
							
		
			HDFS
			
    
	
		
		
		08-17-2014
	
		
		11:59 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							Since upgrading CM needs more testing, I downgraded impala to 1.0 instead.    Now using impala through impala-shell is ok.    However, the impala query component in hue is still not working. It can recognize all the database created in hive, but always return "No server logs for this query" to all queries.
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		 
        






