Member since 09-12-2015
15 Posts
2 Kudos Received
0 Solutions
			
    
	
		
		
03-09-2017 05:31 PM
Scala and Python are both easy to program in and help data experts become productive quickly. Data scientists often learn both for Spark, but Python is usually the second-favourite language for Apache Spark because Scala support was there first.
						
					
    
	
		
		
10-19-2016 06:20 AM
1 Kudo
I am facing a problem where disk writes on the log server are very slow because of heavy write operations. I am using rsync to copy logs to the log server, and the log server cannot write fast enough. Any suggestions for making log writes faster? The requirement is to run 600+ rsync processes copying logs to the log server, with each log folder larger than 8 GB and containing more than 9k files.
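To put the stated requirement in numbers, here is a rough back-of-the-envelope sketch in Python. It uses only the figures from the post (600 rsync processes, 8 GB and 9k files per folder) as lower bounds. The function name `aggregate_load` and the assumption of exactly one log folder per rsync process are mine, not something the post states.

```python
from typing import Tuple


def aggregate_load(processes: int, gb_per_folder: float, files_per_folder: int) -> Tuple[float, int]:
    """Return (total GB, total file count).

    Assumption (not from the post): each rsync process copies one log folder.
    """
    total_gb = processes * gb_per_folder
    total_files = processes * files_per_folder
    return total_gb, total_files


# Lower-bound figures taken from the post: 600+ processes, 8+ GB, 9k+ files.
total_gb, total_files = aggregate_load(600, 8, 9_000)
print(f"at least {total_gb:,.0f} GB across {total_files:,} files")
```

With this many files, per-file overhead may matter as much as raw throughput, though that depends on the filesystem and hardware, which the post does not describe.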
						
					
    
	
		
		
10-18-2016 10:43 AM
Problem: deleting a log folder with the following details:
- Log size is around 10 GB.
- Number of files is around 9k.
A simple rm -rf on the log folder doesn't work well.
						
					
    
	
		
		
08-20-2016 12:28 AM
@Constantin Stanca did you downvote the question? If yes, why?
						
					
    
	
		
		
08-20-2016 12:22 AM
@Sunile Manjee thanks a lot for the detailed answer.
						
					
    
	
		
		
08-19-2016 01:10 AM
No, I am not sure which one to use, Hive or Solr, so that is why I tagged both.
						
					
    
	
		
		
08-18-2016 03:48 PM
I am facing an issue with a huge MySQL table that is growing very fast. What are the alternatives for scaling it? Some of the columns are of type blob because they hold raw data. What is the right approach for querying faster?

Some more background on the problem:
- Data is growing fast.
- About 90% of the time, queries target the last 3 weeks of data.
- Older data is not queried frequently.
- Some of the columns hold raw data (blob type) and are frequently used in queries.

Finally, I have to query the data and expose it via a REST API. Can somebody help me with a complete design, with some explanations?
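The access pattern described above (most queries on the last 3 weeks, older data rarely read) can be pictured as a simple hot/cold split. The Python sketch below only illustrates that idea and is not a recommended design. The names `HOT_WINDOW` and `tier_for` and the "hot"/"cold" labels are hypothetical, and treating "last 3 weeks" as a fixed 21-day window is an assumption.

```python
from datetime import datetime, timedelta

# Assumption: "last 3 weeks" from the post, treated as a fixed 21-day window.
HOT_WINDOW = timedelta(weeks=3)


def tier_for(created_at: datetime, now: datetime) -> str:
    """Return 'hot' for rows inside the recent window, 'cold' otherwise."""
    return "hot" if now - created_at <= HOT_WINDOW else "cold"


# Hypothetical timestamps, chosen only for illustration.
now = datetime(2016, 8, 18, 15, 48)
print(tier_for(datetime(2016, 8, 10), now))  # row inside the 3-week window
print(tier_for(datetime(2016, 6, 1), now))   # row older than 3 weeks
```

A split like this could map to date-based partitions or to separate stores for recent and archived rows. Which of those fits depends on details the post does not give.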
						
					
Labels:
- Apache Hadoop
- Apache Hive
- Apache Solr