Member since
09-12-2015
15
Posts
2
Kudos Received
0
Solutions
03-09-2017
05:31 PM
Scala and Python are both easy to program and help data experts get productive fast. Data scientists often prefer to learn both Scala and Python for Spark but Python is usually the second favourite language for Apache Spark, as Scala was there first.
... View more
10-19-2016
06:20 AM
1 Kudo
I am facing problem where disk writes are very slow due to heavy write operations. Using rsync I am copying logs to logserver and logserver is not able to do faster writes. Any suggestions to make log writes faster? Req. is to have 600+ rsync process to copy logs to logserver and each log folder of more then 8gb and more then 9k files in each log folder.
... View more
10-18-2016
10:43 AM
Problem : Deletion of log folder with below details Log size is around 10gb No . of files around 9k files. Simple rm -rf log folder doesn't works well.
... View more
08-20-2016
12:28 AM
@Constantin Stanca did you down voted the questions? if yes why.
... View more
08-20-2016
12:22 AM
@Sunile Manjee thanks a lot for detailed answer
... View more
08-19-2016
01:10 AM
No I am not sure which one to use? hive or solr so thats the reason I tagged.
... View more
08-18-2016
03:48 PM
I am facing issue of huge data in mysql table which is increasing very fast , so to scale what is the other alternative? Some of the columns are type of blob as they are raw data. So to query faster what is the right approach. Some more background of problem : data is increasing fast need to query 90% of time on some last 3 weeks data older data is not used for query frequently some of the columns are of raw data type i.e blob and they are used frequently to query Finally I have to query data and expose it via rest api So somebody can help me in complete design with some explainations?
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Apache Hive
-
Apache Solr