About sihi_yassine

sihi_yassine · ‎07-03-2018

What if I want to use pandas and Matplotlib? Should I use PySpark?

sihi_yassine · ‎07-02-2018

I checked the list of interpreters installed on my Zeppelin and found that Python is not among them. For now, to run Python commands I use %spark.pyspark. I would like to know whether it is a good idea to use PySpark instead of Python, and whether it is recommended to have the Python interpreter installed even though PySpark works fine for Python code.

sihi_yassine · ‎04-11-2018

The logic is quite simple: 128 MB is a power of 2, which means the number has a very simple binary representation: 128 MB = 131072 KB = 134217728 bytes = 2^27, i.e. 1000000000000000000000000000 in binary. With this number we don't waste any bits when we store data in memory. You could say it is a standard for data storage in computer science in general, not just for big data.
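To double-check the arithmetic, here is a small Python sketch. It assumes binary units (1 MB = 1024 KB = 1024 * 1024 bytes); the variable names are only for illustration.

```python
# 128 MB expressed in bytes, using binary (1024-based) units
block_size_bytes = 128 * 1024 * 1024

# A positive integer is a power of two exactly when it has a single 1 bit
is_power_of_two = block_size_bytes & (block_size_bytes - 1) == 0

# Binary representation without the "0b" prefix
binary = bin(block_size_bytes)[2:]

print(block_size_bytes)   # 134217728
print(is_power_of_two)    # True
print(len(binary) - 1)    # 27, the number of zeros after the leading 1
```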

sihi_yassine · ‎04-04-2018

Awesome, works like a charm

sihi_yassine · ‎04-04-2018

I have a lot of external tables in my Hive warehouse and I would like to drop all of them, along with their data, automatically. How can I do this?
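One possible approach, sketched in Python: generate the HiveQL for each table instead of typing it by hand. This assumes that dropping an external table leaves its data in place, and that marking the table as managed first ('EXTERNAL'='FALSE') makes DROP TABLE remove the data as well; this may behave differently depending on the Hive version, so try it on a test table first. The function name and the table names are hypothetical.

```python
def drop_statements(tables):
    """Build HiveQL that marks each external table as managed, then drops it."""
    statements = []
    for name in tables:
        # Assumption: once the table is managed, DROP TABLE also deletes its data
        statements.append(
            f"ALTER TABLE {name} SET TBLPROPERTIES('EXTERNAL'='FALSE');"
        )
        statements.append(f"DROP TABLE IF EXISTS {name};")
    return statements


# Hypothetical table names, for illustration only
for stmt in drop_statements(["db1.sales_ext", "db1.logs_ext"]):
    print(stmt)
```

The printed statements could then be saved to a file and run with beeline -f, after reviewing the list of tables carefully.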

sihi_yassine · ‎04-01-2018

I have an external table that contains some string columns. Now I need to change the data type of some of those columns, so I used: ALTER TABLE table CHANGE col col type; but this query gives me an error: org.apache.spark.sql.AnalysisException: ALTER TABLE CHANGE COLUMN is not supported for changing column 'id' with type 'StringType' to 'id' with type 'LongType'; Any suggestion would be greatly welcome, thanks.

sihi_yassine · ‎03-26-2018

@Andrea L As Michael Young said, Sqoop doesn't support importing from or exporting to Hive. It's also recommended to use Hive's EXPORT/IMPORT queries to move your data between two Hive instances; check this out: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ImportExport However, the CSV method can cause separator problems, and if there is a lot of data it would have to be gathered into a single CSV file, which is not reassuring.

sihi_yassine · ‎03-20-2018

@Shashank V C Why don't you use "beeline"? I don't think HDFS can tell the difference between an external table and a non-external one.

sihi_yassine · ‎03-01-2018

Hi @hema moger, if your remote server doesn't belong to your cluster, you will have to bring the data onto one of the servers in your cluster and then use "hdfs dfs -put" to put the data on your cluster. In the other case, i.e. the remote server belongs to the cluster, you just need to run "hadoop fs -put" on your CSV file.

sihi_yassine · ‎02-26-2018

@Jay Kumar SenSharma
nc: connect to 10.166.54.12 port 8020 (tcp) failed: Connection refused
tcp 0 0 10.166.54.12:8020 0.0.0.0:* LISTEN 19578/java

Last Visited	‎08-16-2018 01:04 PM

Member Since	‎11-03-2017 05:37 PM
Posts	94
Kudos received	13

Cloudera Community

Re: Why hdfs block size is 128 MB? Why it is not 1...

Re: Zeppelin Notebook Install error

Re: Ambari agent is not exist

Re: Access denied for user 'ambari'@localhost (usi...

Re: Python interpreter not configured in Zeppelin

Python interpreter not configured in Zeppelin

Re: Why hdfs block size is 128 MB? Why it is not 1...

Re: Drop external hive table with data

Drop external hive table with data

change datatype of external table

Re: Sqoop import data from hive to csv.

Re: How to get all the External Hive table details...

Re: Import data from remote server to HDFS

Re: remote access to clusters through hadoop fs -l...