Member since: 04-14-2016
Posts: 54
Kudos Received: 9
Solutions: 2
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 21793 | 06-27-2016 07:20 AM
 | 2001 | 05-09-2016 10:10 AM
06-27-2016
12:11 PM
Thank you, but I had already done this step and needed to handle multiple files.
This is now solved. Thank you.
06-27-2016
07:20 AM
Hello! Thank you very much for your suggestions.
These methods worked, and I found that another very suitable approach is to use a DataFrame. Cordially
06-27-2016
07:16 AM
Hello,
What I would like to know is how I can run a Python script that contains Spark commands. Here is the script that I would like to run in a Python environment:
#!/usr/bin/python2.7
# When this script is submitted with spark-submit, "sc" is not predefined
# (unlike in the interactive pyspark shell), so the SparkContext is created here.
from pyspark import SparkContext
from pyspark.sql import HiveContext

sc = SparkContext()
hive_context = HiveContext(sc)

# Load two Hive tables and register them as temporary tables
qvol1 = hive_context.table("table")
qvol2 = hive_context.table("table")
qvol1.registerTempTable("qvol1_temp")
qvol2.registerTempTable("qvol2_temp")

# Run the SQL query ("request" stands for the query text) and show the result
df = hive_context.sql("request")
df.show()
Labels:
- Apache Spark
06-24-2016
07:50 AM
2 Kudos
Hello,
I would like to read a Hive table from a Python script.
Can you help me, please?
My cordial thanks
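A minimal sketch of one common way to do this from Python with PySpark, assuming a Spark build with Hive support; the table name is a placeholder:
from pyspark import SparkContext
from pyspark.sql import HiveContext

sc = SparkContext()
hive_context = HiveContext(sc)

# Load the Hive table as a DataFrame ("my_db.my_table" is a placeholder name)
df = hive_context.table("my_db.my_table")
df.show()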
Labels:
- Apache Hive
06-22-2016
01:54 PM
Thank you, but I would like to go directly from the CSV file to the Hive ORC table, without creating the textfile data first. Thanks
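A minimal PySpark sketch of going directly from a CSV file to an ORC-backed Hive table, with no intermediate textfile table; it assumes the spark-csv package is available on Spark 1.x (e.g. loaded with --packages), and the file path and table name are placeholders:
from pyspark import SparkContext
from pyspark.sql import HiveContext

sc = SparkContext()
hive_context = HiveContext(sc)

# Read the CSV file into a DataFrame (needs the spark-csv package on Spark 1.x)
csv_df = (hive_context.read
          .format("com.databricks.spark.csv")
          .option("header", "true")
          .option("inferSchema", "true")
          .load("/path/to/data.csv"))

# Save it directly as an ORC-backed Hive table, no intermediate textfile table
csv_df.write.format("orc").saveAsTable("my_orc_table")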
06-22-2016
01:39 PM
Hello, is it possible to import data from a CSV file into a Hive table in ORC format? Thanks
Labels:
- Apache Hive
06-21-2016
08:30 AM
Hello,
Thank you for the pointer. But I'm new to DataFrames, and what I'm trying to do is retrieve the values at indices i and i+1, for example.
Best regards
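A minimal sketch of one way to look at a row together with the following one (indices i and i+1) using the lead window function; the DataFrame df and the ordering column "order_col" are assumptions based on the original post in this thread:
from pyspark.sql import Window
from pyspark.sql import functions as F

# "order_col" is a placeholder for whatever column defines the row order
w = Window.orderBy("order_col")

# blglast_next holds, for row i, the blglast value of row i + 1
paired = df.withColumn("blglast_next", F.lead("blglast", 1).over(w))
paired.show()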
06-14-2016
08:43 AM
Hello,
I would like to iterate over a column of my DataFrame and compute an accumulated (cumulative) value, but I cannot manage it. Can you help me?
Thank you. Here is how my DataFrame is created; I would like to compute the accumulated value of the blglast column and store it in a new column.
from pyspark import SparkContext
from pyspark.sql import HiveContext

sc = SparkContext()
hive_context = HiveContext(sc)

# Load the Hive table and register it as a temporary table
tab = hive_context.table("table")
tab.registerTempTable("tab_temp")

# Keep only the blglast column (limited to 50 rows)
df = hive_context.sql("SELECT blglast FROM tab_temp AS b LIMIT 50")
df.show()
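A minimal sketch of one way to do this with a Spark SQL window function, assuming a Spark version with window-function support (1.4+) and that some column, here the placeholder "order_col", defines the row order for the running total:
from pyspark.sql import Window
from pyspark.sql import functions as F

# "order_col" is a placeholder for whatever column defines the row order
w = Window.orderBy("order_col")

# Running (cumulative) sum of blglast up to and including the current row
df_cum = df.withColumn("blglast_cum", F.sum("blglast").over(w))
df_cum.show()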
Labels:
- Apache Spark
06-13-2016
11:28 AM
Thank you. By adding the option --map-column-hive Date=Timestamp to the Sqoop command, everything works.
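For reference, a sketch of where that option sits in a Sqoop import command; the connection string, credentials and table names below are placeholders:
sqoop import \
  --connect <jdbc-connection-string> \
  --username <user> -P \
  --table MY_SOURCE_TABLE \
  --hive-import \
  --hive-table my_hive_table \
  --map-column-hive Date=Timestamp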
06-10-2016
02:38 PM
Thanks, but it raises another error: "Hive does not support the SQL type for column date". Thanks