Created 09-26-2015 05:18 PM
I am working on a solution where I will have a Hadoop cluster running Hive, and I want to submit jobs and Hive queries from a .NET application for processing and be notified when they are done. What is the recommended API or library for this?
Created 09-26-2015 05:24 PM
After a bit of research, I found that the hadoopsdk on CodePlex is a good place to start.
As far as very basic connection examples go, try this blog for an example, but note that the connection for HDInsight is slightly different now that it all uses the Templeton (WebHCat) interface, so this will get you going:
var db = new HiveConnection(webHCatUri: new Uri("http://localhost:50111"), userName: (string)"hadoop", password: (string)null);
var result = db.ExecuteHiveQuery("select * from w3c");
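Since the question also asks about being notified when a query finishes, here is a rough sketch of calling the Templeton (WebHCat) REST interface directly with HttpClient instead of going through the SDK. It assumes a WebHCat version that exposes the templeton/v1/hive and templeton/v1/jobs/:jobid resources (older releases use queue/:jobid for status) and an unsecured cluster that accepts the user.name parameter. The class name, method names, status directory and callback URL are all made up for illustration and are not part of the SDK.

```csharp
using System;
using System.Collections.Generic;
using System.Net.Http;
using System.Threading.Tasks;

// Hypothetical helper class; not part of the .NET SDK for Hadoop.
public static class WebHCatSketch
{
    // Form fields for POST templeton/v1/hive.
    // "callback" is optional: WebHCat calls that URL when the job completes,
    // and replaces the literal text $jobId in it with the actual job ID.
    public static Dictionary<string, string> BuildHiveJobForm(
        string userName, string hiveQuery, string statusDir, string callbackUrl)
    {
        var form = new Dictionary<string, string>
        {
            ["user.name"] = userName,
            ["execute"] = hiveQuery,
            ["statusdir"] = statusDir
        };
        if (!string.IsNullOrEmpty(callbackUrl))
        {
            form["callback"] = callbackUrl;
        }
        return form;
    }

    // URI of the Hive submission resource, relative to the WebHCat base address.
    public static Uri BuildHiveEndpoint(Uri webHCatBase)
    {
        return new Uri(webHCatBase, "templeton/v1/hive");
    }

    // URI for polling a job's status when a callback is not practical.
    public static Uri BuildJobStatusEndpoint(Uri webHCatBase, string jobId, string userName)
    {
        return new Uri(webHCatBase,
            "templeton/v1/jobs/" + Uri.EscapeDataString(jobId) +
            "?user.name=" + Uri.EscapeDataString(userName));
    }

    // Submits the query and returns the raw JSON response,
    // which contains the ID of the queued job.
    public static async Task<string> SubmitHiveQueryAsync(
        HttpClient http, Uri webHCatBase, string userName,
        string hiveQuery, string statusDir, string callbackUrl)
    {
        var form = BuildHiveJobForm(userName, hiveQuery, statusDir, callbackUrl);
        using (var content = new FormUrlEncodedContent(form))
        using (var response = await http.PostAsync(BuildHiveEndpoint(webHCatBase), content))
        {
            response.EnsureSuccessStatusCode();
            return await response.Content.ReadAsStringAsync();
        }
    }
}
```

A call might look like `await WebHCatSketch.SubmitHiveQueryAsync(http, new Uri("http://localhost:50111"), "hadoop", "select * from w3c", "w3c.output", "http://myapp.example/hive-done?id=$jobId")`, where the callback host is a placeholder for an endpoint your own application exposes. If the cluster cannot reach your application, polling the job status URI is the fallback.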
If you are looking to do full-on MapReduce on HDInsight, then you probably want to take a look at the C# MapReduce examples that come with the SDK on CodePlex.
Note that the default HDInsight install also comes with some good samples, which include a bit of data to play with and some PowerShell scripts and .NET code to get you started.
If there are other recommendations, I am all ears.
Created 12-04-2015 04:13 PM
A couple of links to external resources:
Created 12-07-2015 11:46 AM
For Hive queries, can't you just use the ODBC driver?
Created 07-01-2016 06:16 PM
.NET SDK for Hadoop
https://hadoopsdk.codeplex.com/wikipage?title=Simple%20Linq%20To%20Hive%20Query&referringTitle=Home
.NET Driver for Phoenix
https://github.com/Azure/hdinsight-phoenix-sharp
https://www.nuget.org/packages/Microsoft.Phoenix.Client/1.0.0-preview
.NET for Kafka
https://cwiki.apache.org/confluence/display/KAFKA/Clients#Clients-.net
Mobius: C# Binding for Spark
https://github.com/Microsoft/Mobius
Spark for .NET Developers Introduction
https://msdn.microsoft.com/en-us/magazine/mt595756.aspx
You can also look at https://github.com/MSRCCS/Prajna