1973
Posts
1225
Kudos Received
124
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 1841 | 04-03-2024 06:39 AM | |
| 2857 | 01-12-2024 08:19 AM | |
| 1578 | 12-07-2023 01:49 PM | |
| 2340 | 08-02-2023 07:30 AM | |
| 3224 | 03-29-2023 01:22 PM |
06-17-2016
02:12 AM
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/pig/backend/executionengine/ExecException
at java.lang.Class.getDeclaredMethods0(Native Method) I am running the example on HDP 2.4 and get this error. Investigating logs.
... View more
Labels:
- Labels:
-
Apache Pig
06-17-2016
02:11 AM
Can you post logs, details, versions, schemas
... View more
06-16-2016
11:41 PM
Sometimes it's easy to share files: https://forums.virtualbox.org/viewtopic.php?t=15679 You just pick a directory, set Auto-mount "Yes" and Access to "Full" and hit OK. For some of us, depending on versions of VirtualBox, the VM and host operating system; things might not work. It also can get broken when host operating system or VM updates. Log into the VM as root and try this (if it's HDP sandbox or another running Centos). cd /opt/VBoxGuestAdditions-*/init sudo ./vboxadd setup
modprobe -a vboxguest vboxsf vboxvideo
rm -rf /media/sf_Downloads
mkdir /media/sf_Downloads
mount -t vboxsf Downloads /media/sf_Downloads
For me that worked and my Downloads directory was shared so I could move files to my Sandbox and off for development. There are some other things you can try and certainly rebooting everyone helps. For me, this worked fine.
... View more
Labels:
06-16-2016
08:23 PM
3 Kudos
How will Apache Calcite be used to enhance Apache NiFi's SQL abilities?
... View more
Labels:
- Labels:
-
Apache Calcite
-
Apache NiFi
06-16-2016
02:47 PM
It worked on my raspberry pi 2 but it is a bit slow
... View more
06-16-2016
02:47 PM
It would be pretty easy to write a processor that uses this library: https://github.com/vikrantlabde/iso8583-Java or https://github.com/chochos/j8583 Chris Mazo
... View more
06-16-2016
02:38 PM
6 Kudos
Most code for current big data projects and for the code you are going to write is going to be JVM based (Java and Scala mostly). There is certainly a ton of R, Python, Shell and other languages. For this tutorial we will focus on JVM tools. The great thing about that is that Java and Scala Static Code Analysis Tools will work for analyzing your code. JUnit test are great for testing the basic code and making sure you isolate out functionality from Hadoop and Spark specific interfacing.
General Java Tools for Testing
http://junit.org/
http://checkstyle.sourceforge.net/ http://pmd.github.io/pmd-5.4.2/pmd-java/rules/index.html Testing Hadoop (A Great Overview)
https://github.com/mfjohnson/HadoopTesting https://www.infoq.com/articles/HadoopMRUnit Example: I have a Hive UDF
written in Java that I can test via Junit to ensure that the main functionality
works. (See: UtilTest) import static org.junit.Assert.assertEquals;
import org.junit.Test;
/**
* Test method for
* {@link com.dataflowdeveloper.deprofaner.ProfanityRemover#fillWithCharacter(
* int, java.lang.String)}.
*/
@Test
public void testFillWithCharacterIntString() {
assertEquals("XXXXX", Util.fillWithCharacter(5, "X") );
}
As you can see this is just a plain old JUnit Test, but it's one step in the process to make sure you can test your code before it is deployed. Also Jenkins and other CI tools are great at running JUnits are part of their continuous build and integration process. A great way to test your application is with a small Hadoop cluster or simulated one. Testing against a Sandbox downloaded on your laptop is a great way as well. Testing Integration
with a Mini-Cluster https://github.com/hortonworks/mini-dev-cluster https://github.com/sakserv/hadoop-mini-clusters Testing Hbase
Applications Artem Ervits has a great article on Hbase Unit Testing. https://community.hortonworks.com/repos/15674/variety-of-hbase-unit-testing-utilities.html https://github.com/dbist/HBaseUnitTest Testing Apache NiFi
Processors
http://docs.hortonworks.com/HDPDocuments/HDF1/HDF-1.2.0.1/bk_DeveloperGuide/content/instantiate-testrunner.html http://www.nifi.rocks/developing-a-custom-apache-nifi-processor-unit-tests-partI/ Testing Apache NiFi
Scripts
https://github.com/mattyb149/nifi-script-tester http://funnifi.blogspot.com/2016/06/testing-executescript-processor-scripts.html Testing Oozie
https://oozie.apache.org/docs/4.2.0/ENG_MiniOozie.html Testing Hive Scripts
https://cwiki.apache.org/confluence/display/Hive/Unit+Testing+Hive+SQL http://hakunamapdata.com/beetest-a-simple-utility-for-testing-apache-hive-scripts-locally-for-non-java-developers/ https://github.com/klarna/HiveRunner https://github.com/edwardcapriolo/hive_test http://finraos.github.io/HiveQLUnit/ Testing Hive UDF
http://blog.matthewrathbone.com/2013/08/10/guide-to-writing-hive-udfs.html https://cwiki.apache.org/confluence/display/Hive/PluginDeveloperKit Using
org.apache.hive.pdk.HivePdkUnitTest and org.apache.hive.pdk.HivePdkUnitTests in your Hive plugin so that it will be included in unit tests. Testing Pig Scripts
http://pig.apache.org/docs/r0.8.1/pigunit.html http://www.slideshare.net/Skillspeed/hdfs-and-big-data-tdd-using-pig-unit-webinar http://www.slideshare.net/SwissHUG/practical-pig-and-pig-unit-michael-noll-july-2012 Testing Apache Spark
Applications
http://www.jesse-anderson.com/2016/04/unit-testing-spark-with-java/ https://github.com/holdenk/spark-testing-base http://www.slideshare.net/hkarau/effective-testing-for-spark-programs-strata-ny-2015 https://developer.ibm.com/hadoop/2016/03/07/testing-your-apache-spark-code-with-junit-4-0-and-intellij/ http://www.slideshare.net/knoldus/unit-testing-of-spark-applications Testing Apache Storm
Applications
Debugging an Apache Storm Topology https://github.com/xumingming/storm-lib/blob/master/src/jvm/storm/TestingApiDemo.java
... View more
Labels:
06-15-2016
10:50 PM
2 Kudos
Cdc is cool and coming
... View more
06-15-2016
08:49 PM
Kind of a start https://github.com/esasisa/UDF A Clojure Lein template https://clojars.org/simpleudf/lein-template You can work with UDFs in Spark, which is interesting. https://blog.knoldus.com/2016/04/19/udf-overloading-in-spark/
... View more