Created 10-09-2016 01:54 PM
Hi, let me briefly describe my problem:
Initial task was to try some NLP practices in close real-world problems. We decided to start with simple classification problem - predict a genre for a music lyric. We had a strong requirement of using Java, Spark 2.0.0 with ML library (not MLLib). ML library has a limited number of algorithms, so we started with binary classification for 2 genres and simple pipeline with Word2Vec and Logistic Regression. It showed acceptable results. Than we decided to add one genre. So we had to some other algorithm, because Logistic Regression works only for binary problems. So we've tried 3 approaches:
Maybe you could recommend some other approach based on your experience? Any help is very appreciated. I have to reming that we're strongly tied to Spark 2.0.0 + ML library due to DevOps infrastructure.
Created 10-25-2016 05:31 PM
Have you tried DeepLearning4J with Spark? or H2O with Spark.
Both of those are very sophisticated NLP.
https://dzone.com/articles/in-progress-natural-language-processing
Created 10-25-2016 05:31 PM
Have you tried DeepLearning4J with Spark? or H2O with Spark.
Both of those are very sophisticated NLP.
https://dzone.com/articles/in-progress-natural-language-processing
Created 01-26-2017 02:46 PM
Created 01-26-2017 02:58 PM